Hi Monty, I apologize for not getting back to you sooner as I was on vacation last week. Anyway, below may be a better explanation of what we are trying to accomplish and may provide you with answers to your questions. This was provided from our senior developer who also provided the information above:
Senior Developer:
No, we are not using ‘thread pool’. Our systems tend to be ‘batch’ processing orientated and establish many SQL queries that are individually treated as a ‘open cursor’ where the results are iterated across by the client application.
Yes, Please!
Having the ‘gettid()’ value present in the information_schema.PROCESSLIST would greatly help achieve the association between a Unix LWP child of the mysqld (visible through the tools used by Sys Admins & DBAs) and the offending SQL query. Once the offending SQL query is identified, the originating process can be tracked down.
Regarding the display of gettid() vs pthread_self(), in the info_schema.PROCESSLIST, this seems to be platform specific where the value displayed should correlate to the output of tools used by Sys Admins (unix: ps -ef, htop, top, pgrep; Windows: taskmgr, plist). However, IF you know of a means to translate the pthread_self() value into those reported by unix Sys Admin tools (ps -efL, htop, top), then I think everyone would desire the POSIX standard value.
To facilitate our DBAs, our developers are required to embedded comments into the SQL queries that identify the originating process. Ex: “-- Program:$0 Host:$HOSTNAME ProgPid:$$ Params:$* DESC:Customer Cursor --\n SELECT x,y,z FROM table WHERE filter = ?” – Where $$ is the unix pid of the process/script.
Ultimately, the problem we are experiencing is a random MySQL LWP thread that sustains 100% busy for many minutes, sometimes hours. This is very visible from the unix tools ‘top’, ‘htop’ and ‘ps -efL’. However, we have not been able to identify which SQL query to which the mysqld LWP thread is servicing. During this situation, the info_schema.PROCESSLIST does not report any state or activity changes, thus we are unable to identify the offending SQL query. Once the offending SQL query is identified, the affected processes can be identified and debugged.
Thanks, Harry
Following is a suggestion, provided by one of our senior developers, which could be used to accomplish this feature request.
There is a code suggestion, made by Chrisian Hammers, 2008, back for MySQL 5.0.x,
documented in MySQL feature request:
http://bugs.mysql.com/bug.php?id=33799.
However, I have a correction for the 'FIXME' line:
- thd_info->posix_thread_id = *((unsigned int *)(tmp->real_id + 72)); // FIXME this is from glibc-2.6.1/nptl/descr.h
+ thd_info->posix_thread_id = gettid(); // #include <sys/types.h>
The PID &/or Thread IDs are easily available, but only to the parent process and child thread or process.
A) From the parent process perspective, the child PID or thread_id is only made available during thread creation:
The return code from fork(), is the PID of the child process. (Doc here http://pubs.opengroup.org/onlinepubs/007908799/xsh/fork.html)
The first parameter from the POSIX function pthread_create(). (Doc here http://pubs.opengroup.org/onlinepubs/007908799/xsh/pthread_create.html)
The sixth parameter from the Solaris function thr_create(). (Doc here http://docs.oracle.com/cd/E19253-01/816-5137/sthreads-69878/index.html)
B) From within the child Thread/Lwp (via pthread_create()), the thread_id is made available through:
The return code from POSIX function, gettid().
EX:
#include <sys/types.h>
pid_t gettid(void);
The return code from syscall(SYS_gettid), where SYS_gettid is an enum.
EX:
#include <sys/types.h>
#include <sys/syscall.h>
#define gettid() syscall(SYS_gettid);
The return code from Solaris function, thr_self().
EX:
#include <thread.h>
thread_t thr_self();
C) From within the child process (via fork()), the PID is made available, at anytime through:
The return code from getpid().
EX:
#include <unistd.h>
pid_t getpid();