[MXS-217] MaxScale fatal signal 11 Created: 2015-06-23 Updated: 2015-11-30 Resolved: 2015-11-30 |
|
| Status: | Closed |
| Project: | MariaDB MaxScale |
| Component/s: | Core, mariadbbackend, readwritesplit |
| Affects Version/s: | 1.1.1 |
| Fix Version/s: | 1.3.0 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Alex Lee | Assignee: | Johan Wikman |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
CentOS 5.4(Final) |
||
| Attachments: |
|
| Sprint: | 10.1.8-2 |
| Description |
|
2015-06-17 11:39:55 Fatal: MaxScale received fatal signal 11. Attempting backtrace. |
| Comments |
| Comment by Dipti Joshi (Inactive) [ 2015-06-23 ] | |||||||||||||||||||
|
Alex Lee Please provide this information Exact version of the OS. Then we will provide you a debug build to produce core with and analyze the core for addressing the crash. Thanks | |||||||||||||||||||
| Comment by Alex Lee [ 2015-06-24 ] | |||||||||||||||||||
|
[root@localhost ~]# cat /etc/issue If you need a coredump file I will provide it. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-06-24 ] | |||||||||||||||||||
|
Alex Lee Here you can download MaxScale debug build as below
Then do following to reinstall maxscale
Next, enable coredump as below
After this run your traffic that causes fatal singal 11, you should find the core file in /tmp directory. It will be named core-maxscale-* Also include all the log files for MaxScale. | |||||||||||||||||||
| Comment by Alex Lee [ 2015-06-25 ] | |||||||||||||||||||
|
Thank you for support. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-06-25 ] | |||||||||||||||||||
|
Alex Lee Have you been able to produce core dump ? | |||||||||||||||||||
| Comment by Alex Lee [ 2015-06-25 ] | |||||||||||||||||||
|
No. I have not. Fatal signal 11 have not occured since I reinstalled MaxScale debug build rpm package. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-06-26 ] | |||||||||||||||||||
|
Alex Lee Can you please reinstall the debug build ? - the earlier debug build did not have useful debug symbol. Just do Then do following to enable coredump
After this run your traffic that causes fatal singal 11, you should find the core file in /tmp directory. It will be named core-maxscale-* | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-06-28 ] | |||||||||||||||||||
|
Alex Lee Could you also provide us the version of your backend database ? As well as the log files from MaxScale ? | |||||||||||||||||||
| Comment by Alex Lee [ 2015-06-29 ] | |||||||||||||||||||
|
Sure. After i discuss with my boss i will provide you them. | |||||||||||||||||||
| Comment by Alex Lee [ 2015-06-30 ] | |||||||||||||||||||
|
The new debug build has a problem. Maxscale daemon is suddenly downed after several hours and can not any clue in log files. When daemon is downed, coredump file is not made. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-06-30 ] | |||||||||||||||||||
|
Try yum remove maxscale Then start maxscale service with core enabled. | |||||||||||||||||||
| Comment by Alex Lee [ 2015-07-03 ] | |||||||||||||||||||
|
I reinstalled maxscale to follow your direction. 2015-07-03 11:05:01 Client error event handling. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-07-08 ] | |||||||||||||||||||
|
Alex Lee Have you been able to reproduce the crash ? | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-08-16 ] | |||||||||||||||||||
|
markus makela Can you analyze the stack trace and pin point location of crash ? Thanks, | |||||||||||||||||||
| Comment by markus makela [ 2015-08-20 ] | |||||||||||||||||||
|
This is a crash in the embedded library. The bug should be assigned to the server team for further analysis. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-08-20 ] | |||||||||||||||||||
|
Even though it is in embedded library it is being called from MySQLBackend protocol module - can we locate where we are calling the embeded library in this stack trace ? johan.wikman | |||||||||||||||||||
| Comment by markus makela [ 2015-08-21 ] | |||||||||||||||||||
|
The replace_mysql_users calls the getUsers function. This function has two calls to mysql_store_result at line 1279 and line 1385 in the 1.1.1 version of the source code. The first returns the result for the number of users, the second returns the actual user data. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-08-23 ] | |||||||||||||||||||
|
ratzpoPlease get connector team to work with Johan, Markus to diagnose and find fix for this. | |||||||||||||||||||
| Comment by Alexey Botchkov [ 2015-08-24 ] | |||||||||||||||||||
|
Just to note - it fails in client-server operation, not the internal embedded-server one. The embedded-server library supposed to work as the ordinary client library here. | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-09-08 ] | |||||||||||||||||||
|
johan.wikman, holyfoot Has there been any further insight into this ? | |||||||||||||||||||
| Comment by Johan Wikman [ 2015-09-09 ] | |||||||||||||||||||
|
I analyzed this again and reached the same conclusion as Markus, that is, it is in either of the mysql_store_result calls in dbusers.c@getUsers where the crash occurs. A strange thing is that the addresses of the strack trace do not to match the binaries of CentOS 5, and not CentOS 6 or CentOS 7 either. Considering that replace_mysql_users is executed frequently, i.e. it is not a function that would be called only under exceptional circumstances, unless we get core files and/or detailed instructions for how to make the problem appear, it is very hard to do something about it. | |||||||||||||||||||
| Comment by Alexey Botchkov [ 2015-09-09 ] | |||||||||||||||||||
|
I have one idea about the possible problem. Can you tell what version of libmysqld is the MaxScale compiled against? | |||||||||||||||||||
| Comment by Dipti Joshi (Inactive) [ 2015-09-12 ] | |||||||||||||||||||
|
johan.wikman, Have we provided version of libmysqld used to complie MaxScale to Holyfoot ? | |||||||||||||||||||
| Comment by Johan Wikman [ 2015-09-14 ] | |||||||||||||||||||
|
Yes | |||||||||||||||||||
| Comment by markus makela [ 2015-09-25 ] | |||||||||||||||||||
|
I've managed to get this crash in the embedded library on the release-1.2.1 (commit 746dcd4111999aebd67eb7f397720dddae00d706) branch with the following embedded library:
I was doing an insert of 775353 rows when the crash happened. It seems interrupting the mysql command line client triggers this. | |||||||||||||||||||
| Comment by Johan Wikman [ 2015-11-30 ] | |||||||||||||||||||
|
Although it seems that this may occasionally occur, as it cannot be reproduced on purpose it is very hard to do something about. |