[MDEV-29098] mariadb crash on shutdown with signal 11 Created: 2022-07-13 Updated: 2024-01-05 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | Server |
| Affects Version/s: | 10.5.13 |
| Fix Version/s: | 10.5 |
| Type: | Bug | Priority: | Major |
| Reporter: | Matthew Walker | Assignee: | Oleksandr Byelkin |
| Resolution: | Unresolved | Votes: | 1 |
| Labels: | None | ||
| Attachments: |
|
| Description |
|
On shutdown mariadb logs "InnoDB: FTS optimize thread exiting." then gets a signal 11 error after 3 minutes. Seems to be a regression of
|
| Comments |
| Comment by Daniel Black [ 2022-07-14 ] | ||||||||||||||||||||||||
|
Matthwal thanks for the bug report. Is this happening every shutdown? If you upgrade from a previous version, was it happening on the previous version? Did it occur immediately after upgrading to 10.5.13? I'm hoping you can provide more details related to this bug report as currently there is too few details to go off attempting a resolution on this. Was there addition messages below what you have put already, a more detailed stack trace would help narrow it down. If a core dump was saved, locally or via a handler like systemd's coredumpctl extracting a backtrace using gdb after installing debug symbol packages (earlier on same link) would be useful. Was there a reason you listed the wsrep component? Have you tried, or are you willing to try a 10.5.16 version? | ||||||||||||||||||||||||
| Comment by Matthew Walker [ 2022-07-14 ] | ||||||||||||||||||||||||
|
>Is this happening every shutdown? >was it happening on the previous version >Did it occur immediately after upgrading to 10.5.13? >Was there addition messages below what you have put already >If a core dump was saved >Was there a reason you listed the wsrep component? >Have you tried, or are you willing to try a 10.5.16 version? Hope this info helps, I'll try to get that BT ASAP. If theres any other info I can grab, I'd be happy to. | ||||||||||||||||||||||||
| Comment by Daniel Black [ 2022-07-15 ] | ||||||||||||||||||||||||
|
Looking forward to the backtrace. I looked at what comes between the "InnoDB: FTS optimize thread exiting." message and the next message "InnoDB: Starting shutdown..." (that didn't arrive), and its a) the pre-shutdown of other plugins (probably less likely but if you've got non-default installed plugins please say so), or b) the srv_purge_shutdown, shutdown of the purge threads. Making the assumption that its the purge shutdown for now, do you have innodb_fast_shutdown set to a non-default 1, or any other non-default innodb purge settings. Do you have any global status information before shutdown, particularly innodb, and those like Innodb_history_list_length relevant to the purge. I haven't found any duplicate bug reports to indicate a later version fixes this (based on purge related segfaults, | ||||||||||||||||||||||||
| Comment by Daniel Black [ 2022-07-15 ] | ||||||||||||||||||||||||
|
Note with the stacktrace, some data structures around the SEGV would be appreciated even if you are guessing these. Please keep the gdb session handy as with a stack trace we can provide better guidance as to what data might be useful to determine how it got into this state. | ||||||||||||||||||||||||
| Comment by Matthew Walker [ 2022-07-18 ] | ||||||||||||||||||||||||
|
Still working on getting the backtrace, having issues becuase this is an older version of our product, so it's hard to get the exact binary that failed. >if you've got non-default installed plugins please say so >do you have innodb_fast_shutdown set to a non-default 1 innodb-buffer-pool-size=384M | ||||||||||||||||||||||||
| Comment by Matthew Walker [ 2022-07-19 ] | ||||||||||||||||||||||||
|
Still having a bit of trouble getting a full back trace, I'll paste what I have #0 0x000014adfdb47241 in ?? () | ||||||||||||||||||||||||
| Comment by Daniel Black [ 2022-07-20 ] | ||||||||||||||||||||||||
|
Try going up to the frame 4, and p *current_thd. It might be null as its in shutdown, and if not will contain a query and possibly a small bit of senstive data so look closely before pasting. Other global variables that might be saved in the core that might describe some current state:
| ||||||||||||||||||||||||
| Comment by Matthew Walker [ 2022-07-20 ] | ||||||||||||||||||||||||
|
Here's the info on frame 4, I'm not seeing anything for current_thd unfortunatly. (gdb) info frame 4 I've collected the output for those vars, some of them are quite long, so I've put each into a text file titled the var name, and packaged them so this comment doesn't become unreadable. variable_output.tar Hope this helps! | ||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2022-09-26 ] | ||||||||||||||||||||||||
|
I do not know if this might be related, but I just came across to this shutdown crash of a 10.6-based branch:
The output suggests that some thread crashed while InnoDB shutdown was still in progress in other threads. According to the output, InnoDB did shut down successfully despite the crash that occurred somewhere outside InnoDB. | ||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2022-09-26 ] | ||||||||||||||||||||||||
|
The ERR_remove_thread_state() in "my" crash is defined in OpenSSL and invoked via a macro at the end of handle_one_connection():
I do not currently think that the crash involves InnoDB at all. | ||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2024-01-05 ] | ||||||||||||||||||||||||
|
Based on the contents of variable_output.tar |