[MDEV-22209] InnoDB repeatedly crashes upon startup with crash recovery Created: 2020-04-10  Updated: 2020-09-21  Resolved: 2020-09-21

Status: Closed
Project: MariaDB Server
Component/s: Storage Engine - InnoDB
Affects Version/s: 10.3.16
Fix Version/s: N/A

Type: Bug Priority: Major
Reporter: Alexander Keremidarski Assignee: Alexander Keremidarski
Resolution: Incomplete Votes: 1
Labels: need_feedback


 Description   

Customer reports server crashing repeatedly during crash recovery. The cause of the initial crash is still unknown, but during subsequent restarts it crashes with:

2020-04-09 21:36:15 0 [Note] InnoDB: Starting final batch to recover 11194 pages from redo log.
200409 21:36:15 [ERROR] mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.

To report this bug, see https://mariadb.com/kb/en/reporting-bugs

We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.

Server version: 10.3.16-MariaDB-log
key_buffer_size=67108864
read_buffer_size=131072
max_used_connections=0
max_threads=1002
thread_count=0
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 2268410 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.

Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x0 thread_stack 0x40000
/usr/sbin/mysqld(my_print_stacktrace+0x2e)[0x562a0648764e]
/usr/sbin/mysqld(handle_fatal_signal+0x30f)[0x562a05f28dff]
sigaction.c:0(__restore_rt)[0x7f2fd0214680]
/usr/sbin/mysqld(+0x99b350)[0x562a0613d350]
/usr/sbin/mysqld(+0x99aa7d)[0x562a0613ca7d]
/usr/sbin/mysqld(+0x99b26a)[0x562a0613d26a]
/usr/sbin/mysqld(+0x978eea)[0x562a0611aeea]
/usr/sbin/mysqld(+0x979e23)[0x562a0611be23]
/usr/sbin/mysqld(+0x4c8739)[0x562a05c6a739]
/usr/sbin/mysqld(+0xab2ac8)[0x562a06254ac8]
/usr/sbin/mysqld(+0xb19431)[0x562a062bb431]
/usr/sbin/mysqld(+0xa25cf8)[0x562a061c7cf8]
/lib64/libpthread.so.0(+0x7dd5)[0x7f2fd020cdd5]
/lib64/libc.so.6(clone+0x6d)[0x7f2fce5afb3d]
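
The mysqld frames in the backtrace above are unresolved (only offsets such as +0x99b350 are printed). A minimal sketch, not part of the original report, of how those offsets could be extracted and handed to GNU addr2line follows. It assumes the exact 10.3.16 binary and matching debug symbols are available at /usr/sbin/mysqld; the function names are hypothetical. It also derives the load base from each frame as a sanity check that all frames belong to the same mapping.

```python
import re

# Matches symbol-less frames such as "/usr/sbin/mysqld(+0x99b350)[0x562a0613d350]".
FRAME_RE = re.compile(
    r"^(?P<binary>\S+)\(\+0x(?P<offset>[0-9a-f]+)\)\[0x(?P<addr>[0-9a-f]+)\]$"
)

# A few frames copied verbatim from the backtrace in this report.
SAMPLE = """\
/usr/sbin/mysqld(handle_fatal_signal+0x30f)[0x562a05f28dff]
/usr/sbin/mysqld(+0x99b350)[0x562a0613d350]
/usr/sbin/mysqld(+0x99aa7d)[0x562a0613ca7d]
/usr/sbin/mysqld(+0x4c8739)[0x562a05c6a739]
/lib64/libpthread.so.0(+0x7dd5)[0x7f2fd020cdd5]
"""


def unresolved_frames(log_text, binary="/usr/sbin/mysqld"):
    """Return (offset, load_base) pairs for symbol-less frames of `binary`."""
    frames = []
    for line in log_text.splitlines():
        m = FRAME_RE.match(line.strip())
        if m and m.group("binary") == binary:
            offset = int(m.group("offset"), 16)
            addr = int(m.group("addr"), 16)
            # Runtime address minus file offset gives the load base.
            frames.append((offset, addr - offset))
    return frames


def addr2line_command(frames, binary="/usr/sbin/mysqld"):
    """Build an addr2line argument list (-f: function names, -C: demangle)."""
    return ["addr2line", "-e", binary, "-f", "-C"] + [hex(o) for o, _ in frames]


if __name__ == "__main__":
    frames = unresolved_frames(SAMPLE)
    print(" ".join(addr2line_command(frames)))
```

The printed command only yields useful names if the binary and debuginfo match the crashed server build; otherwise the core dump and gdb remain the more reliable route.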



 Comments   
Comment by Marko Mäkelä [ 2020-05-06 ]

salle, can you try to get the following from the core dump (or by starting up the recovery in gdb on the corrupted data directory):

thread 1
frame 10
print/x bpage.id

Then, armed with this, start recovery with a debug build of mysqld, and invoke

mysqld --debug=d,ib_log

and then filter the output for the page number. If it is a clustered index page (this we could find out by checking if PAGE_INDEX_ID on the page corresponds to the SYS_INDEXES.ID of the clustered index), then this corruption most likely was caused by MDEV-19916.
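
The PAGE_INDEX_ID check mentioned above could be sketched as follows. This is an illustration under stated assumptions, not a tool from the MariaDB source: it assumes an uncompressed, unencrypted tablespace file with the default 16 KiB page size, and the helper names are hypothetical. The offsets follow the InnoDB page layout (FIL_PAGE_OFFSET at byte 4, FIL_PAGE_TYPE at byte 24, PAGE_INDEX_ID at PAGE_HEADER + 28 = byte 66, all big-endian).

```python
import struct

PAGE_SIZE = 16384        # assumption: default innodb_page_size
FIL_PAGE_OFFSET = 4      # page number, 4 bytes
FIL_PAGE_TYPE = 24       # page type, 2 bytes
FIL_PAGE_INDEX = 17855   # page type value of a B-tree index page
PAGE_INDEX_ID = 38 + 28  # PAGE_HEADER + PAGE_INDEX_ID, 8 bytes


def read_page(path, page_no, page_size=PAGE_SIZE):
    """Read one raw page from a tablespace file (hypothetical helper)."""
    with open(path, "rb") as f:
        f.seek(page_no * page_size)
        data = f.read(page_size)
    if len(data) != page_size:
        raise ValueError("page %d is not fully present in %s" % (page_no, path))
    return data


def parse_index_page(page):
    """Extract the page number, page type and (for index pages) the index ID."""
    if len(page) < PAGE_INDEX_ID + 8:
        raise ValueError("buffer too short to hold a page header")
    (page_no,) = struct.unpack_from(">I", page, FIL_PAGE_OFFSET)
    (page_type,) = struct.unpack_from(">H", page, FIL_PAGE_TYPE)
    (index_id,) = struct.unpack_from(">Q", page, PAGE_INDEX_ID)
    is_index = page_type == FIL_PAGE_INDEX
    return {
        "page_no": page_no,
        "page_type": page_type,
        "is_index_page": is_index,
        # PAGE_INDEX_ID is only meaningful on B-tree index pages.
        "index_id": index_id if is_index else None,
    }
```

The returned index_id would then be compared with the SYS_INDEXES.ID of the table's clustered index. Note that during an interrupted recovery the page image on disk may be older than what the redo log would produce, so the value read this way is only a hint.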

Comment by Marko Mäkelä [ 2020-07-24 ]

I would strongly suspect that this problem was fixed by MDEV-19916 in MariaDB 10.3.17.
