[MDEV-30989] MariaDB InnoDB Deadlock after upgrading to 10.6.12 Created: 2023-04-03 Updated: 2023-06-12 Resolved: 2023-06-12 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Storage Engine - InnoDB |
| Affects Version/s: | 10.6.12 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Critical |
| Reporter: | Sebastian Stamm | Assignee: | Marko Mäkelä |
| Resolution: | Duplicate | Votes: | 1 |
| Labels: | None | ||
| Environment: |
Oracle Linux 8.5 |
||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Description |
|
One week after updating from 10.6.11 to 10.6.12, the database hung. systemctl was not able to kill or restart the service (I had to use sudo kill ......). Some days later it happened again; the Engine Status output and a stack trace from gdb are attached. I also opened a support case: CS0555748 |
| Comments |
| Comment by Marko Mäkelä [ 2023-04-03 ] | |||||||||||
|
sstamm, thank you for the report. This could be a duplicate of
For a deeper analysis, in case you saved a core dump of the hung process, could you share the output of thread apply all backtrace full from the same hang? Or at least the output of the following:
| |||||||||||
| Comment by Sebastian Stamm [ 2023-04-03 ] | |||||||||||
|
I have to wait for the next occurrence, but I will try to get the output of thread apply all backtrace full. | |||||||||||
| Comment by Sebastian Stamm [ 2023-04-22 ] | |||||||||||
|
Here it is: gdb-2023-04-22.7z | |||||||||||
| Comment by Marko Mäkelä [ 2023-04-24 ] | |||||||||||
|
The file gdb-2023-04-22.txt in gdb-2023-04-22.7z
in a re-entrant call to btr_cur_pessimistic_index(). A shared latch on the block descriptor 0x7f094c0c8f00 is being held by Thread 49 and Thread 16, both executing btr_cur_t::search_leaf(). Thread 49 is waiting for
Thread 16 is waiting for
Both these blocks are being held by Thread 51:
We can see exclusive latches held on both block descriptors by Thread 51: object = 0x7f094c0c8e60, type = MTR_MEMO_PAGE_X_FIX and object = 0x7f3404024680, type = MTR_MEMO_PAGE_X_FIX. That is, Thread 51 is blocking both threads that are holding a shared latch on a block on which it is waiting for an exclusive latch. This deadlock was fixed in |
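
The wait-for relationships described in this comment can be sketched as a small graph search. This is only an illustrative model in Python, not InnoDB code: the names HELD, WAITING, wait_for_graph and find_cycle are hypothetical, and the pairing of Thread 49 and Thread 16 with one specific exclusively latched block each is an assumption, since the comment only says that both blocks are held by Thread 51.

```python
# Latch holders per block descriptor, as stated in the comment.
HELD = {
    "0x7f094c0c8f00": {49, 16},  # shared latch held by Thread 49 and Thread 16
    "0x7f094c0c8e60": {51},      # MTR_MEMO_PAGE_X_FIX held by Thread 51
    "0x7f3404024680": {51},      # MTR_MEMO_PAGE_X_FIX held by Thread 51
}

# Which block each thread is waiting for.
WAITING = {
    49: "0x7f094c0c8e60",  # ASSUMED pairing, not stated in the comment
    16: "0x7f3404024680",  # ASSUMED pairing, not stated in the comment
    51: "0x7f094c0c8f00",  # Thread 51 wants an exclusive latch here
}


def wait_for_graph(held, waiting):
    """Map each waiting thread to the set of threads holding the block it wants."""
    return {
        thread: held.get(block, set()) - {thread}
        for thread, block in waiting.items()
    }


def find_cycle(graph):
    """Depth-first search; return one cycle as a list of thread ids, or None."""
    path, done = [], set()

    def dfs(node):
        if node in done:
            return None
        if node in path:
            # Reached a thread already on the current path: that is a cycle.
            return path[path.index(node):] + [node]
        path.append(node)
        for nxt in sorted(graph.get(node, ())):
            cycle = dfs(nxt)
            if cycle:
                return cycle
        path.pop()
        done.add(node)
        return None

    for start in sorted(graph):
        cycle = dfs(start)
        if cycle:
            return cycle
    return None


if __name__ == "__main__":
    print(find_cycle(wait_for_graph(HELD, WAITING)))  # [16, 51, 16]
```

Under these assumptions the search reports that Thread 16 waits for Thread 51, which in turn waits for Thread 16 (and likewise for Thread 49), so none of the three threads can make progress.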