[MDEV-24035] Failing assertion: UT_LIST_GET_LEN(lock.trx_locks) == 0 causing disruption and replication failure Created: 2020-10-27 Updated: 2024-02-06 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | Replication, Storage Engine - InnoDB |
| Affects Version/s: | 10.5.5, 10.5.6, 10.11.3, 10.6.12, 10.11.6 |
| Fix Version/s: | 10.6, 10.11, 11.0, 11.1, 11.2 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Martin Nix | Assignee: | Sergei Petrunia |
| Resolution: | Unresolved | Votes: | 4 |
| Labels: | crash | ||
| Environment: |
Centos 7.8.2003 |
||
| Attachments: |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Issue Links: |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Description |
|
See below error dump, happens sporadically (every 2 to 3 weeks) resulting in dropping of all connections and evacuation of cache taking around 20 seconds to recover. In each case it is a random simple select that causes the issue. All tables are innodb and have been mysqlcheck'd with no issue found. Issue did not occur under 10.4 but coincided with move to 10.5.5. System is not under excessive load at the time the problems manifests. Recent upgrade to 10.5.6 has not resolved the issue IMPORTANT : Knock on effect for replication slaves is duplicate records causing replication to fail - so this effectively looks like two bugs (duplicate updates can be skipped). I believe the assertion failure handling may be causing this secondary serious issue : 1) assertion failure on master host
mysqld --print-defaults
|
| Comments |
| Comment by Alice Sherepa [ 2020-10-27 ] |
|
Please provide the steps to reproduce or describe the steps you took in attempt to reproduce. + the output of SHOW CREATE TABLE lookup_permissions_grid; |
| Comment by Martin Nix [ 2020-10-27 ] |
|
Cannot reproduce on demand - it occurs when it occurs, there appears to be no pattern to behaviour. Each time is also a different table. The table below has a total of 57 records CREATE TABLE `lookup_permissions_grid` ( |
| Comment by Alice Sherepa [ 2020-11-10 ] |
|
Do you happen to use sequences, by any chance? There was a bug with a similar assertion( |
| Comment by Martin Nix [ 2020-11-11 ] |
|
We do not use sequences at all Alice |
| Comment by Elena Stepanova [ 2020-12-15 ] |
|
Let's track it in |
| Comment by Marko Mäkelä [ 2023-01-19 ] |
|
Apparently, vlad.lesin has debugged and fixed locking issues between replication and InnoDB in the past. |
| Comment by Rick Pizzi [ 2023-02-15 ] |
|
Just got a core out of this crash. |
| Comment by Marko Mäkelä [ 2023-05-15 ] |
|
This is a long shot, but I wonder if |
| Comment by Vladislav Lesin [ 2023-05-15 ] |
|
marko, I think, yes, |
| Comment by Marko Mäkelä [ 2023-05-19 ] |
|
In case |
| Comment by Ivan [ 2023-08-01 ] |
|
I'm seeing this error every 15-20 minutes on our server version 10.6.14. Is there any additional information I could provide to help debug the problem? I understand there might be a fix of Do you have any hint on what can we check / change to improve our situation? |
| Comment by Sergei Golubchik [ 2023-08-16 ] |
|
Rey, 10.6.15 is released. Please, add a comment if this error will keep happening in 10.6.15 and we'll reopen this issue. |
| Comment by Marko Mäkelä [ 2023-08-17 ] |
|
According to comments in |
| Comment by Marko Mäkelä [ 2023-08-17 ] |
|
In MDEV-31941 there is an example where a SELECT that is executed ‘inside’ LOCK TABLES can cause this assertion failure. |
| Comment by Marko Mäkelä [ 2023-09-04 ] |
|
Theoretically, 10.6 and later major versions could hit this assertion due to some other bug than 10.5 or earlier major versions. |
| Comment by Vladislav Lesin [ 2023-09-26 ] |
|
I am not sure this is a duplicate of |
| Comment by Martin Nix [ 2023-09-26 ] |
|
Not reproducible since moving from 10.6.8 to 10.6.11 (running reliably for almost a year now) Have since moved to 10.11 on rocky 8 and despite some trouble over swap necessity (kernel tuning) seem all good |
| Comment by Marko Mäkelä [ 2023-09-26 ] |
|
Based on recent progress (reproducing and fixing I think that we will need feedback (not any longer from marnixgb, but from our support customers) to find out if there are further causes of this bug that have not yet been fixed. |
| Comment by Michael Widenius [ 2023-11-05 ] |
|
What is the state of this MDEV? |