[MDEV-23693] Failing assertion: my_atomic_load32_explicit(&lock->lock_word, MY_MEMORY_ORDER_RELAXED) == X_LOCK_DECR Created: 2020-09-08 Updated: 2020-12-14 Resolved: 2020-10-27 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Storage Engine - InnoDB |
| Affects Version/s: | 10.3.24 |
| Fix Version/s: | 10.2.35, 10.3.26, 10.4.16, 10.5.7 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Geza Lucz | Assignee: | Thirunarayanan Balathandayuthapani |
| Resolution: | Fixed | Votes: | 2 |
| Labels: | rr-profile | ||
| Environment: |
Centos 7 / CloudLinux 7 |
||
| Issue Links: |
|
||||||||||||||||||||||||||||||||||||
| Description |
|
I started seeing the following crash on August 26th. Since I received 4 of them. I don't have any hardware errors and nothing else suspicious is logged. InnoDB is not complaining about data corruption etc.. I only get this assertion failure and then a crash.
I now also think that it is data related. I was running 10.3.24 for 2 weeks before getting the first crash. Since then It came once every 2 days. I rolled back to 10.3.23 4 days ago, and I have not had crashes since... it's probably too early to say but it seems like only 10.3.24 is affected. I run a lot of 10.0 and 10.1 versions and they don't crash out with the latest updates. |
| Comments |
| Comment by Ross Campbell [ 2020-09-15 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
I am on Ubuntu 18.04 LTS, and upgraded to 10.3.24 (from from 10.3.23+maria~bionic) on Sep 9th. I immediately started seeing this error in my automated test suite several times per day. What exactly triggers it seems to be random; sometimes during the database load (before tests have begun) and sometimes during the tests themsevles. I can't find a way to reproduce it reliably, although I honestly haven't tried all that hard. This definitely did NOT occur on 10.3.23 - my test suite runs at minimum once per day on a timer, and far more than that during active development, so I'm sure I would have seen it. We had been running 10.3.23 since July 13th 2020 with zero crashes. Update: Due to my test suite failing most of time, I downgraded to 10.3.23 and this crash immediately went away. I would be happy to provide more information or any other assistance is troubleshooting this issue, please just let me know if I can help. In the meantime, I am strongly considering moving to 10.5. Logs are as follows:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Robert Klikics [ 2020-09-29 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
We're facing the same issue since 10.3.24 on CentOS 7.8 with MariaDB from the official repos. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-09-29 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
This might be related to the regression | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Ulrich Moser (Inactive) [ 2020-10-01 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Since this is a bug that also affects users of the community edition I would expect it to be available there too. Is there a bugfix version >10.3.24 on the way to be released? | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Daniel Black [ 2020-10-05 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Is https://github.com/mariadb-corporation/MariaDBEnterprise/commit/5a73c44cf08259af41f952415d2fbeaf5bf44e7c (that I can't read) in the community branches? It seemed the only one not by thiru | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-10-05 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
danblack, https://github.com/mariadb-corporation/MariaDBEnterprise/commit/5a73c44cf08259af41f952415d2fbeaf5bf44e7c is the MariaDB Enterprise Server 10.5.5-3 version of the first of the two
In 10.5 the change was this much simpler than in 10.2 due to Because the MariaDB Enterprise Server releases occur some time after the Community Server releases that they are based on, some bug fixes that missed the underlying Community Server release may be included. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Matthias Leich [ 2020-10-16 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-10-20 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Bug #31036301 ASSERTION FAILURE: SYNC0RW.IC:429:LOCK->LOCK_WORD in MySQL 5.7.32 might be addressing this issue. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-10-23 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
The port of the fix looks correct. I made a few minor comments. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-10-27 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
While testing this, we encountered another race condition with buffer pool resizing, related to buf_pool->page_hash. We will handle that separately later, in | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Comment by Matthias Leich [ 2020-10-27 ] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
bb-10.2- |