Details
Type: Bug
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: 10.11.3, 10.6.13
Description
After uploading MariaDB 10.11.3 to the Launchpad builders, several of them failed on the test innodb.recovery_memory:
innodb.recovery_memory 'innodb,release' w3 [ fail ] Found warnings/errors in server log file!
Test ended at 2023-05-11 22:44:39
line
2023-05-11 22:44:34 0 [Warning] InnoDB: Difficult to find free blocks in the buffer pool (21 search iterations)! 21 failed attempts to flush a page! Consider increasing innodb_buffer_pool_size. Pending flushes (fsync): 0. 166 OS file reads, 66 OS file writes, 0 OS fsyncs.
Examples from amd64 and arm64 builds:
https://launchpadlibrarian.net/665736950/buildlog_ubuntu-mantic-arm64.mariadb_1%3A10.11.3-1~ubuntu23.10.1~1683836249.0a0f09bbe32.dev.otto_BUILDING.txt.gz
https://launchpadlibrarian.net/665721775/buildlog_ubuntu-mantic-amd64.mariadb_1%3A10.11.3-1~ubuntu23.10.1~1683836249.0a0f09bbe32.dev.otto_BUILDING.txt.gz
I see thiru worked on this test in February/March of 2023; perhaps he would know best what might have regressed here?
Issue Links
- is caused by
  - MDEV-26827 Make page flushing even faster (Closed)
- relates to
  - MDEV-26827 Make page flushing even faster (Closed)
  - MDEV-31353 InnoDB recovery hangs after reporting corruption (Closed)
  - MDEV-31354 SIGSEGV in log_sort_flush_list() in InnoDB crash recovery (Closed)
  - MDEV-29911 InnoDB recovery and mariadb-backup --prepare fail to report detailed progress (Closed)
Activity
Field | Original Value | New Value |
---|---|---|
Link | This issue relates to |
Component/s | Storage Engine - InnoDB [ 10129 ] | |
Fix Version/s | 10.6 [ 24028 ] | |
Fix Version/s | 10.9 [ 26905 ] | |
Fix Version/s | 10.10 [ 27530 ] | |
Fix Version/s | 10.11 [ 27614 ] | |
Fix Version/s | 11.0 [ 28320 ] | |
Fix Version/s | 11.1 [ 28549 ] | |
Assignee | Marko Mäkelä [ marko ] | |
Labels | hang recovery | |
Priority | Minor [ 4 ] | Critical [ 2 ] |
Status | Open [ 1 ] | In Progress [ 3 ] |
Link | This issue relates to |
Link | This issue relates to |
Link | This issue relates to |
issue.field.resolutiondate | 2023-05-26 14:03:13.0 | 2023-05-26 14:03:13.745 |
Fix Version/s | 10.6.14 [ 28914 ] | |
Fix Version/s | 10.9.7 [ 28916 ] | |
Fix Version/s | 10.10.5 [ 28917 ] | |
Fix Version/s | 10.11.4 [ 28918 ] | |
Fix Version/s | 11.0.3 [ 28920 ] | |
Fix Version/s | 11.1.2 [ 28921 ] | |
Fix Version/s | 10.6 [ 24028 ] | |
Fix Version/s | 10.9 [ 26905 ] | |
Fix Version/s | 10.10 [ 27530 ] | |
Fix Version/s | 10.11 [ 27614 ] | |
Fix Version/s | 11.0 [ 28320 ] | |
Fix Version/s | 11.1 [ 28549 ] | |
Resolution | Fixed [ 1 ] | |
Status | In Progress [ 3 ] | Closed [ 6 ] |
Link | This issue is caused by |
Labels | hang recovery | hang recovery regression |
Affects Version/s | 10.6.13 [ 28514 ] |
Link | This issue blocks MENT-1832 [ MENT-1832 ] |
Link | This issue blocks MENT-1835 [ MENT-1835 ] |
Fix Version/s | 10.6.15 [ 29013 ] | |
Fix Version/s | 10.9.8 [ 29015 ] | |
Fix Version/s | 10.10.6 [ 29017 ] | |
Fix Version/s | 10.11.5 [ 29019 ] | |
Fix Version/s | 10.6.14 [ 28914 ] | |
Fix Version/s | 10.9.7 [ 28916 ] | |
Fix Version/s | 10.10.5 [ 28917 ] | |
Fix Version/s | 10.11.4 [ 28918 ] |
Fix Version/s | 10.6.14 [ 28914 ] | |
Fix Version/s | 10.9.7 [ 28916 ] | |
Fix Version/s | 10.10.5 [ 28917 ] | |
Fix Version/s | 10.11.4 [ 28918 ] | |
Fix Version/s | 11.0.2 [ 28706 ] | |
Fix Version/s | 11.1.1 [ 28704 ] | |
Fix Version/s | 11.0.3 [ 28920 ] | |
Fix Version/s | 11.1.2 [ 28921 ] | |
Fix Version/s | 10.6.15 [ 29013 ] | |
Fix Version/s | 10.9.8 [ 29015 ] | |
Fix Version/s | 10.10.6 [ 29017 ] | |
Fix Version/s | 10.11.5 [ 29019 ] |
The purpose of this test is to exercise crash recovery in the case that multiple recovery batches will be needed, because the parsed log records will not fit in the buffer pool at once.
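The idea of needing several recovery batches can be illustrated with a toy model. This is only a sketch under loud assumptions: the function `split_into_batches`, the record sizes, and the `memory_budget` value are all hypothetical, and the greedy grouping here is not how InnoDB actually manages parsed log records. It only shows why a log that does not fit in memory at once forces more than one pass.

```python
def split_into_batches(record_sizes, memory_budget):
    """Greedily group record sizes into batches that each fit memory_budget.

    Toy model only: real InnoDB recovery tracks parsed redo log records per
    page inside the buffer pool, which this sketch does not attempt to mimic.
    """
    batches, current, used = [], [], 0
    for size in record_sizes:
        if size > memory_budget:
            raise ValueError("a single record exceeds the memory budget")
        if used + size > memory_budget:
            # The current batch is full: it must be applied before parsing on.
            batches.append(current)
            current, used = [], 0
        current.append(size)
        used += size
    if current:
        batches.append(current)
    return batches


# Hypothetical record sizes (arbitrary units) and a hypothetical budget.
records = [400, 300, 500, 200, 600, 100]
batches = split_into_batches(records, memory_budget=1000)
print(len(batches), batches)
```

With a budget large enough to hold all records, the same function returns a single batch, which is the case the test deliberately avoids.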
I recently improved the memory management of crash recovery in MDEV-29911. That fix has not been merged beyond the 10.9 branch yet. However, there was a failure of that test in 10.9 yesterday, with a recovery hang:
10.9 44c9008ba65686abf1c82c9166255a8c52d61f74
2023-05-25 10:42:54 0 [Note] InnoDB: End of log at LSN=1540268
2023-05-25 10:42:54 0 [Note] InnoDB: To recover: LSN 528647/1540268; 279 pages
2023-05-25 10:42:54 0 [Note] InnoDB: To recover: LSN 1004400/1540268; 269 pages
CURRENT_TEST: innodb.recovery_memory
…
2023-05-25 10:51:38 0 [Note] Starting MariaDB 10.9.7-MariaDB-log source revision 44c9008ba65686abf1c82c9166255a8c52d61f74 as process 325396
Recovery was apparently stuck for almost 9 minutes, without any further messages being issued. After MDEV-29911, progress messages are supposed to be issued at the start of each recovery batch and every 15 seconds within a batch. Based on the reported LSN, it looks like at least 3 batches would have been needed.
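The silence in the log excerpt above can be checked mechanically. The following is a minimal sketch, not part of any MariaDB tooling: the helper names `find_silent_gaps` and `recovery_progress` are hypothetical, the log lines are copied from the excerpt, and the 15-second threshold is the reporting interval mentioned in the comment. Note that the last line is the next server start, so the measured gap is an upper bound on how long recovery itself was silent.

```python
import re
from datetime import datetime, timedelta

# Log lines copied from the excerpt above (the elided part is omitted).
LOG = """\
2023-05-25 10:42:54 0 [Note] InnoDB: End of log at LSN=1540268
2023-05-25 10:42:54 0 [Note] InnoDB: To recover: LSN 528647/1540268; 279 pages
2023-05-25 10:42:54 0 [Note] InnoDB: To recover: LSN 1004400/1540268; 269 pages
2023-05-25 10:51:38 0 [Note] Starting MariaDB 10.9.7-MariaDB-log source revision 44c9008ba65686abf1c82c9166255a8c52d61f74 as process 325396
"""

TIMESTAMP = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) ")
PROGRESS = re.compile(r"To recover: LSN (\d+)/(\d+); (\d+) pages")


def find_silent_gaps(log_text, max_silence=timedelta(seconds=15)):
    """Return (start, end, duration) for consecutive lines further apart than max_silence."""
    gaps = []
    previous = None
    for line in log_text.splitlines():
        match = TIMESTAMP.match(line)
        if not match:
            continue  # Skip lines without a leading timestamp.
        now = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
        if previous is not None and now - previous > max_silence:
            gaps.append((previous, now, now - previous))
        previous = now
    return gaps


def recovery_progress(log_text):
    """Return (recovered_lsn, end_lsn, pages) for each 'To recover' message."""
    return [tuple(int(v) for v in m) for m in PROGRESS.findall(log_text)]


gaps = find_silent_gaps(LOG)
progress = recovery_progress(LOG)
for start, end, duration in gaps:
    print(f"silent from {start} to {end} ({duration})")
for recovered, end_lsn, pages in progress:
    print(f"LSN {recovered}/{end_lsn}, {pages} pages")
```

On this excerpt the only gap found is the one between the second progress message and the next server start, and the last reported LSN is still below the end-of-log LSN, which is consistent with a further batch having been pending.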