[MDEV-32552] Write-ahead logging is broken for freed pages Created: 2023-10-23  Updated: 2023-11-20  Resolved: 2023-10-23

Status: Closed
Project: MariaDB Server
Component/s: Storage Engine - InnoDB
Affects Version/s: 10.5, 10.6, 10.7, 10.8, 10.9, 10.10, 10.11, 11.0, 11.1, 11.2, 11.3
Fix Version/s: 10.5.23, 10.6.16, 10.10.7, 10.11.6, 11.0.4, 11.1.3, 11.2.2

Type: Bug Priority: Blocker
Reporter: Marko Mäkelä Assignee: Marko Mäkelä
Resolution: Fixed Votes: 0
Labels: recovery, regression

Issue Links:
Problem/Incident
is caused by MDEV-15528 Avoid writing freed InnoDB pages Closed

 Description   

MDEV-15528 implemented an optimization: pages that have been marked as freed will not be written back to data files. Unfortunately, this optimization violates the write-ahead logging protocol, because we fail to ensure that all log up to the freeing of the page has been durably written before we elide the write of the data page.

If InnoDB is killed before the log for freeing a page was durably written, crash recovery could fail because it would read an older version of the data page and attempt to apply log records that are for a newer version of the page.

We can observe some occasional recovery test failures that could be explained by this, such as the following:

10.6 56c9b0bca0576985c31f20b46dcb060a

atomic.alter_table 'innodb'              w1 [ fail ]  Found warnings/errors in server log file!
        Test ended at 2023-01-10 11:25:39
line
2023-01-10 11:24:24 0 [ERROR] InnoDB: Corrupted page [page id: space=228, page number=0] of datafile './test/t1.ibd' could not be found in the doublewrite buffer.

The scenario that I have in mind would be fixed by making buf_page_free() mark the freed block as modified in the mini-transaction, and buf_flush_page() check that everything up to the FIL_PAGE_LSN of the page has been durably written to the redo log. After first change, that field would be updated by buf_flush_note_modification(), which is invoked by mtr_t::commit().



 Comments   
Comment by Marko Mäkelä [ 2023-10-23 ]

In 10.6, buf_page_free() invokes mtr->memo_push(block, MTR_MEMO_PAGE_X_MODIFY) ever since MDEV-29374 was fixed. But the check in buf_page_t::flush() is missing.

Generated at Thu Feb 08 10:32:13 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.