[MDEV-26860] MariaDB 10.5 does not do idle checkpoint (regression) Created: 2021-10-19  Updated: 2022-11-11  Resolved: 2021-11-22

Status: Closed
Project: MariaDB Server
Component/s: Server, Storage Engine - InnoDB
Affects Version/s: 10.5.12
Fix Version/s: 10.5.13, 10.6.5

Type: Bug Priority: Major
Reporter: carlos tutte Assignee: Marko Mäkelä
Resolution: Duplicate Votes: 0
Labels: need_feedback, performance

Attachments: PNG File MariaDB_10_5_checkpoint.png    
Issue Links:
Duplicate
is duplicated by MDEV-26626 InnoDB fails to advance the log check... Closed
Relates
relates to MDEV-27295 MariaDB 10.5 does not do idle checkpo... Closed

 Description   

MariaDB 10.5 does not checkpoint as I would expect (i.e same as in MariaDB 10.4 or PS 5.7 ) .

Attached is the graph of how MariaDB 10.5 checkpoints, i.e it let's redo log get to max checkpoint age, then checkpoints everything , then start over.
It is easily repeatable by creating a new instance from scratch with all the defaults and insert a few records, checking that checkpoint age does not move.

SEIS On MariaDB :
Log sequence number 1850969
Log flushed up to 1850969
Pages flushed up to 1846760
Last checkpoint at 1846748

SEIS on PS 5.7:
Log sequence number 4441325019
Log flushed up to 4441325019
Pages flushed up to 4441325019
Last checkpoint at 4441325010

It can be seen that "Last checkpoint at xxxx" is close to LSN on PS, but on MariaDB 10.5 it will remain with a fixed value until max checkpoint age.

In the attachment it can be seen that every ~4 hours the redo logs gets full, does an almost complete checkpoint and starts again. This causes an stall of a few seconds while the disks (which luckily are fast) do the checkpoint

On MariaDB 10.4 this does not happen and I have also compared configuration between 10.4 and 10.5 and could not find anything.

This other bug report MDEV-25093 mentions that any flushing is skipped but it seems something still broken.



 Comments   
Comment by Marko Mäkelä [ 2021-10-19 ]

Thank you for the report. This feels like a duplicate of MDEV-26626. Can you please test if the problem is repeatable with a recent 10.5 development snapshot from http://hasky.askmonty.org/archive/10.5/ (or with recent 10.5 source code)?

Generated at Thu Feb 08 09:48:30 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.