[MDEV-33009] Server hangs for a long time with innodb_undo_log_truncate=ON - Jira

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Blocker
Resolution: Fixed
Affects Version/s: 10.5, 10.6, 10.11, 11.0(EOL), 11.1(EOL), 11.2(EOL), 11.3(EOL)
Fix Version/s: 10.5.24, 10.6.17, 10.11.7, 11.0.5, 11.1.4, 11.2.3, 11.3.2
Component/s: Storage Engine - InnoDB
Labels:
- hang
- performance
Environment:
Ubuntu 18.04 on AMD64
Ubuntu 20.04 on AMD64

Description

After implementing ~~MDEV-32757~~, we are seeing a performance anomaly with innodb_undo_log_truncate=ON. The server is not actually hung or deadlocked (it will eventually recover), but buf_pool.mutex is being occupied for an extremely long time (several minutes).

trx_purge_truncate_history() writes the message InnoDB: Truncating and is about to truncate an undo log tablespace.
trx_purge_truncate_history() is busy-looping in a scan of buf_pool.flush_list because one of the pages belonging to the undo tablespace is write-fixed.
During the time trx_purge_truncate_history() releases and re-acquires buf_pool.flush_list_mutex, buf_flush_page_cleaner (which is holding buf_pool.mutex in buf_do_flush_list_batch()) cannot grab it, in this Ubuntu 18.04 version of GNU libc and Linux kernel (4.15.0-112-generic). This could be similar to ~~MDEV-31343~~ and ~~MDEV-30180~~, which could only be reproduced in the same particular environment.
Most threads are blocked because the buf_flush_page_cleaner thread is holding buf_pool.mutex.

There is some indication that buf_flush_list_batch() may be making some progress (writing out some pages), but it would be extremely slow.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

10.6-MDEV-33009-1.png
39 kB
2023-12-13 11:32
24x5_high_threads.pdf
104 kB
2023-12-18 14:52
mariadb-10.5-MDEV-33009-e1c545486ad+1.zip
2.06 MB
2023-12-15 16:33
mariadb-10.6-MDEV-33009-76b99fccb44.zip
3.03 MB
2023-12-15 16:33
mariadb-ES-10.5-MDEV-33009.zip
2.44 MB
2023-12-13 11:25
mariadb-ES-10.5-MDEV-33009-fb800bd29ab.zip
2.02 MB
2023-12-13 13:42
MDEV-33009-9682add5cdf.png
32 kB
2023-12-19 13:38
MDEV-33062-3ef0e678b1c.png
57 kB
2023-12-19 13:40

Issue Links

relates to

MDEV-26733 assert on shutdown lock->lock_word == X_LOCK_DECR in test

Open

MDEV-33062 innodb_undo_log_truncate=ON prevents fast shutdown

Closed

MDEV-33112 innodb_undo_log_truncate=ON is blocking page writes

Closed

MDEV-33213 History list is not shrunk unless there is a pause in the workload

Closed

MDEV-30180 Server hang with innodb_undo_log_truncate=ON

Closed

MDEV-31343 Another server hang with innodb_undo_log_truncate=ON

Closed

MDEV-32757 innodb_undo_log_truncate=ON is not crash safe

Closed

(2 relates to)

Activity

Ascending order - Click to sort in descending order

View 16 older comments

Axel Schwenke added a comment - 2023-12-18 14:54

Here is a summary plot of the performance/behavior of various 10.5 and 10.6 commits for community server. 4 commits are shown:

red line: last release (broken, corruption!)
green line: HEAD of development
blue line: HEAD of ~~MDEV-33009~~ branch (less aggressive version)
pink line: HEAD of ~~MDEV-33009~~ branch (aggressive version)

Attachment: 24x5_high_threads.pdf

the tests with data set size 12x5 (12 thd) and data set size 24x5 (24 thd) did not make the undo logs grow and thus caused no truncate operation.

It seems the pink line gives the best (but not good) result.

Axel Schwenke added a comment - 2023-12-18 14:54 Here is a summary plot of the performance/behavior of various 10.5 and 10.6 commits for community server. 4 commits are shown: red line: last release (broken, corruption!) green line: HEAD of development blue line: HEAD of MDEV-33009 branch (less aggressive version) pink line: HEAD of MDEV-33009 branch (aggressive version) Attachment: 24x5_high_threads.pdf the tests with data set size 12x5 (12 thd) and data set size 24x5 (24 thd) did not make the undo logs grow and thus caused no truncate operation. It seems the pink line gives the best (but not good) result.

Marko Mäkelä added a comment - 2023-12-18 15:22

Yes, this bug occurs during a scan of dirty pages that is enabled by setting innodb_undo_log_truncate=ON.

Marko Mäkelä added a comment - 2023-12-18 15:22 Yes, this bug occurs during a scan of dirty pages that is enabled by setting innodb_undo_log_truncate=ON .

Axel Schwenke added a comment - 2023-12-19 13:42 - edited

commit 9682add5cdf "solves" the problem with performance, but effectively prevents undo log truncates during the benchmark. This can be clearly seen in the history list length here:

When we stop sysbench for 20 seconds every 6 minutes, the purge thread and with it the undo log trucate can run:

When we include 9682add5cdf we should add a KB comment that innodb_undo_log_truncate=ON will only have an effect when there are times when the server is not under stress.

Axel Schwenke added a comment - 2023-12-19 13:42 - edited commit 9682add5cdf "solves" the problem with performance, but effectively prevents undo log truncates during the benchmark. This can be clearly seen in the history list length here: When we stop sysbench for 20 seconds every 6 minutes, the purge thread and with it the undo log trucate can run: When we include 9682add5cdf we should add a KB comment that innodb_undo_log_truncate=ON will only have an effect when there are times when the server is not under stress.

Marko Mäkelä added a comment - 2023-12-19 14:05

axel, the commit that you mentioned only modifies some code that would be run after the InnoDB: Truncating message has been written to the server error log, to truncate an undo log tablespace. By design, the purge of history during a heavy workload is hard to predict. You might see some undo log truncation events during a workload if you let the workload run for a significantly longer time. In the bottom graph of 24x5_high_threads.pdf we have some results where a similar change (purple line) caused some stalls during the workload. Compared to that change, the revised commit would make trx_purge_truncate_history() acquire and release buf_pool.mutex also when a buffer page latch cannot be acquired without waiting. Some other testing suggested that this could significantly increase the probability of goto rescan and transform the loop from a linear scan into a quadratic scan. When there are millions or more pages in buf_pool.flush_list, this could be significant.

Based on these results, I think that the setting innodb_undo_log_truncate=ON may only be useful when the concurrent write workload is light or moderate.

Marko Mäkelä added a comment - 2023-12-19 14:05 axel , the commit that you mentioned only modifies some code that would be run after the InnoDB: Truncating message has been written to the server error log, to truncate an undo log tablespace. By design, the purge of history during a heavy workload is hard to predict. You might see some undo log truncation events during a workload if you let the workload run for a significantly longer time. In the bottom graph of 24x5_high_threads.pdf we have some results where a similar change (purple line) caused some stalls during the workload. Compared to that change, the revised commit would make trx_purge_truncate_history() acquire and release buf_pool.mutex also when a buffer page latch cannot be acquired without waiting. Some other testing suggested that this could significantly increase the probability of goto rescan and transform the loop from a linear scan into a quadratic scan. When there are millions or more pages in buf_pool.flush_list , this could be significant. Based on these results, I think that the setting innodb_undo_log_truncate=ON may only be useful when the concurrent write workload is light or moderate.

Marko Mäkelä added a comment - 2023-12-22 08:39

I filed ~~MDEV-33112~~ for an idea that we can employ a lazy approach and avoid the intrusive buffer pool scan altogether.

Marko Mäkelä added a comment - 2023-12-22 08:39 I filed MDEV-33112 for an idea that we can employ a lazy approach and avoid the intrusive buffer pool scan altogether.

MariaDB Server

Server hangs for a long time with innodb_undo_log_truncate=ON

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Git Integration