MariaDB Server / MDEV-33853

Async rollback prepared transactions during binlog crash recovery

Details

    Description

      When doing server recovery, active transactions are rolled back
      automatically by the InnoDB background rollback thread. Prepared
      transactions are committed or rolled back accordingly by binlog
      recovery. Binlog recovery runs in the main thread before the server
      can provide service to users, so if there is a big transaction to
      roll back, the server will not be available for a long time.

      It would be better to let prepared transactions be rolled back by the background rollback thread as well.
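
      For illustration, a prepared transaction that is expensive to roll back can be
      produced as sketched here (hypothetical example; 'sbtest1' stands for any large
      table and is not part of this report):

        XA START 'big_trx';
        UPDATE sbtest1 SET k = k + 1;   -- touches many rows, producing a large undo log
        XA END 'big_trx';
        XA PREPARE 'big_trx';
        -- Crash the server (e.g. kill -9) at this point. On restart, binlog recovery
        -- decides the fate of 'big_trx'; if the decision is rollback, the whole undo
        -- log is applied before startup completes.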

          Activity

            axel Axel Schwenke added a comment -

            I have run a general InnoDB mixed workload performance test on commit e5145b22629 in branch bb-11.6-MDEV-33853 and its predecessor commit 9811d23b6d0. The binlog was enabled in all tests; once async and once sync:

            MDEV-33853.pdf

            The only worrying result is for t_oltp_writes_innodb (OLTP write-only) with sync binlog. There are also differences in t_oltp_full_innodb (OLTP read/write), but they are in favor of MDEV-33853. The differences for t_oltp_insert_innodb_batched (10x INSERT per trx) are probably bogus. The numbers are generally quite unstable for async binlog. I will repeat the test to see if it's reproducible ...
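
            For reference, "sync" and "async" binlog presumably refer to the sync_binlog setting; a minimal sketch of the two configurations (values assumed, not taken from the actual test setup):

              -- sync binlog: flush the binary log to disk at every commit group
              SET GLOBAL sync_binlog = 1;

              -- async binlog: leave flushing of the binary log to the operating system
              SET GLOBAL sync_binlog = 0;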

            axel Axel Schwenke added a comment -

            The second run of the general InnoDB tests completed. I put all results in one plot:

            MDEV-33853C.pdf

            It turns out that for the workload in t_oltp_full_innodb we have a high likelihood that MDEV-33853 is indeed faster with async binlog. The differences for t_oltp_insert_innodb_batched also seem to be real, but they concern just two thread counts: with 32 threads MDEV-33853 is slower and with 64 threads it is faster. So overall it's a draw.

            All other differences turned out bogus. So from that point of view MDEV-33853 is ok.


            marko Marko Mäkelä added a comment -

            axel, thank you for testing this. If I understood it correctly, you are running Sysbench workloads that do not involve restarting the server, nor any XA transactions for that matter. For that kind of test scenario, I don’t think that this code change can make any difference. The InnoDB changes are strictly limited to server startup, and I guess so are the changes to xarecover_do_commit_or_rollback() and xarecover_complete_and_count().

            That is, the observed differences ought to be due to random noise, or possibly due to slightly changed code layout (the number of MMU pages that the busy part of the executable code is residing on).

            axel Axel Schwenke added a comment -

            Thanks marko. I do not yet have a test case for measuring server startup time. I thought of running a sysbench prepare job and killing the server partway through, then saving the resulting datadir and using it for starting different server builds.
            But if this requires the transactions to be XA, then it will not work out of the box; Sysbench does not use XA.
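
            One way around that could be to open an extra client session next to the sysbench prepare and leave a transaction in the prepared state before killing the server (sketch only; 'big_t' is a placeholder for a separate pre-populated table):

              XA START 'startup_test';
              DELETE FROM big_t;            -- builds up a large undo log for recovery to roll back
              XA END 'startup_test';
              XA PREPARE 'startup_test';
              -- kill -9 mariadbd here, save the datadir, then start each server build
              -- on a copy of it and time how long startup takes.

              -- After restart, anything still left in the prepared state is listed by:
              XA RECOVER;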


            marko Marko Mäkelä added a comment -

            libing, thank you for your contribution and patience!


            People

              marko Marko Mäkelä
              libing Libing Song
              Votes: 0
              Watchers: 12

