MariaDB Server / MDEV-35155

Performance degradation and instability observed on 10.6.19

Details

    Description

      I plan to migrate our MariaDB instances from `10.2.10` to `10.6.19` and have run some performance benchmarks. I observed that performance is not stable compared to `10.2.10`, especially for the in-memory workload.

      Here is my test setup.
      Test tool: sysbench 1.0.X
      OS: CentOS 7.9 X86_64
      MariaDB versions: 10.2.10 and 10.6.19
      Dataset: 10 tables, each with 5M rows; each table is ~1.2 GB, so the total size is ~12 GB
      Almost all config options are the same, except that I removed some options which are deprecated or removed in 10.6, e.g. `innodb_buffer_pool_instances`, `innodb_page_cleaners`, `innodb-thread-concurrency`, `innodb_checksum_algorithm`, etc.
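
      (As a side note, one way to double-check which old options a given server binary still accepts is to grep the option list the binary prints itself; this is only a sketch, and the option name in the grep is just an example:)

      # list every option the 10.6 binary recognizes; an option that does not appear
      # here (e.g. innodb-buffer-pool-instances) was removed and must be dropped from
      # my.cnf, otherwise the server will refuse to start
      mariadbd --no-defaults --verbose --help 2>/dev/null | grep -i 'buffer-pool-instances'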

      Test 1.
      In-memory working set, with `innodb_buffer_pool_size`=188GB
      > NOTE:
      > TPS-X means running the sysbench `oltp_read_write.lua` test with X threads

      10.2.10 (TPS chart; see attached screenshots)

      10.6.19 (TPS chart; see attached screenshots)

      We can see periodic performance drops with version `10.6.19`. `10.6.19` stays stable only in the 4-thread case, while `10.2.10`'s performance is stable at 4, 8, 16, and 32 threads.

      Test 2:
      Disk I/O-bound test with `innodb_buffer_pool_size=2G`

      10.2.10 (TPS chart; see attached screenshots)

      10.6.19 (TPS chart; see attached screenshots)

      You can see that `10.2.10` is also more stable than `10.6.19` here.

      Attachments

        1. 10.6.20_write_only.zip (1.08 MB)
        2. image-2024-10-15-11-41-04-176.png (44 kB)
        3. image-2024-10-15-11-43-21-120.png (60 kB)
        4. image-2024-10-15-11-46-38-886.png (101 kB)
        5. image-2024-10-15-11-47-23-300.png (140 kB)
        6. image-2024-10-16-14-52-49-455.png (59 kB)
        7. image-2024-10-16-16-03-42-724.png (30 kB)
        8. screenshot-1.png (56 kB)
        9. screenshot-2.png (42 kB)
        10. screenshot-3.png (40 kB)
        11. screenshot-4.png (44 kB)
        12. screenshot-5.png (40 kB)
        13. screenshot-6.png (145 kB)
        14. screenshot-7.png (39 kB)


          Activity

            lujinke Luke added a comment - - edited

            Performance degradation was also observed in the io-bound workload test.
            In the in-memory workload, 10.6.19 and 10.2.10 can achieve almost the same TPS and the CPU usage is also nearly identical. But in the io-bound workload test, 10.6.19's performance is far behind that of 10.2.10 in terms of average TPS:

            test     TPS of 10.2.10    TPS of 10.6.19 (percentage of 10.2.10)
            TPS-4    125.7586111       116.6411667 (92.7%)
            TPS-8    187.4434444       164.4717778 (87.7%)
            TPS-16   253.4766667       209.8410556 (82.7%)
            TPS-32   321.2640556       248.6147778 (77.4%)

            As concurrency increases, the gap widens: 10.6.19 drops from 92.7% of 10.2.10's TPS at 4 threads to 77.4% at 32 threads.
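
            (For reference, the average TPS figures above can be computed from the per-interval sysbench output with a small script along these lines; a sketch, assuming the logs/oltp_t*.log files produced by the test script in the comment below:)

            # each --report-interval line looks like: "[ 5s ] thds: 8 tps: 187.40 qps: ..."
            for f in logs/oltp_t*.log; do
                awk '/ tps: / { for (i = 1; i <= NF; i++) if ($i == "tps:") { sum += $(i + 1); n++ } }
                     END { if (n) printf "%s: avg tps = %.2f over %d intervals\n", FILENAME, sum / n, n }' "$f"
            done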

            wlad Vladislav Vaintroub added a comment - - edited

            lujinke, it would be helpful to provide the sysbench command line and the my.cnf you are using, so that the benchmark results can be reproduced exactly (maybe also describe the hardware, although that is of less importance).

            lujinke Luke added a comment - - edited

            Sure. Here is how to reproduce the sysbench test.

            sysbench version : 1.0.20
            OS               : CentOS Linux release 7.9.2009 (Core)
            KERNEL           : 3.10.0-1160.31.1.el7.x86_64
            CPU              : Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz
            CPU COUNT        : 48
            RAM              : 256 GB
            MariaDB          : 10.2.10 and 10.6.19
            Filesystem       : xfs
            

            1. Dataset: as I mentioned before, create 10 tables, each with 5M rows

            sysbench /usr/share/sysbench/oltp_read_write.lua --mysql-host=localhost --mysql-user=sbtest --mysql-password=sbtest --mysql-socket=/tmp/mysql.sock --mysql-db=sbtest --threads=4 --tables=10 --table-size=5000000 prepare
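
            (After the prepare step, the on-disk dataset size can be sanity-checked; the path below assumes the datadir and innodb-file-per-table layout from the my.cnf further down, and should show roughly 1.2 GB per table, ~12 GB in total:)

            # per-table and total size of the sysbench dataset
            du -ch /mysqldb/data/sbtest/sbtest*.ibd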
            

            2. The test script

            #!/bin/bash
            # the first 4-thread run is intended to warm up the buffer pool
            mkdir -p logs
            for i in 4 4 8 16 32 64
            do
                    echo "run sysbench with $i threads"
                    sysbench /usr/share/sysbench/oltp_read_write.lua --mysql-host=remote_host --mysql-user=sbtest --mysql-password=sbtest --mysql-port=3306 --mysql-db=sbtest --threads=$i --tables=10 --table-size=5000000 --report-interval=5 --time=900 run > logs/oltp_t$i.log
                    echo "sleep 60 seconds ..."
                    sleep 60                # 10.2.10 flushes dirty data intensively here, after each run finishes, while 10.6.19 flushes dirty data intensively during the workload
            done
             
            echo "done!"
            

            3. The config file

            [mysqld]
            server-id                       = 6301
            socket                          = /tmp/mysql.sock
            port                            = 3306
             
            character-set-server            = utf8mb4
            collation-server                = utf8mb4_general_ci
            default-storage-engine          = InnoDB
            transaction-isolation           = read-committed
             
            basedir                   = /usr
            datadir                         = /mysqldb/data
            innodb-data-home-dir            = /mysqldb/data
            innodb-log-group-home-dir       = /mysqldb/data
            log-bin                         = /mysqldb/binlogs/mysql-bin
            log-error                       = /mysqldb/logs/error.log
            relay-log                       = /mysqldb/logs/mysql-relay.log
            relay-log-space-limit           = 25G
            relay-log-index                 = /mysqldb/logs/mysql-relay-log.index
            relay-log-info-file             = /mysqldb/logs/mysql-relay-log.info
            slow-query-log-file             = /mysqldb/logs/slow-query.log
            tmpdir                          = /mysqldb/tmp
             
            key-buffer-size                 = 8G
            myisam-sort-buffer-size         = 256M
            max-sort-length                 = 256
            myisam-max-sort-file-size       = 120G
            myisam-recover-options          = DEFAULT
             
             
            # innodb_buffer_pool_instances  = 8  # removed for 10.6.19 test
            # innodb_checksum_algorithm     = innodb  # removed for 10.6.19 test
             
            innodb_strict_mode              = 0
            sql_mode                        = ''
            binlog_checksum         = none
            gtid_strict_mode                = 0
            innodb_temp_data_file_path      = ibtmp1:12M:autoextend
            # innodb_page_cleaners            = 8              # removed for 10.6.19 test
            plugin_load_add                 = metadata_lock_info
             
            max-allowed-packet              = 32M
            max-connections                 = 1000
            max-connect-errors              = 100
            skip-name-resolve
            wait-timeout                    = 86400
             
            bulk-insert-buffer-size         = 128M
            join-buffer-size                = 64M
            max-heap-table-size             = 640M
            open-files-limit                = 128000
            query-cache-limit               = 8M
            query-cache-size                = 0
            query-cache-type                = 1
            read-buffer-size                = 8M
            read-rnd-buffer-size            = 8M
            sort-buffer-size                = 14M
            table-cache                     = 10000
            table-definition-cache          = 40K
            thread-cache-size               = 400
            tmp-table-size                  = 640M
            userstat                        = ON
            optimizer-switch                = 'extended_keys=on'
             
            innodb-adaptive-hash-index      = 1
            # innodb-locks-unsafe-for-binlog  = 0          # removed for 10.6.19 test
            #innodb-concurrency-tickets     = 5000         # removed for 10.6.19 test
            innodb-io-capacity              = 2000
            # innodb-thread-concurrency       = 0      # removed for 10.6.19 test
            # innodb-thread-sleep-delay       = 0        # removed for 10.6.19 test
            innodb-max-dirty-pages-pct      = 20
            innodb-write-io-threads         = 4
            # General InnoDB options
            innodb-autoextend-increment     = 8
            innodb-buffer-pool-size         = 188G      # in-memory test
            # innodb-buffer-pool-size         = 2G       # io-bound test
            # innodb-checksums                = 1              # removed for 10.6.19 test
            innodb-data-file-path           = ibdata1:1024M;ibdata2:10M:autoextend
            innodb-doublewrite              = 1
            # innodb-file-format              = 1           # removed for 10.6.19 test
            innodb-file-per-table           = 1
            innodb-flush-log-at-trx-commit  = 2
            innodb-flush-method             = O_DIRECT
            innodb-lock-wait-timeout        = 400
            innodb-log-buffer-size          = 32M
            innodb-log-file-size            = 4000M
            innodb_purge_threads            = 8
            innodb-open-files               = 96000
            innodb-read-ahead-threshold     = 32
            innodb-read-io-threads          = 8
            innodb-stats-on-metadata        = OFF
            # innodb-support-xa               = 1         # removed for 10.6.19 test
            innodb-use-native-aio           = 1
             
            slow-query-log                  = ON
            log-output                      = TABLE,FILE
             
            binlog-cache-size               = 1M
            binlog-format                   = MIXED
            expire-logs-days                = 7
            log-slave-updates
            max-binlog-cache-size           = 16G
            max-binlog-size                 = 256M
            slave-transaction-retries       = 60
            sync-binlog                     = 1
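
            (To double-check that the two servers end up with comparable effective settings, the full variable list can be dumped on each instance and diffed; a sketch:)

            # run on each instance, then diff the two files
            mysql -uroot -S /tmp/mysql.sock -Nse "SHOW GLOBAL VARIABLES" | sort > vars_10.6.19.txt
            # ... same against the 10.2.10 instance > vars_10.2.10.txt, then:
            # diff vars_10.2.10.txt vars_10.6.19.txt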
            

            BTW: the 10.2.10 buffer pool dashboard above contains only 3 runs; TPS-4 is not included in that dashboard, and the 3 segments correspond to TPS-8, TPS-16 and TPS-32.


            marko Marko Mäkelä added a comment -

            There are a couple of "adaptive flushing" settings that could be configured, or maybe the logic around them could be improved.

            If you set innodb_max_dirty_pages_pct_lwm=1 or some even smaller positive value, then InnoDB should write out dirty pages proportionally to innodb_io_capacity. This should reduce the growth of the checkpoint age, at the risk of increasing write amplification (not being able to combine multiple modifications to a page within a single write). By default, this logic is disabled.

            There is also a setting innodb_adaptive_flushing_lwm, which you could try to set to a smaller nonnegative value than the default 10. I am not very familiar with the associated logic; it could be that it is not working as it was originally intended.
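
            (Both variables are dynamic, so they can be changed between sysbench runs without a server restart; a sketch, with purely illustrative values:)

            # quick A/B test of the two settings mentioned above
            mysql -uroot -S /tmp/mysql.sock -e "
                SET GLOBAL innodb_max_dirty_pages_pct_lwm = 1;
                SET GLOBAL innodb_adaptive_flushing_lwm   = 5;"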

            axel Axel Schwenke added a comment -

            lujinke what you see looks familiar to me. It is the impact of log flushing (flushing pages in order to make transactions persistent before they are overwritten in the REDO log). The exact reason why that happens at all is still unclear. But it seems to be connected to the new adaptive flushing algorithm.

            Configuration options that helped me to mitigate the effects:

            1. A bigger redo log. With the new recovery code it is now safe to make the redo log big, even as big as the buffer pool. I have gotten good results with a redo log size of 1/4 to 1/2 of the buffer pool.
            2. Tuning innodb_io_capacity. I guess your datadir is on an SSD? Then please set innodb_flush_neighbors = 0 and increase innodb_io_capacity. I use innodb_io_capacity = 10000.
            3. Tuning adaptive flushing. I have

              innodb_adaptive_flushing = 1
              innodb_max_dirty_pages_pct = 95
              innodb_max_dirty_pages_pct_lwm = 75
              

              in my.cnf.
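
              (Put together as a my.cnf excerpt, the suggestions above might look like the following sketch; the redo log size assumes the 188G buffer pool from the in-memory test, and all values are starting points rather than tuned recommendations:)

              # consolidated sketch of the mitigations above (illustrative values)
              innodb_log_file_size            = 48G      # ~1/4 of the 188G buffer pool
              innodb_io_capacity              = 10000    # assumes the datadir is on SSD
              innodb_flush_neighbors          = 0
              innodb_adaptive_flushing        = 1
              innodb_max_dirty_pages_pct      = 95
              innodb_max_dirty_pages_pct_lwm  = 75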


            People

              axel Axel Schwenke
              lujinke Luke

              Votes: 0
              Watchers: 7

