[MDEV-26779] reduce lock_sys.wait_mutex contention by using spinloop construct

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 10.6
Fix Version/s: 10.6.5
Component/s: Storage Engine - InnoDB
Labels:
- performance
Environment: GNU/Linux on ARMv8 (AArch64)

Description

reduce lock_sys.wait_mutex contention by using spinloop construct

wait_mutex plays an important role when the workload involves conflicting transactions.

On a heavily contended system at increasing scalability, it is quite possible that the majority of transactions have to wait before acquiring resources.

This causes a lot of contention on wait_mutex, but most of this contention is short-lived. That suggests using a spin loop to avoid giving up the compute core, which would in turn involve the OS scheduler and add latency.

The idea has shown promising results, with performance improving by up to 70-100% for write workloads.
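
The spin-before-block idea described above can be illustrated with a minimal sketch. This is not the patch attached to this issue: `spin_then_lock`, `cpu_relax`, the default of 50 spin rounds, and the use of `std::mutex` as a stand-in for lock_sys.wait_mutex are all assumptions made for illustration.

```cpp
#include <atomic>
#include <mutex>
#include <thread>
#include <vector>

// Hint to the CPU that we are busy-waiting. The choice of instruction per
// platform is an assumption of this sketch, not taken from the patch.
inline void cpu_relax()
{
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
  __builtin_ia32_pause();
#elif defined(__GNUC__) && defined(__aarch64__)
  __asm__ __volatile__("yield" ::: "memory");
#else
  std::atomic_signal_fence(std::memory_order_seq_cst); // compiler barrier only
#endif
}

// Try to acquire the mutex a bounded number of times without sleeping.
// Only if every attempt fails do we fall back to a blocking lock(), which
// may put the thread to sleep and involve the OS scheduler.
template <typename Mutex>
void spin_then_lock(Mutex &m, unsigned rounds = 50 /* hypothetical tuning value */)
{
  for (unsigned i = 0; i < rounds; i++)
  {
    if (m.try_lock())
      return;
    cpu_relax();
  }
  m.lock();
}

// Small demonstration: several threads increment a shared counter while
// holding a mutex acquired through spin_then_lock().
long contended_increment(unsigned n_threads, unsigned iterations)
{
  std::mutex wait_mutex; // stand-in for lock_sys.wait_mutex
  long counter = 0;
  std::vector<std::thread> workers;
  for (unsigned t = 0; t < n_threads; t++)
    workers.emplace_back([&] {
      for (unsigned i = 0; i < iterations; i++)
      {
        spin_then_lock(wait_mutex);
        counter++;
        wait_mutex.unlock();
      }
    });
  for (auto &w : workers)
    w.join();
  return counter;
}
```

Bounding the spin matters: if the critical section turns out to be long, the thread still falls back to a blocking wait instead of burning CPU indefinitely.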

Attachments

- custom spin vs pthread spin.png (97 kB, 2021-10-14 10:21)
- MDEV-26779-1.pdf (22 kB, 2021-10-26 12:19)
- MDEV-26779-2.pdf (15 kB, 2021-10-27 14:00)
- MDEV-26779-3.pdf (16 kB, 2021-10-27 14:00)
- patch (custom spin) and patch (pthread spin).png (16 kB, 2021-10-14 10:21)
- sysbench.pdf (22 kB, 2021-10-15 14:16)
- update-index (512 threads) x86 custom spin vs pthread spin loop.png (65 kB, 2021-10-14 15:03)
- update-index (uniform) - baseline vs patched - arm.png (39 kB, 2021-10-07 04:16)
- update-index (uniform) - baseline vs patched - x86.png (39 kB, 2021-10-07 04:16)
- update-index (zipfian) - baseline vs patched - arm.png (43 kB, 2021-10-07 04:16)
- update-index (zipfian) - baseline vs patched - x86.png (43 kB, 2021-10-07 04:16)
- update-index (zipfian) - inline, update-index (zipfian) - noniline, update-non-index (zipfian) - inline and update-non-index (zipfian) - noninline.png (20 kB, 2021-10-19 11:23)
- update-non-index (uniform) - baseline vs patched - arm.png (43 kB, 2021-10-07 04:16)
- update-non-index (uniform) - baseline vs patched - x86.png (63 kB, 2021-10-07 04:16)
- update-non-index (zipfian) - baseline vs patched - arm.png (46 kB, 2021-10-07 04:16)
- update-non-index (zipfian) - baseline vs patched - x86.png (43 kB, 2021-10-07 04:16)
- x86-wait-mutex-run.png (95 kB, 2021-10-18 11:41)

Issue Links

is caused by

- MDEV-21452 Use condition variables and normal mutexes instead of InnoDB os_event and mutex (Closed)

relates to

- MDEV-16232 Use fewer mini-transactions (Stalled)
- MDEV-16406 Refactor the InnoDB record locks (Open)

Activity

Krunal Bauskar added a comment - 2021-10-25 07:13

After the multiple changes related to buf_pool mutex optimization were folded in, I re-evaluated the patch.

1. For uniform (as we observed before) there is no change in performance.
2. For zipfian (the contention case), ARM shows a consistent performance improvement at higher scalability. For x86, update-non-index at 1024 scalability showed some regression, while update-index and lower scalability continued to perform well. (This could be due to a flushing issue.)

Axel Schwenke added a comment - 2021-10-26 12:22

MDEV-26779-1.pdf shows no significant performance change. This was however run with uniform RNG and datadir on RAM-disk.

Axel Schwenke added a comment - 2021-10-27 14:01

MDEV-26779-2.pdf completes the picture with the inlined version. MDEV-26779-3.pdf shows results for the Zipf RNG. In any case it looks like for x86 the non-inlined variant shows the better behavior.

Marko Mäkelä added a comment - 2021-10-27 14:11

I think that for now, we can apply a simple ARMv8-specific change of initializing lock_sys.wait_mutex with MY_MUTEX_INIT_FAST a.k.a. PTHREAD_ADAPTIVE_MUTEX_INITIALIZER_NP, similar to what we did for the log_sys mutexes in MDEV-26855.

I would not change other platforms than ARMv8, because our own tests on AMD64 do not show any significant improvement.

Spinning is basically a hack to work around contention. The lock_sys.wait_mutex must be split in some way to properly fix this, in a future task.
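
For illustration, the ARMv8-only adaptive-mutex initialization suggested in this comment might look roughly like the sketch below. It uses the plain pthread API rather than MariaDB's own wrappers; the helper name `init_wait_mutex` and the exact preprocessor guards are assumptions of this sketch, not the committed change. On glibc, an adaptive mutex spins briefly in `pthread_mutex_lock()` before blocking.

```cpp
#include <pthread.h>

// Hypothetical helper: initialize a mutex as "adaptive" on AArch64 builds
// where the glibc extension is available, and as a default mutex elsewhere.
inline int init_wait_mutex(pthread_mutex_t *m)
{
#if defined(PTHREAD_ADAPTIVE_MUTEX_INITIALIZER_NP) && defined(__aarch64__)
  pthread_mutexattr_t attr;
  int err = pthread_mutexattr_init(&attr);
  if (err)
    return err;
  // PTHREAD_MUTEX_ADAPTIVE_NP is a non-portable glibc mutex type.
  err = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ADAPTIVE_NP);
  if (!err)
    err = pthread_mutex_init(m, &attr);
  pthread_mutexattr_destroy(&attr);
  return err;
#else
  // Other platforms keep the default mutex type, as proposed above.
  return pthread_mutex_init(m, nullptr);
#endif
}
```

Callers lock and unlock the mutex as usual; only the initialization differs, which keeps the change small and easy to restrict to one platform.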

Marko Mäkelä added a comment - 2021-10-27 14:45

I think that before attempting to split lock_sys.wait_mutex, we should implement the following and re-evaluate the situation:

- MDEV-16232 so that UPDATE and DELETE will avoid setting non-gap locks in the non-contended case
- MDEV-16406 so that accessing the record locks will hopefully be faster and the critical sections of lock_sys.wait_mutex smaller.

People

Assignee: Marko Mäkelä

Reporter: Krunal Bauskar

Votes: 0

Watchers: 7

Dates

Created: 2021-10-07 04:13

Updated: 2021-10-27 14:45

Resolved: 2021-10-27 14:42
