[MDEV-17346] parallel slave start and stop races to workers disappeared - Jira

XML

Word

Printable

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Major
Resolution: Fixed
Affects Version/s: 10.1(EOL)
Fix Version/s: 10.1.37
Component/s: Replication
Labels:
None

Description

The bug appears as a slave SQL thread hanging in
rpl_parallel_thread_pool::get_thread() while there are no slave worker
threads to awake it.

The reason of the hang is that at the parallel slave worker pool
activation the being stared SQL thread could read the worker pool size
concurrently with pool deactivation. At reading the SQL thread did not
employ necessary protection from a race.

One way to fix it is to make the SQL thread at the pool activation first
to grab the same lock as potential deactivator also does prior
to access the pool size.

Attachments

Issue Links

relates to

MDEV-9573 'Stop slave' hangs on replication slave

Closed

Activity

People

Assignee:: Andrei Elkin

Reporter:: Andrei Elkin

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 2018-10-02 11:01

Updated:: 2024-07-07 23:38

Resolved:: 2018-10-08 16:56

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.