[MDEV-27850] MTR tests can hang due to DEBUG_SYNC race condition Created: 2022-02-15 Updated: 2024-01-26 |
|
| Status: | Stalled |
| Project: | MariaDB Server |
| Component/s: | Replication, Tests |
| Affects Version/s: | 10.2, 10.3, 10.4, 10.5, 10.6, 10.7, 10.8, 10.9, 10.10, 10.11 |
| Fix Version/s: | 10.4, 10.5, 10.6, 10.11 |
| Type: | Bug | Priority: | Major |
| Reporter: | Brandon Nesterenko | Assignee: | Brandon Nesterenko |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None |
| Issue Links: |
|
| Description |
|
If a DEBUG_SYNC signal is overwritten before the target thread acknowledges it, the thread hangs until its wait times out, because it is still waiting for the signal it missed. rpl.rpl_seconds_behind_master_spike highlights this problem; commit cdf19cd is an example fix. Other tests that may be affected by this issue are rpl.rpl_dump_request_retry_warning, main.query_cache_debug, and main.partition_debug_sync. A comprehensive list of affected tests should be created, and those tests should then be fixed. Edit:
|
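The race described above can be illustrated with a small stand-alone model. This is only a sketch, not MariaDB code: it assumes the signal state behaves like a single slot holding the most recent signal name, and the names `SingleSlotSignal`, `lost_signal_demo`, and `run_with_ack` are hypothetical. The second function shows one possible way to avoid the race (waiting for an acknowledgement before sending the next signal); whether the referenced commits use exactly this pattern is not stated in this issue.

```python
import threading


class SingleSlotSignal:
    """Toy model (assumption): one shared slot holds the latest signal name."""

    def __init__(self):
        self._cond = threading.Condition()
        self._current = ""

    def signal(self, name):
        with self._cond:
            # Overwrites any earlier signal, even if nobody has seen it yet.
            self._current = name
            self._cond.notify_all()

    def wait_for(self, name, timeout):
        # Returns True if the slot holds `name` before the timeout, else False.
        with self._cond:
            return self._cond.wait_for(
                lambda: self._current == name, timeout=timeout
            )


def lost_signal_demo():
    """Two signals are sent back to back; the first one is lost."""
    sync = SingleSlotSignal()
    sync.signal("first")
    sync.signal("second")  # overwrites "first" before any waiter observed it
    # A waiter that arrives now can only ever see "second", so it times out.
    return sync.wait_for("first", timeout=0.2)


def run_with_ack():
    """The sender waits for an acknowledgement before overwriting the slot."""
    sync = SingleSlotSignal()
    result = {}

    def worker():
        result["got_first"] = sync.wait_for("first", timeout=5)
        sync.signal("first_acked")

    thread = threading.Thread(target=worker)
    thread.start()
    sync.signal("first")
    acked = sync.wait_for("first_acked", timeout=5)
    sync.signal("second")  # safe: "first" has already been consumed
    thread.join()
    return result["got_first"] and acked


if __name__ == "__main__":
    print("waiter saw overwritten signal:", lost_signal_demo())
    print("waiter saw signal with ack:", run_with_ack())
```

In the first function the waiter times out because the slot no longer holds the name it is waiting for, which corresponds to the hang described in this issue. In the second, the sender does not emit its next signal until the worker has confirmed receipt, so nothing is overwritten early.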
| Comments |
| Comment by Brandon Nesterenko [ 2022-02-22 ] |
|
Hey Andrei! Can you review my patch for fixing tests main.query_cache_debug, main.partition_debug_sync, and rpl.rpl_dump_request_retry_warning? Commit: 883fe83 Buildbot: bb-10.2-MDEV-27850 Thanks! |
| Comment by Andrei Elkin [ 2022-02-25 ] |
|
The test changes look good. Let's address DEBUG_SYNC's single-signal limitation as a follow-up. |