[MDEV-34445] Rare Futex deadlocks - Jira

XML

Word

Printable

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Major
Resolution: Incomplete
Affects Version/s: 10.11.6
Fix Version/s: N/A
Component/s: Storage Engine - InnoDB
Labels:
- hang
Environment:
Debian 12 / Stable.

Description

Every ~2-7 days, around midnight, one of our SQL servers is experiencing an issue where it deadlocks near-completely. The log, as expected, just stops abruptly with no indication of what's wrong.

I can still connect using the 'root' account using a unix socket while this happens. Active queries (show processlist) seems independent.

Not the queries are deadlocking, the program itself is. No queries will process or complete as the program internally waits endlessly for mutexes.

I researched a possible cause; the most common appears to be calling unsafe functions in signal handlers.

I'm not too well versed in gdb. I don't know how to reproduce the problem (that's our entire issue). It can and does occur periodically. THe situation in [info threads] of gdb looks a bit like this:

About 300 threads stuck at `0x7f7bf8fa86c0 (LWP 684969) "mariadbd" syscall () at ../sysdeps/unix/sysv/linux/x86_64/syscall.S:38`
10-12 threads inside __futex_abstimed_wait_common64
5-6 threads in _GI__poll
About twenty entries like this (unknown); `Thread 0x7f7bf838d6c0 (LWP 329146) "iou-wrk-298768" 0x0000000000000000 in ?? ()`

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

backtrace.log
2.65 MB
2024-07-11 16:33
keyQuery.txt
112 kB
2024-10-08 08:30

Activity

People

Assignee:: Debarun Banerjee

Reporter:: npr

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 2024-06-24 08:52

Updated:: 2024-11-18 10:47

Resolved:: 2024-11-18 10:47

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.