Details

    Description

      Every ~2-7 days, around midnight, one of our SQL servers is experiencing an issue where it deadlocks near-completely. The log, as expected, just stops abruptly with no indication of what's wrong.

      I can still connect using the 'root' account using a unix socket while this happens. Active queries (show processlist) seems independent.

      Not the queries are deadlocking, the program itself is. No queries will process or complete as the program internally waits endlessly for mutexes.

      I researched a possible cause; the most common appears to be calling unsafe functions in signal handlers.

      I'm not too well versed in gdb. I don't know how to reproduce the problem (that's our entire issue). It can and does occur periodically. THe situation in [info threads] of gdb looks a bit like this:

      • About 300 threads stuck at `0x7f7bf8fa86c0 (LWP 684969) "mariadbd" syscall () at ../sysdeps/unix/sysv/linux/x86_64/syscall.S:38`
      • 10-12 threads inside __futex_abstimed_wait_common64
      • 5-6 threads in _GI__poll
      • About twenty entries like this (unknown); `Thread 0x7f7bf838d6c0 (LWP 329146) "iou-wrk-298768" 0x0000000000000000 in ?? ()`

      Attachments

        1. backtrace.log
          2.65 MB
        2. keyQuery.txt
          112 kB

        Activity

          npr npr created issue -
          serg Sergei Golubchik made changes -
          Field Original Value New Value
          Status Open [ 1 ] Needs Feedback [ 10501 ]
          npr npr made changes -
          Attachment backtrace.log [ 73840 ]
          marko Marko Mäkelä made changes -
          Assignee Debarun Banerjee [ JIRAUSER54513 ]
          marko Marko Mäkelä made changes -
          Status Needs Feedback [ 10501 ] Open [ 1 ]
          debarun Debarun Banerjee made changes -
          Status Open [ 1 ] Needs Feedback [ 10501 ]
          serg Sergei Golubchik made changes -
          Fix Version/s N/A [ 14700 ]
          Resolution Incomplete [ 4 ]
          Status Needs Feedback [ 10501 ] Closed [ 6 ]
          npr npr made changes -
          Attachment keyQuery.txt [ 74109 ]
          npr npr made changes -
          Affects Version/s 10.11.6 [ 29020 ]
          Affects Version/s 10.11.8 [ 29630 ]
          marko Marko Mäkelä made changes -
          Resolution Incomplete [ 4 ]
          Status Closed [ 6 ] Stalled [ 10000 ]
          marko Marko Mäkelä made changes -
          Component/s Storage Engine - InnoDB [ 10129 ]
          Component/s OTHER [ 10125 ]
          Fix Version/s 10.6 [ 24028 ]
          Fix Version/s 10.11 [ 27614 ]
          Fix Version/s N/A [ 14700 ]
          Labels innodb hang
          marko Marko Mäkelä made changes -
          Status Stalled [ 10000 ] Needs Feedback [ 10501 ]
          serg Sergei Golubchik made changes -
          Fix Version/s N/A [ 14700 ]
          Fix Version/s 10.6 [ 24028 ]
          Fix Version/s 10.11 [ 27614 ]
          Resolution Incomplete [ 4 ]
          Status Needs Feedback [ 10501 ] Closed [ 6 ]

          People

            debarun Debarun Banerjee
            npr npr
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Git Integration

                Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.