Hello,
Last night right after midnight one of my servers seemed to hang. All (or at least a great many?) queries would hang forever in the Execute phase. Of course this hung up the entire Galera cluster as well.
I was awakened by an alert around 12:40am. My innodb_fatal_semaphore_wait_threshold is set to 32 seconds, so this hang was not caught by that watchdog.
At 12:49 I was able to send MariaDB a signal which caused it to crash and dump core. So I do have a stack trace for this situation which I don't want to post publicly, but which is available upon request. I also have SHOW PROCESSLIST logs for much of the time in case that helps.
If a MariaDB expert could take a look at the situation I would appreciate it! Thanks.
- duplicates MDEV-32371: Deadlock between buf_page_get_zip() and buf_pool_t::corrupted_evict() on InnoDB ROW_FORMAT=COMPRESSED table corruption (Closed)
Just today, while working on additional instrumentation for MDEV-32899, I realized that the dict_sys.latch watchdog does not cover waits for a shared latch, only waits for an exclusive latch.
You have already filed MDEV-32371, whose fix I hope to push tomorrow, once some additional testing has been completed. The only reliable way to diagnose any hang is to produce the stack traces of all threads at the time of the hang, like you did in that bug. Is this bug something different?