Type: Bug
Priority: Major
Resolution: Fixed
Affects Version/s: 10.4 (EOL), 10.5, 10.6, 10.11, 11.0 (EOL), 11.1 (EOL), 11.2 (EOL), 11.3 (EOL), 11.4
mleich provided two rr replay traces where mariadb-backup --backup would fail as follows:
10.6 ec7db2bdf849fc1a5bad906764920edda4121bd6
2024-04-22 12:33:25 0 [ERROR] InnoDB: Checksum mismatch in the first page of file .//undo001
2024-04-22 12:33:25 0 [ERROR] InnoDB: Unable to read first page of file .//undo001
[00] 2024-04-22 12:33:25 merror: xb_load_tablespaces() failed with error Data structure corruption.
The copy of the page that was read is evidently a mix of two versions, because the least significant 32 bits of the log sequence number (LSN) stored at the start of the page (0x0129b319) differ from those stored at the end of the page (0x0101032c):
0x000055cee647bbe4 595 if (crc32 != ut_crc32(read_buf,
(rr) display/i $pc
1: x/i $pc
=> 0x55cee647bbe4 <_Z21buf_page_is_corruptedbPKhm+620>: cmp %eax,%r13d
(rr) i reg eax
eax 0x3dc71f83 1036459907
(rr) i reg r13d
r13d 0xece2f286 -320671098
(rr) p/x read_buf[16]@8
$1 = {0x0, 0x0, 0x0, 0x0, 0x1, 0x29, 0xb3, 0x19}
(rr) p/x read_buf[srv_page_size-8]@8
$2 = {0x1, 0x1, 0x3, 0x2c, 0xec, 0xe2, 0xf2, 0x86}
(rr) bt
#0 0x000055cee647bbe4 in buf_page_is_corrupted (check_lsn=check_lsn@entry=false, read_buf=read_buf@entry=0x5c352ffe0000 "", fsp_flags=fsp_flags@entry=23)
at /data/Server/10.6B/storage/innobase/buf/buf0buf.cc:595
#1 0x000055cee672e0a4 in srv_undo_tablespace_open (create=create@entry=false, name=<optimized out>, name@entry=0x7fff9fd92ea0 ".//undo001", i=i@entry=0)
at /data/Server/10.6B/storage/innobase/srv/srv0start.cc:537
#2 0x000055cee6730322 in srv_all_undo_tablespaces_open (create_new_db=create_new_db@entry=false, n_undo=16) at /data/Server/10.6B/storage/innobase/srv/srv0start.cc:654
#3 0x000055cee6730c7e in srv_undo_tablespaces_init (create_new_db=create_new_db@entry=false) at /data/Server/10.6B/storage/innobase/srv/srv0start.cc:739
#4 0x000055cee5cf6f7f in xb_load_tablespaces () at /data/Server/10.6B/extra/mariabackup/xtrabackup.cc:4081
#5 0x000055cee5d030a7 in xtrabackup_backup_func () at /data/Server/10.6B/extra/mariabackup/xtrabackup.cc:4861
#6 0x000055cee5d03df5 in main_low (argv=0x55cee86d7650) at /data/Server/10.6B/extra/mariabackup/xtrabackup.cc:7156
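The two byte dumps in the session above can be decoded mechanically. The following is a minimal sketch, not the server's code: it assumes the full_crc32 page layout, in which the 8-byte LSN sits at byte offset 16 of the page, the low 32 bits of the LSN are repeated 8 bytes before the end of the page, and the last 4 bytes hold the stored checksum (consistent with r13d above). The names read_be32, lsn_stamps_differ and page_from_gdb_dump are hypothetical, and the page size is a parameter because the report does not state it.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Offsets assumed for this sketch (full_crc32 page layout).
constexpr std::size_t kPageLsnOffset = 16;  // 8-byte LSN in the page header
constexpr std::size_t kEndLsnFromEnd = 8;   // low 32 bits of the LSN, counted from the page end
constexpr std::size_t kChecksumFromEnd = 4; // stored checksum, counted from the page end

// Read a big-endian 32-bit integer.
inline std::uint32_t read_be32(const unsigned char* p) {
  return (std::uint32_t{p[0]} << 24) | (std::uint32_t{p[1]} << 16) |
         (std::uint32_t{p[2]} << 8) | std::uint32_t{p[3]};
}

// Hypothetical helper: true if the LSN stamps at the start and at the end of
// the page disagree, which suggests that the read raced with a page write.
inline bool lsn_stamps_differ(const unsigned char* page, std::size_t page_size) {
  assert(page_size >= kPageLsnOffset + 8 + kEndLsnFromEnd);
  const std::uint32_t head = read_be32(page + kPageLsnOffset + 4);
  const std::uint32_t tail = read_be32(page + page_size - kEndLsnFromEnd);
  return head != tail;
}

// Hypothetical helper: a zero-filled page carrying only the bytes that were
// printed in the gdb session ($1 and $2).
inline std::vector<unsigned char> page_from_gdb_dump(std::size_t page_size) {
  assert(page_size >= kPageLsnOffset + 8 + kEndLsnFromEnd);
  std::vector<unsigned char> page(page_size, 0);
  const unsigned char head[8] = {0x0, 0x0, 0x0, 0x0, 0x1, 0x29, 0xb3, 0x19};
  const unsigned char tail[8] = {0x1, 0x1, 0x3, 0x2c, 0xec, 0xe2, 0xf2, 0x86};
  std::memcpy(page.data() + kPageLsnOffset, head, sizeof head);
  std::memcpy(page.data() + page_size - kEndLsnFromEnd, tail, sizeof tail);
  return page;
}
```

Calling lsn_stamps_differ() on the buffer returned by page_from_gdb_dump() reports a mismatch. A comparison like this can only hint at a torn read; the checksum comparison in buf_page_is_corrupted() remains the actual check.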
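We do have retry logic for most other page reads; see the calls to buf_page_is_corrupted() in fil_cur.cc. For the TRX_SYS page, xb_assign_undo_space_start() has special handling of 5 reread attempts. The read of the first page of an undo tablespace in srv_undo_tablespace_open(), as seen in the stack trace above, evidently lacks such a retry.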
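The general shape of such retry logic can be sketched as follows. This is an illustration under stated assumptions, not the code in fil_cur.cc or the eventual fix: the callback types ReadPage and IsCorrupted, the ReadResult enumeration and the function read_page_with_retries are all hypothetical, and the caller chooses the number of attempts.

```cpp
#include <cassert>
#include <functional>
#include <vector>

using PageBuf = std::vector<unsigned char>;

// Hypothetical callbacks: one reads the page into the buffer (returning false
// on an I/O error), the other validates the checksum of the buffer contents.
using ReadPage = std::function<bool(PageBuf&)>;
using IsCorrupted = std::function<bool(const PageBuf&)>;

enum class ReadResult { Ok, IoError, StillCorrupted };

// Re-read the page until it validates or the attempts are exhausted.
// A page that fails validation may simply have been read while the server
// was writing it, so a failure is only reported after the last attempt.
inline ReadResult read_page_with_retries(const ReadPage& read_page,
                                         const IsCorrupted& is_corrupted,
                                         PageBuf& buf, int max_attempts) {
  assert(max_attempts > 0);
  for (int attempt = 0; attempt < max_attempts; ++attempt) {
    if (!read_page(buf)) {
      return ReadResult::IoError;  // a genuine I/O error is not retried here
    }
    if (!is_corrupted(buf)) {
      return ReadResult::Ok;       // a consistent copy was read
    }
    // Possibly a torn read; fall through and read the page again.
  }
  return ReadResult::StillCorrupted;
}
```

A real implementation would probably also pause briefly between attempts so that the concurrent write can complete; that is left out here to keep the sketch short.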
The current design requires re-reading pages in case they were concurrently written by the server that is being backed up. A better design would make the server itself responsible for making backups (MDEV-14992). But we need to fix this bug in GA releases, especially given that MDEV-29986 made multiple undo tablespaces the default.
relates to MDEV-14992 BACKUP: in-server backup (Open)
https://github.com/MariaDB/server/pull/3233