Type:
Bug
Priority:
Critical
Resolution:
Fixed
Affects Version/s:
11.0.1, 11.0.2, 11.0.3, 11.0.4, 10.11
Environment:
bitnami/mariadb-galera:11.0.4-debian-11-r0 in k8s cluster
After upgrading from 10.6 -> 10.11 -> 11.0.3 and now to 11.0.4, two of the members start without any issues, but the third member (db-0) fails to start because of an old sequence number (117376) that I believe is being passed on from the donor.
We are unable to find the old seqno anywhere except in the `ibdata1` file of the donor (by searching for its hex representation), and we are not sure how to get rid of it.
We also tried starting db-0 with a fresh volume by removing its K8s PVC so that it performs a full SST, but that fails as well.
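For anyone who wants to repeat the `ibdata1` search, here is a minimal sketch of how it could be scripted. It is not the exact method we used: the 8-byte width and both byte orders are assumptions (the on-disk encoding of the wsrep seqno is not confirmed here), and `find_seqno` is a hypothetical helper name.

```python
import struct
from typing import Dict, List, Tuple


def seqno_patterns(seqno: int) -> Dict[str, bytes]:
    """Candidate 8-byte encodings of the seqno (width and byte order are assumptions)."""
    return {
        "big-endian": struct.pack(">Q", seqno),
        "little-endian": struct.pack("<Q", seqno),
    }


def find_seqno(path: str, seqno: int, chunk_size: int = 1 << 20) -> List[Tuple[str, int]]:
    """Scan a file in chunks and return (encoding, byte offset) for every match."""
    patterns = seqno_patterns(seqno)
    overlap = 7  # pattern length minus one, so matches spanning two chunks are seen once
    hits: List[Tuple[str, int]] = []
    with open(path, "rb") as f:
        offset = 0   # file position of the start of the current chunk
        tail = b""   # last bytes of the previous buffer
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            buf = tail + chunk
            base = offset - len(tail)  # file position of buf[0]
            for name, pat in patterns.items():
                start = 0
                while True:
                    idx = buf.find(pat, start)
                    if idx == -1:
                        break
                    hits.append((name, base + idx))
                    start = idx + 1
            tail = buf[-overlap:]
            offset += len(chunk)
    return hits
```

Calling `find_seqno("ibdata1", 117376)` on a copy of the donor's file would list candidate offsets. A hit only shows that the byte pattern occurs there, not that the location is the wsrep checkpoint.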
Logs from member db-0:
[Note] WSREP: SST received
[Note] WSREP: Server status change joiner -> initializing
[Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
[Note] mysqld: Aria engine: starting recovery
recovered pages: 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% (0.0 seconds); tables to flush: 2 1(0.0 seconds);
[Note] mysqld: Aria engine: recovery done
[Note] InnoDB: Compressed tables use zlib 1.2.11
[Note] InnoDB: Number of transaction pools: 1
[Note] InnoDB: Using crc32 + pclmulqdq instructions
[Note] InnoDB: Using Linux native AIO
[Note] InnoDB: Initializing buffer pool, total size = 2.000GiB, chunk size = 32.000MiB
[Note] InnoDB: Completed initialization of buffer pool
[Note] InnoDB: File system buffers for log disabled (block size=512 bytes)
[Note] InnoDB: End of log at LSN=180964319
[Note] InnoDB: Resizing redo log from 12.016KiB to 96.000MiB; LSN=180964319
[Note] InnoDB: File system buffers for log disabled (block size=512 bytes)
[Note] InnoDB: Reinitializing innodb_undo_tablespaces= 3 from 0
[Note] InnoDB: Data file .//undo001 did not exist: new to be created
[Note] InnoDB: Setting file .//undo001 size to 10.000MiB
[Note] InnoDB: Database physically writes the file full: wait...
[Note] InnoDB: Data file .//undo002 did not exist: new to be created
[Note] InnoDB: Setting file .//undo002 size to 10.000MiB
[Note] InnoDB: Database physically writes the file full: wait...
[Note] InnoDB: Data file .//undo003 did not exist: new to be created
[Note] InnoDB: Setting file .//undo003 size to 10.000MiB
[Note] InnoDB: Database physically writes the file full: wait...
[Note] InnoDB: 128 rollback segments in 3 undo tablespaces are active.
[Note] InnoDB: Setting file './ibtmp1' size to 12.000MiB. Physically writing the file full; Please wait ...
[Note] InnoDB: File './ibtmp1' size is now 12.000MiB.
[Note] InnoDB: log sequence number 180964319; transaction id 73558
[Note] InnoDB: Loading buffer pool(s) from /bitnami/mariadb/data/ib_buffer_pool
[Note] InnoDB: Cannot open '/bitnami/mariadb/data/ib_buffer_pool' for reading: No such file or directory
[Note] Plugin 'FEEDBACK' is disabled.
[Warning] 'innodb-log-files-in-group' was removed. It does nothing now and exists only for compatibility with old my.cnf files.
[Warning] 'innodb-file-format' was removed. It does nothing now and exists only for compatibility with old my.cnf files.
[Note] Recovering after a crash using mysql-bin
[Note] Starting table crash recovery...
[Note] Crash table recovery finished.
[Note] Server socket created on IP: '0.0.0.0' .
[Warning] 'proxies_priv' entry '@% root@db-0' ignored in --skip-name-resolve mode.
[Note] WSREP: wsrep_init_schema_and_SR (nil)
[Note] WSREP: Server initialized
[Note] WSREP: Server status change initializing -> initialized
[Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
[Note] WSREP: Recovered position from storage: 6aa53efc-db72-11ec-880f-a282ce494905:117376
[Note] WSREP: Starting applier thread 6
[Note] WSREP: Starting applier thread 7
[Note] WSREP: Starting applier thread 8
[Note] WSREP: Recovered view from SST:
id : 6aa53efc-db72-11ec-880f-a282ce494905:128934
status: primary
protocol_version: 4
capabilities: MULTI-MASTER, CERTIFICATION, PARALLEL_APPLYING, REPLAY, ISOLATION, PAUSE, CAUSAL_READ, INCREMENTAL_WS, UNORDERED, PREORDERED, STREAMING, NBO
final: no
own_index: 0
members(3):
0: 46250027-95c8-11ee-9d94-06eb82413a07, db
1: 4ed771b5-941f-11ee-82bf-56ec952abcf6, db
2: bfa6e688-941f-11ee-97c5-d323f02068c1, db
[ERROR] WSREP: sst_received failed: SST script passed bogus GTID: 6aa53efc-db72-11ec-880f-a282ce494905:117376. Preceding view GTID: 6aa53efc-db72-11ec-880f-a282ce494905:128934
[Note] WSREP: SST received: 00000000-0000-0000-0000-000000000000:-1
[Note] WSREP: Joiner monitor thread ended with total time 14 sec
[ERROR] WSREP: Application received wrong state:
Received: 00000000-0000-0000-0000-000000000000
Required: 6aa53efc-db72-11ec-880f-a282ce494905
[ERROR] WSREP: Application state transfer failed. This is unrecoverable condition, restart required.
[Note] WSREP: ReplicatorSMM::abort()
[Note] WSREP: Closing send monitor...
[Note] WSREP: Closed send monitor.
[Note] WSREP: gcomm: terminating thread
[Note] WSREP: gcomm: joining thread
[Note] WSREP: gcomm: closing backend
[Note] /opt/bitnami/mariadb/sbin/mysqld : ready for connections.
Version: '11.0.3-MariaDB-log' socket: '/opt/bitnami/mariadb/tmp/mysql.sock' port: 3306 Source distribution
[Note] WSREP: view(view_id(NON_PRIM,46250027-9d94,6636) memb {
46250027-9d94,0
} joined {
} left {
} partitioned {
4ed771b5-82bf,0
bfa6e688-97c5,0
})
[Note] WSREP: PC protocol downgrade 1 -> 0
[Note] WSREP: view((empty))
[Note] WSREP: gcomm: closed
[Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
[Note] WSREP: Flow-control interval: [128, 160]
[Note] WSREP: Received NON-PRIMARY.
[Note] WSREP: Shifting JOINER -> OPEN (TO: 128934)
[Note] WSREP: New SELF-LEAVE.
[Note] WSREP: Flow-control interval: [0, 0]
[Note] WSREP: Received SELF-LEAVE. Closing connection.
[Note] WSREP: Shifting OPEN -> CLOSED (TO: 128934)
[Note] WSREP: RECV thread exiting 0: Success
[Note] WSREP: recv_thread() joined.
[Note] WSREP: Closing replication queue.
[Note] WSREP: Closing slave action queue.
[Note] WSREP: /opt/bitnami/mariadb/sbin/mysqld : Terminated.
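The key line in the log above is the `SST script passed bogus GTID` error: the GTID recovered from storage (seqno 117376) carries the same cluster UUID as the view received with the SST, but its seqno is lower than the view's (128934). The following is a small sketch for pulling both GTIDs out of such a line and comparing them. The names `parse_gtids` and `compare_sst_gtid` are hypothetical, and the comparison reflects only what the error message suggests, not the actual server-side check.

```python
import re
from typing import List, NamedTuple


class Gtid(NamedTuple):
    uuid: str
    seqno: int


class SstCheck(NamedTuple):
    passed: Gtid        # GTID reported by the SST script
    view: Gtid          # GTID of the preceding view
    same_cluster: bool  # True when both UUIDs match
    behind_by: int      # view seqno minus passed seqno


# UUID followed by ':' and a (possibly negative) sequence number.
GTID_RE = re.compile(r"([0-9a-f]{8}(?:-[0-9a-f]{4}){3}-[0-9a-f]{12}):(-?\d+)")


def parse_gtids(line: str) -> List[Gtid]:
    """Return every uuid:seqno pair found in a log line, in order of appearance."""
    return [Gtid(u, int(s)) for u, s in GTID_RE.findall(line)]


def compare_sst_gtid(line: str) -> SstCheck:
    """Compare the first two GTIDs in a 'bogus GTID' error line."""
    gtids = parse_gtids(line)
    if len(gtids) < 2:
        raise ValueError("expected two GTIDs in the log line")
    passed, view = gtids[0], gtids[1]
    return SstCheck(passed, view, passed.uuid == view.uuid, view.seqno - passed.seqno)
```

Applied to the error line from db-0, this reports matching UUIDs and a passed seqno that is 11558 behind the view, which is consistent with a stale position being read from storage after the SST.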
marko, you are correct that the test does not pass with those settings. However, it does not reproduce the original problem, i.e. I can't find an error message similar to "[ERROR] WSREP: sst_received failed: SST script passed bogus GTID: 6aa53efc-db72-11ec-880f-a282ce494905:117376. Preceding view GTID: 6aa53efc-db72-11ec-880f-a282ce494905:128934". With manual testing I could reproduce this error. In the last test, did you verify that InnoDB really did create 3 tablespaces?