[MDEV-28423] IST is failing on Joiner node when active data load on donor node Created: 2022-04-27  Updated: 2022-05-24  Resolved: 2022-05-23

Status: Closed
Project: MariaDB Server
Component/s: Galera
Affects Version/s: 10.9
Fix Version/s: 10.3.35, 10.4.25, 10.5.16, 10.6.8, 10.7.4, 10.8.3, 10.9.1

Type: Bug Priority: Critical
Reporter: Ramesh Sivaraman Assignee: Julius Goryavsky
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Problem/Incident
causes MDEV-28656 Inability to roll upgrade without sto... Closed
is caused by MDEV-26971 JSON file interface to wsrep node sta... Closed
Relates
relates to MDEV-28583 Galera: binlogs disappear after rsync... Closed

 Description   

Error log

2022-04-27 12:54:05 0 [Note] WSREP: 1.0 (ramesh): State transfer to 0.0 (ramesh) complete.
/test/GAL_MD260422-mariadb-10.9.0-linux-x86_64-opt//bin/wsrep_sst_mariabackup: line 829: [: -ge: unary operator expected
2022-04-27 12:54:05 0 [Note] WSREP: Member 1.0 (ramesh) synced with group.
WSREP_SST: [INFO] 'xtrabackup_ist' received from donor: Running IST (20220427 12:54:05.657)
 
[..]
 
Version: '10.9.0-MariaDB-log'  socket: '/test/GAL_MD260422-mariadb-10.9.0-linux-x86_64-opt/node2/node2_socket.sock'  port: 12102  MariaDB Server
2022-04-27 12:54:06 2 [Note] WSREP: Receiving IST: 4433 writesets, seqnos 1473-5905
2022-04-27 12:54:06 0 [Note] WSREP: ####### IST applying starts with 1473
2022-04-27 12:54:06 0 [Note] WSREP: ####### IST current seqno initialized to 1443
2022-04-27 12:54:06 0 [Note] WSREP: Receiving IST...  0.0% (   0/4463 events) complete.
2022-04-27 12:54:06 0 [Note] WSREP: IST preload starting at 1443
2022-04-27 12:54:06 0 [Note] WSREP: Service thread queue flushed.
2022-04-27 12:54:06 0 [Note] WSREP: ####### Assign initial position for certification: 00000000-0000-0000-0000-000000000000:1442, protocol version: 5
2022-04-27 12:54:06 0 [ERROR] WSREP: got exception while reading IST stream: error receiving trx header: 71 (Protocol error)
	 at /test/10.5_galera_opt/galera/src/ist_proto.hpp:recv_ordered():580
2022-04-27 12:54:06 0 [Note] WSREP: Receiving IST...  0.0% (   2/4463 events) complete.
2022-04-27 12:54:06 0 [ERROR] WSREP: IST didn't contain all write sets, expected last: 5905 last received: 1445
2022-04-27 12:54:06 6 [ERROR] WSREP: Receiving IST failed, node restart required: IST receiver reported failure: 71 (Protocol error)
	 at /test/10.5_galera_opt/galera/src/replicator_smm.hpp:pop_front():336. Null event.
2022-04-27 12:54:06 6 [Note] WSREP: Closing send monitor...
2022-04-27 12:54:06 6 [Note] WSREP: Closed send monitor.
2022-04-27 12:54:06 6 [Note] WSREP: gcomm: terminating thread



 Comments   
Comment by Jan Lindström (Inactive) [ 2022-05-05 ]

sysprg ok to push after you have checked bb and we have tested this.

Comment by Julius Goryavsky [ 2022-05-23 ]

Fixed, https://github.com/MariaDB/server/commit/b081ad8c65d3a94210841477cb5f0683ce64a7e3

Generated at Thu Feb 08 10:00:38 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.