[MDEV-27777] Some Galera tests fail on FreeBSD with "Could not find target log" Created: 2022-02-08 Updated: 2022-03-22 Resolved: 2022-02-22 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera SST, Platform FreeBSD, Tests, wsrep |
| Affects Version/s: | 10.5 |
| Fix Version/s: | 10.9.0, 10.2.44, 10.3.35, 10.4.25, 10.5.16, 10.6.8, 10.7.4, 10.8.3 |
| Type: | Bug | Priority: | Major |
| Reporter: | Elena Stepanova | Assignee: | Julius Goryavsky |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||||||||||||||||||||||
| Description |
|
The output below is from a 10.5-based branch. I didn't check if it's reproducible on other versions.
Indeed,
There is the file for the 1st node:
From the error log on the 2nd node:
|
| Comments |
| Comment by Elena Stepanova [ 2022-02-08 ] | ||||||||||||||||||||||||
|
The problem doesn't seem to be caused by SST transferring wrong logs. At the very beginning of the test it does RESET MASTER on both nodes.
Adding a sleep after RESET MASTER doesn't help, so it's not a race condition (or if it is, it's longer than 10 seconds which I tried). But at the same time RESET MASTER on the 1st node seems to work as expected. | ||||||||||||||||||||||||
| Comment by Elena Stepanova [ 2022-02-08 ] | ||||||||||||||||||||||||
|
Further experiments show that RESET does work, but it resets to 000003 instead of the expected 000001:
| ||||||||||||||||||||||||
| Comment by Andrei Elkin [ 2022-02-08 ] | ||||||||||||||||||||||||
|
This is a wsrep case. The failure to reset is caused by a left-over 'mysqld-bin.000002' that survived
the block's RESET MASTER. SET GLOBAL wsrep_on=0 must be doing something to confuse the following RM. | ||||||||||||||||||||||||
| Comment by Jan Lindström (Inactive) [ 2022-02-09 ] | ||||||||||||||||||||||||
|
I can't repeat this on my laptop using Ubuntu
| ||||||||||||||||||||||||
| Comment by Jan Lindström (Inactive) [ 2022-02-10 ] | ||||||||||||||||||||||||
|
It is now clear that this is because wsrep_sst_rsync script. Binlog files are incorrectly transfered to node_2
Binlog file mysqld-bin.000002 is transferred to joiner (node_2) but mysqld-bin.index has incorrect contents. RESET MASTER works as designed and finds 02 and creates next available i.e. 03. | ||||||||||||||||||||||||
| Comment by Julius Goryavsky [ 2022-02-18 ] | ||||||||||||||||||||||||
|
jplindst The content of the changes regarding binlogs is common to all systems and has already been fixed in | ||||||||||||||||||||||||
| Comment by Julius Goryavsky [ 2022-02-22 ] | ||||||||||||||||||||||||
|
Fixed, https://github.com/MariaDB/server/commit/fac9224d85e879ff06e549c18446375d850ef83a |