[MDEV-30214] rpl.rpl_manual_change_index_file occasionally fails in BB with Result length mismatch Created: 2022-12-12 Updated: 2023-06-07 Resolved: 2023-05-26 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Replication, Tests |
| Affects Version/s: | 10.10, 10.11, 11.0 |
| Fix Version/s: | 10.4.31, 10.5.22, 10.6.15, 10.9.8, 10.10.6, 10.11.5, 11.0.3, 11.1.2 |
| Type: | Bug | Priority: | Major |
| Reporter: | Angelique Sklavounos (Inactive) | Assignee: | Angelique Sklavounos (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Attachments: |
|
| Description |
|
https://buildbot.mariadb.org/#/builders/208/builds/9538
Error logs and var dir attached (mysqld.x.err.30, var_30214.tar.gz). |
| Comments |
| Comment by Angelique Sklavounos (Inactive) [ 2023-05-02 ] | |||||||||||||||||||||
|
The test appears to always be retry-pass. The result file expects the error message to contain ‘master-bin.000001’, but instead the error log shows the IO thread with an empty string or 'FIRST' for the log file:
This failure was first seen with: https://buildbot.mariadb.org/#/builders/121/builds/6504 and is only seen on 10.10 and higher. The error log for the failure shows (attached mysqld.2.err.30) that the CHANGE MASTER statement at the start of the test does not contain the filename for the New state:
before this, when the slave is stopped so CM can be changed to USING_GTID=NO, the IO thread knows of the master log file but not the SQL thread (let’s call this Case A):
while for the passing test in another build (https://buildbot.mariadb.org/#/builders/208) the error log shows that the CHANGE MASTER statement contains the master_log_file filename:
and both IO and SQL threads know of the log file:
Notably, the error log for the failure does NOT show the message “Error reading relay log event: slave SQL thread was killed” Elkin I am unsure if there is a timing bug with Case A to investigate. Otherwise, the expected error message can simply be made more general so that it would accept either ‘.’ or ‘master-bin.000001’, or the "USING_GTID=NO" could be removed as the 1236 error is still achievable when the index file is manually changed. | |||||||||||||||||||||
| Comment by Andrei Elkin [ 2023-05-03 ] | |||||||||||||||||||||
|
angelique.sklavounos, I agree with making 'the expected error message ... more general'. After all the test verifies what belongs to If you have it reproducible with rr we could gain some better understanding. But no need to persevere. |