[MDEV-29934] rpl.rpl_start_alter_chain_basic, rpl.rpl_start_alter_restart_slave sometimes fail in BB with result content mismatch Created: 2022-11-02 Updated: 2024-01-30 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | Replication, Tests |
| Affects Version/s: | 10.8, 10.9, 10.10, 10.11, 11.0 |
| Fix Version/s: | 10.11, 11.0 |
| Type: | Bug | Priority: | Major |
| Reporter: | Angelique Sklavounos (Inactive) | Assignee: | Andrei Elkin |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Attachments: |
| Issue Links: |
| Description |
rpl.rpl_start_alter_chain_basic
https://buildbot.mariadb.org/#/builders/203/builds/11894
Seems to happen only on amd64 platforms. Could not reproduce locally with test repeats.
rpl.rpl_start_alter_restart_slave
The following output occurs on amd64 platforms.
|
| Comments |
| Comment by Angelique Sklavounos (Inactive) [ 2022-12-07 ] |
|
Error logs and var directory for https://buildbot.mariadb.org/#/builders/172/builds/9990 attached. |
| Comment by Angelique Sklavounos (Inactive) [ 2023-01-25 ] |
|
For rpl.rpl_start_alter_chain_basic, all servers should be synced by include/rpl_sync.inc, as the code below (10.8 88c35781) shows. The mismatch seems to occur only on server_3, which has parallel threads and gtid_strict_mode=1.
For rpl.rpl_start_alter_restart_slave, the slave (which, like server_3 in rpl_start_alter_chain_basic, also has parallel threads and gtid_strict_mode=1) should likewise be synced by sync_slave_with_master right before the mismatched select domain_id, seq_no…:
With this mismatch, gtid_slave_pos is 0-1-6 but gtid_binlog_state is 0-1-7. I wonder whether using the macro sync_with_master_gtid.inc would be more suitable. I added a check of gtid_binlog_state for server_3 to rpl.rpl_start_alter_chain_basic, and a call to sync_with_master_gtid.inc to rpl.rpl_start_alter_restart_slave, in this commit: https://github.com/MariaDB/server/commit/943989c9ef9b7d01333ad14a09547585666c9eeb (I incorrectly put 29943 as the issue number there). |
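To make the reported mismatch concrete, here is a minimal sketch of how the two values from the comment above compare. It is not part of the test suite. The helper names parse_gtid_list and lagging_domains are hypothetical, and the sketch assumes the usual domain_id-server_id-seq_no GTID text form, keeping only the highest seq_no per domain.

```python
def parse_gtid_list(gtid_list):
    """Map domain_id -> highest seq_no in a comma-separated GTID list.

    Assumes each GTID has the form domain_id-server_id-seq_no.
    """
    highest = {}
    for gtid in filter(None, (g.strip() for g in gtid_list.split(","))):
        domain_id, _server_id, seq_no = (int(part) for part in gtid.split("-"))
        highest[domain_id] = max(seq_no, highest.get(domain_id, 0))
    return highest


def lagging_domains(gtid_slave_pos, gtid_binlog_state):
    """Return {domain_id: (slave_seq_no, binlog_seq_no)} for every domain
    where the slave position is behind the binlog state (hypothetical helper)."""
    slave = parse_gtid_list(gtid_slave_pos)
    binlog = parse_gtid_list(gtid_binlog_state)
    return {
        domain: (slave.get(domain, 0), seq_no)
        for domain, seq_no in binlog.items()
        if slave.get(domain, 0) < seq_no
    }


# Values quoted in the comment: gtid_slave_pos 0-1-6, gtid_binlog_state 0-1-7.
print(lagging_domains("0-1-6", "0-1-7"))  # {0: (6, 7)}
```

With the values from the failing run, domain 0 is reported as one event behind (6 versus 7). That is the kind of gap a GTID-based wait is meant to close before the select runs.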