[MDEV-9501] rpl.rpl_binlog_index, rpl.rpl_gtid_crash, rpl.rpl_stm_multi_query fail sporadically in buildbot with Master command COM_REGISTER_SLAVE failed Created: 2016-02-01  Updated: 2022-06-09  Resolved: 2020-09-08

Status: Closed
Project: MariaDB Server
Component/s: Platform Debian, Tests
Affects Version/s: 5.5, 10.0, 10.1, 10.2, 10.5
Fix Version/s: 10.1.48, 10.2.35, 10.3.26, 10.4.16, 10.5.7

Type: Bug Priority: Major
Reporter: Elena Stepanova Assignee: Sujatha Sivakumar (Inactive)
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Blocks
blocks MDEV-7069 Fix buildbot failures in main server ... Stalled
Relates
relates to MDEV-12694 test failure: encryption.create_or_re... Closed
relates to MDEV-28788 rpl.rpl_checksum_cache sometimes fail... Open
Sprint: 5.5.59

 Description   

http://buildbot.askmonty.org/buildbot/builders/p8-rhel7-bintar/builds/1060/steps/test/logs/stdio

rpl.rpl_binlog_index 'row'               w4 [ fail ]  Found warnings/errors in server log file!
        Test ended at 2016-01-26 20:02:26
line
160126 20:02:17 [Warning] Slave I/O: Master command COM_REGISTER_SLAVE failed: failed registering on master, reconnecting to try again, log 'master-bin.000002' at position 549, Internal MariaDB error code: 1597
^ Found warnings in /home/buildbot/maria-slave/power8-vlp03-bintar/build/mysql-test/var/4/log/mysqld.2.err
ok



 Comments   
Comment by Otto Kekäläinen [ 2016-03-11 ]

This also happened in a Debian production build: https://buildd.debian.org/status/fetch.php?pkg=mariadb-10.0&arch=powerpc&ver=10.0.24-4&stamp=1457659375
Disabled for official Debian builds in: https://github.com/ottok/mariadb-10.0/commit/b1a48e7eba6681890b44756f6b6feb3ad0f3b62a

FYI also bar

Comment by Elena Stepanova [ 2017-05-08 ]

Recent occurrence on 10.2:
http://buildbot.askmonty.org/buildbot/builders/p8-rhel7-bintar-debug/builds/2435/steps/test/logs/stdio

Comment by Elena Stepanova [ 2017-05-09 ]

http://buildbot.askmonty.org/buildbot/builders/kvm-bintar-centos5-amd64/builds/5119/steps/test/logs/stdio

rpl.rpl_stm_multi_query 'mix'            w3 [ fail ]  Found warnings/errors in server log file!
        Test ended at 2017-05-05 09:35:43
line
2017-05-05  9:35:43 1331239232 [Warning] Slave I/O: Master command COM_REGISTER_SLAVE failed: failed registering on master, reconnecting to try again, log 'master-bin.000001' at position 4, Internal MariaDB error code: 1597
^ Found warnings in /usr/local/mariadb-10.2.6-linux-x86_64/mysql-test/var/3/log/mysqld.2.err

Comment by Elena Stepanova [ 2017-05-09 ]

rpl.rpl_temporal_mysql56_to_mariadb53 is also affected

Comment by Alice Sherepa [ 2017-09-08 ]

rpl.rpl_gtid_crash on 10.3 http://buildbot.askmonty.org/buildbot/builders/kvm-fulltest2/builds/9508/steps/test_3/logs/stdio

rpl.rpl_gtid_crash 'innodb,row'          w1 [ fail ]  Found warnings/errors in server log file!
        Test ended at 2017-09-08 03:53:39
line
2017-09-08  3:53:28 10 [ERROR] Slave I/O: Master command COM_REGISTER_SLAVE failed: Lost connection to MySQL server during query (Errno: 2013), Internal MariaDB error code: 1597
2017-09-08  3:53:28 10 [ERROR] Slave I/O thread couldn't register on master
2017-09-08  3:53:29 10 [Warning] Slave I/O: Master command COM_REGISTER_SLAVE failed: failed registering on master, reconnecting to try again, log 'master-bin.000006' at position 785; GTID position '1-1-2,0-1-9,2-1-1', Internal MariaDB error code: 1597
^ Found warnings in /mnt/buildbot/build/mariadb-10.3.2/mysql-test/var/1/log/mysqld.2.err

Comment by Alice Sherepa [ 2017-11-17 ]

on 5.5 http://buildbot.askmonty.org/buildbot/builders/kvm-rpm-centos74-amd64/builds/591/steps/mtr/logs/stdio

rpl.rpl_binlog_index 'row'               w2 [ fail ]  Found warnings/errors in server log file!
        Test ended at 2017-11-16 13:48:23
line
171116 13:48:08 [Warning] Slave I/O: Master command COM_REGISTER_SLAVE failed: failed registering on master, reconnecting to try again, log 'master-bin.000002' at position 457, Error_code: 1597
^ Found warnings in /dev/shm/var/2/log/mysqld.2.err

Comment by Marko Mäkelä [ 2020-06-10 ]

This affects 10.5 as well:

10.5 17a7bafec068d6436f3f6c5ca67b9d6c98b31ef5

rpl.rpl_binlog_index 'row'               w1 [ fail ]  Found warnings/errors in server log file!
        Test ended at 2020-06-10 08:46:08
line
2020-06-10  8:45:58 7 [Warning] Slave I/O: SET @master_heartbeat_period to master failed with error: Lost connection to server during query, Internal MariaDB error code: 2013
2020-06-10  8:45:59 7 [Warning] Slave I/O: Master command COM_REGISTER_SLAVE failed: failed registering on master, reconnecting to try again, log 'master-bin.000002' at position 633, Internal MariaDB error code: 1597

I did not see a sign of a crash in the server logs, but the log of the replica server contains some error messages about failing to connect to the master.

Comment by Sujatha Sivakumar (Inactive) [ 2020-07-02 ]

Hello Andrei,

Can you please review the fix for MDEV-9501.

Patch: https://github.com/MariaDB/server/commit/965663f090c4af83e8c4aca913568f6065509ff4

BuildBot Testing: http://buildbot.askmonty.org/buildbot/grid?category=main&branch=bb-10.1-sujatha

Thank you.

Comment by Sujatha Sivakumar (Inactive) [ 2020-09-08 ]

Merged patch to higher versions and tested the changes.

Minor merge conflicts were observed from 10.1 to 10.2.
The result files need to be re-recorded.

10.2: https://github.com/MariaDB/server/commit/37273575004598b459d7bbdd453ed3a28df66037

The patch remains the same for 10.3+ versions.
10.3: https://github.com/MariaDB/server/commit/5b2540397ca45a124027f0b276a6f7b00d0b80b8

Generated at Thu Feb 08 07:35:09 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.