[MXS-1651] failed master with old external master connection tries rejoin causes master to be lost Created: 2018-02-06  Updated: 2018-08-17  Resolved: 2018-08-17

Status: Closed
Project: MariaDB MaxScale
Component/s: mariadbmon
Affects Version/s: None
Fix Version/s: 2.2.13

Type: Bug Priority: Major
Reporter: Dipti Joshi (Inactive) Assignee: Esa Korhonen
Resolution: Fixed Votes: 0
Labels: None


 Description   

ignore_external_master is ON, and a failed master with old external connection tries rejoin, which causes master to be lost

2/5 19:36:05 on app-1 (10.16.1.18): systemctl stop mariadb

App-0 (10.16.1.17) is promoted to master
Fixed replication 10.16.1.10 <-> 10.16.1.17
2/5 19:37:40 on app-1 (10.16.1.18): systemctl start mariadb

App-1 is re-joined to cluster as Slave
Success

2/5 19:42:38 on app-0 (10.16.1.17): systemctl stop mariadb

App-1 (10.16.1.18) is promoted to master
Fixed replication 10.16.1.10 <-> 10.16.1.18
2/5 19:44:38 on app-0 (10.16.1.17): systemctl start mariadb

Maxscale loses track of 10.16.1.18 as master
FAILURE – maxscale has no master

2/5 19:48:37 on app-0: echo “STOP SLAVE;” | mysql –-password=*****

Maxscale fixes itself and 10.16.1.18 is again master
App-0 is “Running”
2/5 19:49:27: on app-0: echo “START SLAVE;” | mysql –-password=*****

App-0 is “Slave, Running”



 Comments   
Comment by Esa Korhonen [ 2018-08-17 ]

Haven't heard of this in a long time. I assume that recent fixes to the monitor should have fixed this aswell. Please reopen if that is not the case.

Generated at Thu Feb 08 04:08:23 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.