[MDEV-24696] Cluster fails: Downstream hosts crashing, leaving the surviving host in a non-Primary cluster Created: 2021-01-26  Updated: 2021-12-23  Resolved: 2021-12-23

Status: Closed
Project: MariaDB Server
Component/s: Galera
Affects Version/s: 10.4.17
Fix Version/s: N/A

Type: Bug Priority: Major
Reporter: Kent Hoover Assignee: Jan Lindström (Inactive)
Resolution: Duplicate Votes: 0
Labels: None
Environment:

CentOS release 6.10 (Final)
MariaDB-server-10.4.17-1.el6.x86_64
galera-4-26.4.6-1.el6.x86_64


Attachments: Microsoft Word mysql-error.log.20210123.rtf     Microsoft Word survivor-mysql-error.rtf    

 Description   

Our application deletes old records from 2 tables, which have a foreign key shared between them. The queries apparently run in succession on our primary host, but deadlock on the down stream servers. The downstreams crash, leave the cluster such that the remaining host starts looping, trying to reconnect. The survivor downgrades to non-Cluster, so it is useless to our applications until it is rebooted. Restarting the downstreams is arduous – SST fails, and have to be recovered manually (only to fail again at 02:00 the next night, when the deletes are repeated).

I've attached the mysql-error.log from one of the downstream hosts, which reflects what happens on each of them, and survivor-mysql-error , which shows how our primary host logs the events.



 Comments   
Comment by Gabor Orosz [ 2021-02-17 ]

This seems to be a duplicate of MDEV-23851

Generated at Thu Feb 08 09:31:58 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.