[MDEV-16011] Galera fails to recover frequently when deployed in kubernetes container environment Created: 2018-04-24  Updated: 2018-07-02  Resolved: 2018-07-02

Status: Closed
Project: MariaDB Server
Component/s: Galera
Affects Version/s: None
Fix Version/s: N/A

Type: Bug Priority: Major
Reporter: Richard Lane Assignee: Unassigned
Resolution: Incomplete Votes: 0
Labels: need_feedback
Environment:

RHEL 7, kubernetes environment using Helm



 Description   

When deploying a galera cluster in a kubernetes container environment (via helm), the likelyhood of an entire cluster failure increases significantly. The ability to manually resolve issues where galera fails to come up due to possible loss of transactions is limited.

Need ability (via option?) for galera to automatically attempt to come up even in the case where data loss could occur. The conditions that I have seen so far are:

1. grastate.dat safe_to_bootstrap is 0 (which we have already automatically set to 10
2. Galera fails to come up and requires mysqld --user=mysql --tc-heuristic-recover rollback to be run to come up

In kubernetes container environment things need to be more automatic



 Comments   
Comment by Elena Stepanova [ 2018-05-31 ]

If you want to submit a feature request, please describe more precisely what you would want to have.

Generated at Thu Feb 08 08:25:41 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.