[MDEV-29950] one galera node got hardware issue but caused other 2 nodes split brain - Jira

XML

Word

Printable

Details

Type: Bug
Status: Open (View Workflow)
Priority: Major
Resolution: Unresolved
Affects Version/s: 10.6.7
Fix Version/s: None
Component/s: Galera
Labels:
None
Environment:
redhat x86-64 on vmware

Description

our galera cluster is 3 nodes configration (2 db nodes + 1 arbitrator). 2 days ago, one db node is down due to hardware issue. The remaining db node and arbitrator got split brain and db service down.

Checked from log, remaining nodes do not have message of each other until the dead node is confirmed down. There is around 10s time. We don't know why the good nodes do not declare each other stable in this 10s.

Kindly advise the directory to troubleshoot the problem.

Only 2 galera timeout are set while other timeout settings are still default values.

gmcast.peer_timeout=PT10S;
evs.suspect_timeout=PT12S;

DB configuration file and error log of each node are attached

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

p1vdbs-wfsesub-esptdb1-1-01.log
2022-11-05 15:53
33 kB
William Wong
p2vdbs-wfsesub-esptdb1-2-01.log
2022-11-05 15:53
45 kB
William Wong
galera-garbd-wfsesub_esptdb1.log
2022-11-05 15:54
21 kB
William Wong
p1vdbs-wfsesub-esptdb1-1-01.mariadb.cnf
2022-11-05 15:54
7 kB
William Wong
p2vdbs-wfsesub-esptdb1-2-01.mariadb.cnf
2022-11-05 15:54
7 kB
William Wong
galera-garbd-wfsesub_esptdb1.cnf
2022-11-05 15:54
0.6 kB
William Wong

Activity

People

Assignee:: Unassigned

Reporter:: William Wong

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 2022-11-05 15:54

Updated:: 2022-11-05 15:54

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.