[MDEV-27410] Galera cluster hangs after one node reboots Created: 2022-01-03 Updated: 2022-01-17 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | None |
| Affects Version/s: | 10.5.12 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | PhilJing | Assignee: | Unassigned |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Environment: |
centos7.5 |
||
| Description |
|
Happy new year! We have a gelera cluster with 3 nodes in production, and we upgraded to 10.5.12 recently from 10.3.10. The galera cluster would hang (can only select and unable to do update or delete) after one node reboots occationally. To recover the cluster, it would work to restart the mariadb instance which has differrent wsrep_last_commited or reboot the cluster with --wsrep-new-cluster. The possibility would be much higher when one node is poweroff, and reboot another node. |
| Comments |
| Comment by PhilJing [ 2022-01-04 ] | |||
|
update:
I don't know if it is related. | |||
| Comment by PhilJing [ 2022-01-10 ] | |||
|
Recently I did a lot of test, I would post any information that migh be useful | |||
| Comment by PhilJing [ 2022-01-17 ] | |||
|
Found that some DDL (truncate) would run periodically every 1 hour, dont know if it is realted. More tests needed... |