Details
Description
Hi,
Our Galera cluster architecture is 2 DB nodes + 1 witness node. Most DB traffic is redirected to one DB node.
One of our Galera cluster encounter one node down (wsrep_ready=OFF). We restarted the DB node but IST encountered the same error. We need to remove data directory and let Galera to trigger SST in order to resume the DB node.
DB version is 10.6.10 and Galera version is 26.4.12
We have another similar incident MDEV-30303
2023-01-06 22:04:55 2 [Note] WSREP: MDL BF-BF conflict |
schema: tswtrn1
|
request: (2 seqno 31040753 wsrep (high priority, exec, executing) cmd 0 161 select nextval(`SEQUENCE_LPCO_ID`)<87>*<B8>c^S^A) |
granted: (6 seqno 31040752 wsrep (high priority, exec, preparing) cmd 0 161 (null)) |
2023-01-06 22:04:55 2 [ERROR] Aborting |
Attachments
Issue Links
- relates to
-
MDEV-30303 run optimize table got [Note] WSREP: MDL BF-BF conflict and [ERROR] Aborting
-
- Closed
-
frelist I would need more information about customer workload because I could not reproduce issue easily. Firstly, can you provide full unedited error log from all nodes, node configuration and show create sequence `SEQUENCE_LPCO_ID`; Here we need to remember that SELECT NEXT VALUE is basically write to sequence table and that could cause MDL conflict. However, at the moment it is not clear what the conflicting SQL-clause was.