[MDEV-29444] WSREP donor crashes after changing status from donor to joined Created: 2022-09-02 Updated: 2023-10-25 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | wsrep |
| Affects Version/s: | 10.5.9, 10.5.16 |
| Fix Version/s: | 10.5 |
| Type: | Bug | Priority: | Critical |
| Reporter: | cc lin | Assignee: | Alexey |
| Resolution: | Unresolved | Votes: | 2 |
| Labels: | crash, galera | ||
| Environment: |
OS: Debian GNU/Linux 10 (buster) |
||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Description |
|
It does not always happen. When it happens, we can observe logs like below: [Note] WSREP: Server status change donor -> joined [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. [ERROR] WSREP: Certification exception: Attempt to match against an empty key (1,0): 22 (Invalid argument) at / bitnami/blacksmith-sandbox/libgalera-26.4.7/galera/src/key_set.cpp:throw_match_empty_key():174 [Note] WSREP: ReplicatorSMM::abort() [Note] WSREP: Closing send monitor... [Note] WSREP: Closed send monitor. [Note] WSREP: gcomm: terminating thread [Note] WSREP: gcomm: joining thread [Note] WSREP: gcomm: closing backend ... [Note] WSREP: New SELF-LEAVE. [Note] WSREP: Flow-control interval: [0, 0] [Note] WSREP: Received SELF-LEAVE. Closing connection. [Note] WSREP: Shifting OPEN -> CLOSED (TO: 1585314378) [Note] WSREP: RECV thread exiting 0: Success ... [ Note] WSREP: recv_thread() joined. [ Note] WSREP: Closing replication queue. [ Note] WSREP: Closing slave action queue. [Note] WSREP: /opt/bitnami/mairadb/sbin/mysqld: Terminated. ... |
| Comments |
| Comment by cc lin [ 2022-09-06 ] | |||||||||||||||||||||||||
|
Timeline of key events in the attached log file 15:46:00 mariadb cluster member "conductor-mariadb-fz1-0" was starting mariadb 15:46:05 WSREP: Member 2.0 (conductor-mariadb-fz1-0) requested state transfer from 'any'. Selected 0.0 (conductor-mariadb-fz1-2)(SYNCED) as donor. 15:48:52 conductor-mariadb-fz1-2 [Note] WSREP: resume 15:48:53 conductor-mariadb-fz1-2 [Note] WSREP: New SELF-LEAVE. | |||||||||||||||||||||||||
| Comment by cc lin [ 2022-09-12 ] | |||||||||||||||||||||||||
|
fz1-0 Core Dump Thread pointer: 0x0 | |||||||||||||||||||||||||
| Comment by Daniel Black [ 2022-09-12 ] | |||||||||||||||||||||||||
|
On the bitnami side I'm stuck on debug symbol resolution until bitnami provide debug symbols information. | |||||||||||||||||||||||||
| Comment by Brad [ 2022-09-30 ] | |||||||||||||||||||||||||
|
We met the issue again on 2022/9/30. It is critical for our system. Could Daniel help to clarify? Thanks. | |||||||||||||||||||||||||
| Comment by Brad [ 2022-09-30 ] | |||||||||||||||||||||||||
|
How could we do for bitnami "debug symbols information"? | |||||||||||||||||||||||||
| Comment by Brad [ 2022-10-01 ] | |||||||||||||||||||||||||
|
Our critical systems suffered this issue. Please help. | |||||||||||||||||||||||||
| Comment by Brad [ 2022-10-03 ] | |||||||||||||||||||||||||
|
Your help will be highly appreciated. Thanks. | |||||||||||||||||||||||||
| Comment by Brad [ 2022-10-04 ] | |||||||||||||||||||||||||
|
Does the issue impact SST only? Or SST & IST, both? | |||||||||||||||||||||||||
| Comment by Daniel Black [ 2022-10-18 ] | |||||||||||||||||||||||||
|
Sorry for the delay, I was on leave and hoping for a Bitnami debug info. As an alternative, the Docker Library ubuntu focal seems compatible with the Debian 11 base of Bitnami, so build the following:
Building and running these will have a full gdb backtrace when the container crashes in the logs. | |||||||||||||||||||||||||
| Comment by Brad [ 2022-10-21 ] | |||||||||||||||||||||||||
|
Thanks for your reply. We will follow and attempt to re-produce the issue. Thanks. | |||||||||||||||||||||||||
| Comment by Jan Lindström (Inactive) [ 2023-01-02 ] | |||||||||||||||||||||||||
|
cclin Can you provide us full error log and proper stack trace as instructed. | |||||||||||||||||||||||||
| Comment by Seppo Jaakola [ 2023-06-13 ] | |||||||||||||||||||||||||
|
Could be due to gcache corruption, assigning to Yurchenko for analysis |