[MDEV-17243] Galera Server crashes with "WSREP: FSM: no such a transition ABORTING -> REPLICATING" on loading data Created: 2018-09-19 Updated: 2021-03-10 Resolved: 2019-03-18 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera |
| Affects Version/s: | 10.1, 10.2.14 |
| Fix Version/s: | 10.2.23, 10.3.14, 10.4.4 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Zdravelina Sokolovska (Inactive) | Assignee: | Jan Lindström (Inactive) |
| Resolution: | Fixed | Votes: | 1 |
| Labels: | None | ||
| Environment: |
3x Master-Master Servers ;OS Fedora 27 |
||
| Issue Links: |
|
||||||||
| Description |
|
Galera Server crashes with "WSREP: FSM: no such a transition ABORTING -> REPLICATING" on the crash occurred on concurrent loading of several tables after interrupting the previous session, dropping database ,recreate schema and restarting load 2018-09-19 15:27:45 139794208356096 [ERROR] WSREP: FSM: no such a transition ABORTING -> REPLICATING To report this bug, see https://mariadb.com/kb/en/reporting-bugs We will try our best to scrape up some info that will hopefully help Server version: 10.2.14-MariaDB Thread pointer: 0x7f23d40008d8 Trying to get some variables. Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=on,exists_to_in=on,orderby_uses_equalities=on,condition_pushdown_for_derived=on The manual page at http://dev.mysql.com/doc/mysql/en/crashing.html contains |
| Comments |
| Comment by Jan Lindström (Inactive) [ 2018-09-24 ] |
|
Looks in surface similar to |
| Comment by Julius Goryavsky [ 2018-10-16 ] |
|
https://github.com/MariaDB/galera/pull/3 TrxMap structure doesn't take into consideration presence of two trx This eventually causes same trx object to get shared among two This problem could be solved by taking into consideration conn-id, |
| Comment by Jan Lindström (Inactive) [ 2019-01-17 ] |
|
bar Please review the latest version or if you already did please mark both PR and this accordingly. |
| Comment by Alexander Barkov [ 2019-01-21 ] |
|
jplindst, sorry I can't review this change. I suggest to ask someone more familiar with this code. Perhaps Sergey Vojtovich could review. |
| Comment by Jan Lindström (Inactive) [ 2019-01-22 ] |
|
svoj Can you review this ? |