[MDEV-13747] MariaDB 10.1.21 server sudden crash Created: 2017-09-06 Updated: 2017-12-13 Resolved: 2017-12-13 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera |
| Affects Version/s: | 10.1.21 |
| Fix Version/s: | 10.1.30 |
| Type: | Bug | Priority: | Major |
| Reporter: | Artur Čuvašov | Assignee: | Andrii Nikitin (Inactive) |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
CentOS 7.3.1611 |
||
| Description |
|
After this, the WSREP Recovery process starts. |
| Comments |
| Comment by Artur Čuvašov [ 2017-09-09 ] |
|
Another node has crashed today with a similar message:
```
To report this bug, see https://mariadb.com/kb/en/reporting-bugs
We will try our best to scrape up some info that will hopefully help
Server version: 10.1.21-MariaDB
Thread pointer: 0x7f8a1fc12008
Trying to get some variables.
Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=on,exists_to_in=on,orderby_uses_equalities=off
The manual page at http://dev.mysql.com/doc/mysql/en/crashing.html contains
```
|
| Comment by Andrii Nikitin (Inactive) [ 2017-09-11 ] |
|
Hi mtxd, to collect related info that may help in troubleshooting, could you provide: |
| Comment by Andrii Nikitin (Inactive) [ 2017-09-11 ] |
|
Disabling binary logging is a known workaround to |
| Comment by Artur Čuvašov [ 2017-09-11 ] |
|
Does that mean it's enough to add skip-log-bin and comment out log-bin, expire-logs-days, and sync-binlog? What about binlog_format? Since this has been a known workaround for more than a year, is anything clear about the cause and a solution? Some people might need binary logs together with Galera clustering. |
| Comment by Andrii Nikitin (Inactive) [ 2017-09-11 ] |
|
I can only say that the crash is related to binary logging, and it is confirmed that if binary logging doesn't happen, the crash will not occur. |
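To make the configuration question above concrete, here is a minimal Python sketch that reports which binary-log options are still active in my.cnf-style text. The option names come from the comments above; the function names `active_binlog_options` and `binlog_disabled` are hypothetical helpers, and the parser is deliberately simplified (it ignores which section a line belongs to, trailing inline comments, included files, and command-line overrides).

```python
# Options named in the thread; my.cnf accepts dashes and underscores interchangeably.
BINLOG_OPTIONS = {
    "log-bin",
    "skip-log-bin",
    "expire-logs-days",
    "sync-binlog",
    "binlog-format",
}


def active_binlog_options(cnf_text):
    """Return {option: value} for uncommented binlog-related lines."""
    found = {}
    for raw in cnf_text.splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and section headers such as [mysqld].
        if not line or line[0] in "#;[":
            continue
        name, _, value = line.partition("=")
        key = name.strip().lower().replace("_", "-")
        if key in BINLOG_OPTIONS:
            found[key] = value.strip() or None
    return found


def binlog_disabled(cnf_text):
    """Assumption: binary logging is off if skip-log-bin is set or log-bin is absent."""
    opts = active_binlog_options(cnf_text)
    return "skip-log-bin" in opts or "log-bin" not in opts
```

Running `binlog_disabled()` over each node's configuration text is one way to confirm the workaround is applied consistently across the cluster; it says nothing about whether the running server has actually picked up the change.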
| Comment by Brendan P [ 2017-10-08 ] |
|
Some further information: on our cluster we observed that the node we do most of the writes to will eventually stop purging binlogs; expire-logs-days ceases to function. Running RESET MASTER deletes all the binary logs as expected, but the server has crashed several times at random afterwards. The crash could come 10 or 30 minutes after the fact, or never at all; however, expire-logs-days does work again if the server hasn't crashed. We think that pt-online-schema-change could be a factor: it has crashed the server many times, with the same random delay, after we ran it during alters, and it also seems to cause the binary log purging failure some time after a few concurrent successful alters. |
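The purge-stall symptom described above can be watched for with a small check. The following Python sketch lists binary logs older than the expiry window; the function name `overdue_binlogs` and the shape of its input (file name plus last-modified time) are assumptions for illustration, not anything the server provides in this form.

```python
from datetime import datetime, timedelta


def overdue_binlogs(binlogs, expire_logs_days, now):
    """Return names of binary logs older than the expiry window.

    binlogs: iterable of (file_name, last_modified) pairs, for example
    built from file modification times (an assumption of this sketch).
    """
    if expire_logs_days <= 0:
        # A value of 0 means automatic purging is switched off.
        return []
    cutoff = now - timedelta(days=expire_logs_days)
    return sorted(name for name, mtime in binlogs if mtime < cutoff)
```

A non-empty result is only a hint that purging may have stalled, since the server removes expired logs at specific points rather than continuously, so a recently expired file can legitimately still be present.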
| Comment by Artur Čuvašov [ 2017-10-08 ] |
|
Check this out if any of your apps use the "ENABLE/DISABLE KEYS" construct: https://binary-data.github.io/2017/04/05/magento-mysql-crash-deadlock-when-index-under-highload/ Br, |
| Comment by Daniel Black [ 2017-10-09 ] |
|
Also worth noting: GET_LOCK is a known limitation and is unsupported in Galera: https://mariadb.com/kb/en/library/mariadb-galera-cluster-known-limitations/ |
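For anyone auditing application SQL for the two constructs raised in these comments, a minimal Python sketch follows. It flags statements that call GET_LOCK or use ENABLE/DISABLE KEYS; the `PATTERNS` table and `flag_statements` function are hypothetical names, and plain regex matching like this will also match text inside string literals or comments.

```python
import re

# Constructs mentioned in this thread; illustrative, not an exhaustive list.
PATTERNS = {
    "GET_LOCK": re.compile(r"\bGET_LOCK\s*\(", re.IGNORECASE),
    "ENABLE/DISABLE KEYS": re.compile(r"\b(?:ENABLE|DISABLE)\s+KEYS\b", re.IGNORECASE),
}


def flag_statements(statements):
    """Return (statement_number, label) pairs for statements that match a pattern."""
    hits = []
    for number, sql in enumerate(statements, start=1):
        for label, pattern in PATTERNS.items():
            if pattern.search(sql):
                hits.append((number, label))
    return hits
```

The input could be statements pulled from an application's query log or source tree; a hit only marks a statement for manual review.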
| Comment by Andrei Elkin [ 2017-12-13 ] |
|
Fixed by |