[MDEV-7276] Mariadb 10.0.14 Crash Signal 11 Created: 2014-12-06 Updated: 2015-01-13 Resolved: 2015-01-13 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Optimizer |
| Affects Version/s: | 10.0.14 |
| Fix Version/s: | 10.0.15 |
| Type: | Bug | Priority: | Major |
| Reporter: | Peter McLarty | Assignee: | Unassigned |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Redhat Linux 2.6.32-431.29.2.el6.x86_64 |
||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Description |
|
Couldn't restart with optimizer_switch parameters kept failing on that |
| Comments |
| Comment by Peter McLarty [ 2014-12-06 ] | |||||||||||
|
Interesting event - the buffer pool shows all pages free from around midnight before the crash | |||||||||||
| Comment by Peter McLarty [ 2014-12-08 ] | |||||||||||
|
I have conducted some testing today to see if i can better isolate the cause of this problem I was using Jmeter to run a series of real queries from our production database which don't always perform well and would likely benefit from histograms. I was only running 2 test threads in Jmeter Setting the optimizer_use_condition_selectivity to either 4 or 5 will crash this database, below is the optimizer switch settings from the crash entry in that log, Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=on,exists_to_in=on | |||||||||||
| Comment by Elena Stepanova [ 2014-12-08 ] | |||||||||||
|
Hi Peter, Would you be able to provide the query and involved data structures (or better datadump from these tables, if possible)? Also, in the initial description you quoted the error log, but the quote ends 3 lines after 'signal 11'. Is it all yo uhad in the log? Wasn't there at least an attempt of printing a stack trace, followed by the connection id (and possibly the query) where the crash happened? | |||||||||||
| Comment by Peter McLarty [ 2014-12-09 ] | |||||||||||
|
I have added the additional information to the description, missed it originally copying and pasting buffer contents didn't completely copy. Not sure what tables and query I am looking for? I will have to speak to my manager about what i can provide in the way of schema and data dump objects. Need to be aware of Australian privacy laws if the table contains sensitive data | |||||||||||
| Comment by Elena Stepanova [ 2014-12-09 ] | |||||||||||
|
Hi Peter, Thank you. Even if you can't provide the data, please paste the table structures and of course the query. You can obfuscate the table and column names (call them col1, col2 etc. if you wish, just do it consistently through the tables and the query). I hope this way there will be no problem with the law. As I understand from your previous query, you are able to reproduce the crash. In this case, locating the exact query and involved tables is not difficult.
In the general log you created, you will see numerous records of the kind
etc. The first number is the connection ID, which corresponds the Connection ID from the error log above. Don't forget to disable general log again, as it can affect performance. Thanks. | |||||||||||
| Comment by Peter McLarty [ 2015-01-13 ] | |||||||||||
|
I have uploaded to ftp some information on this. I can now reproduce the crash. It seems the problem won't occur when there is no new data being added to the database as I test instance, restored copy would not break. I have uploaded some gdb info, the schema of the tables which seem to have been involved and the query that was the last in the general.log at the time of the failure | |||||||||||
| Comment by Elena Stepanova [ 2015-01-13 ] | |||||||||||
|
Thanks, I could now reproduce the crash on 10.0.14.
|