[MDEV-9549] Trying to decrypt a not encrypted page Created: 2016-02-11 Updated: 2016-03-12 Resolved: 2016-03-12 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Encryption, Storage Engine - InnoDB, Storage Engine - XtraDB |
| Affects Version/s: | 10.1.10 |
| Fix Version/s: | 10.1.13 |
| Type: | Bug | Priority: | Major |
| Reporter: | René Cannaò | Assignee: | Jan Lindström (Inactive) |
| Resolution: | Fixed | Votes: | 1 |
| Labels: | None | ||
| Environment: |
Ubuntu trusty |
||
| Description |
|
Crashing bug, the server is trying to decrypt a page that is not encrypted.
Recompiled MariaDB 10.1.10 from source and debug symbols , the backtrace is:
key_version seems to be not zero:
Although: |
| Comments |
| Comment by Elena Stepanova [ 2016-02-12 ] |
|
Are you sure this backtrace is of the crashed thread? I don't see the signal handler there. Could you please attach all threads' stack trace? Please also paste and attach your cnf file(s) and specify which build you were using. Thanks. |
| Comment by Valerie Parham-Thompson [ 2016-02-22 ] |
|
Rene, Chris, Did you pinpoint a workload and setup that reproduced this error? I upgraded a test server from 10.0.21 to 10.1.11 and ran a large amount of inserts and selects against it, with no issues. I also set up replication between 10.0.21 and 10.1.11 and ran concurrent inserts and selects, with no issues. This is on Centos7. My tests were a very simple workload, though. I would like to continue to try to reproduce the error. What are the 10.0 and 10.1 versions you're using? OS and configs? |
| Comment by René Cannaò [ 2016-03-02 ] |
|
I don't have a scientific reproducible test case for this, but I can share some interesting patterns I found. Based on #a and #b , it seems that what caused the crash is not present anymore after a crash recovery. But #a points that maybe a specific write (that is stored in the relay log) causes the server to crash again. c) the master crashes when executing large transactions involving updating several hundreds of thousands of rows and deleting a similar amount I want to highlight again that I don't have a reproducible test case to share (no core dump, no specific queries), but #c and #d were quite common patterns. As the crashes seem somehow related to the size of the transactions, my hypothesis is: |
| Comment by Chris Calender (Inactive) [ 2016-03-02 ] |
|
Similar to René, I don't have a reproducible test case for this either. I did find out some extra details, which will hopefully help to isolate this. 1. This occurred after an in-place upgrade from MariaDB 10.0 to MariaDB 10.1.10 2. There had been a global tablespace error that also existed in the MariaDB 10.0 instance (but didn't cause problems in 10.0, so it went unnoticed). But when this tablespace error was encountered in 10.1.10 by mysqldump (note a mysqldump was in progress at the time of the crash), it triggered the crash. Unfortunately, I don't have further specifics regarding the exact tablespace error. 3. Here is the error log snippet of the crash & backtrace: 2016-02-25 19:42:46 140508033857280 [Warning] View 'test'.'vuser': there is unknown charset/collation names (client: ''; connection: ''). To report this bug, see http://kb.askmonty.org/en/reporting-bugs We will try our best to scrape up some info that will hopefully help Server version: 10.1.10-MariaDB-enterprise-log Thread pointer: 0x0x7fca36f5d008 Trying to get some variables. Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=on,exists_to_in=on The manual page at http://dev.mysql.com/doc/mysql/en/crashing.html contains |
| Comment by Elena Stepanova [ 2016-03-09 ] |
|
Did you also have any kind of a global tablespace error before or upon the upgrade, similar to what ccalender mentioned in the previous comment? I suppose if you had it, it would show in the error log, could you maybe check it for errors?
is it about concurrent updates/deletes, or sequential actions on the same table? Is it limited to the same table(s)? If so, can you paste SHOW CREATE and SHOW INDEX? Do you remember whether these tables were created in 10.0 or in previous versions? Thanks. |
| Comment by Jan Lindström (Inactive) [ 2016-03-12 ] |
|
commit f341d94423daa37bf4bee4d9b96ba8e8d93484c6 Make sure that on decrypt we do not try to reference |