[MDEV-9771] server doesn't start because of one crashed table Created: 2016-03-21 Updated: 2016-04-22 Resolved: 2016-04-22 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Storage Engine - InnoDB |
| Affects Version/s: | 10.0.23 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | Pavel Dvorak | Assignee: | Jan Lindström (Inactive) |
| Resolution: | Not a Bug | Votes: | 0 |
| Labels: | None | ||
| Environment: |
debian 8.3 |
||
| Attachments: |
|
| Description |
|
server crashed and then wouldn't start |
| Comments |
| Comment by Elena Stepanova [ 2016-03-21 ] | ||||||||||||||||||||||
|
Hi, I see two variations of a fatal data error in your log:
repeated a number of times between 160318 20:38:18 and 160318 21:26:58;
Then there was a relatively successful (but also not clean) startup at 160318 21:55:10, and even normal shutdown. So, which of the two "error 71" variations are you referring to, and which one did you solve by removing a table (and what do you mean by that – that you removed an *.ibd file? Also might worth paying attention:
| ||||||||||||||||||||||
| Comment by Pavel Dvorak [ 2016-03-21 ] | ||||||||||||||||||||||
|
i think gps_DAITE errors occured when we tried to reimport the database and the server was running problem was this 160318 20:38:22 [ERROR] InnoDB: File (unknown): 'read' returned OS error 71. Cannot continue operation | ||||||||||||||||||||||
| Comment by Elena Stepanova [ 2016-03-21 ] | ||||||||||||||||||||||
|
Thanks. jplindst, there is a choice of things to look at in the error log. While it's questionable if the engine should actually start with a badly corrupted data (without force_recovery) – and it's still corrupted even after the file was removed – still, obviously things go wrong there, the whole "File (unknown)" to begin with; so, I'm assigning it to you to see what and how should be fixed here. | ||||||||||||||||||||||
| Comment by Pavel Dvorak [ 2016-03-21 ] | ||||||||||||||||||||||
|
yes it's regular version | ||||||||||||||||||||||
| Comment by Jan Lindström (Inactive) [ 2016-03-22 ] | ||||||||||||||||||||||
|
Hmm, most likely the file (unknown) is the system tablespace as it has no table name, this is not a bug it is by design. InnoDB has protection that it will not start by default on corrupted datadir, instead you need to provide force_recovery option. Error log contains several different errors, out of disk space, long semaphore wait during rename table (known problem), O_DIRECT errors (known problem), etc. Can you start your database know or do you still need help to investigate the issue? If you need investigation, I would need your database. | ||||||||||||||||||||||
| Comment by Pavel Dvorak [ 2016-03-22 ] | ||||||||||||||||||||||
|
yes db is running now so if i understand that correctly i save safely delete all temp files starting with "#"? | ||||||||||||||||||||||
| Comment by Elena Stepanova [ 2016-03-22 ] | ||||||||||||||||||||||
|
jplindst, I would vote for better diagnostics in the error log, including replacing "unknown" with something meaningful, and maybe some hints on what to do with the problem (like in other cases when InnoDB helpfully suggests to use innodb_force_recovery). | ||||||||||||||||||||||
| Comment by Jan Lindström (Inactive) [ 2016-04-22 ] | ||||||||||||||||||||||
|
The diagnostics that can be easily added for GA product is already there. Adding additional diagnostics is possible but not possible to do exhaustive and all cases cover fix. For example fixing this (unknow) file is not possible on all cases easily. | ||||||||||||||||||||||
| Comment by Jan Lindström (Inactive) [ 2016-04-22 ] | ||||||||||||||||||||||
|
commit 298e1d3f71cce89c83645ea9cc1eade4bdaf4917 Improve error diagnostics on I/O errors. If node->name is NULL |