[MDEV-21230] Corrupt database with MariaDB 10.4.10 (see: MDEV-20987) Created: 2019-12-05 Updated: 2020-04-20 Resolved: 2020-04-20 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Server |
| Affects Version/s: | 10.4.10 |
| Fix Version/s: | 10.3.21, 10.4.11 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Markus | Assignee: | Thirunarayanan Balathandayuthapani |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | bug, centos7, need_feedback | ||
| Environment: |
CentOS 7.7.1908, MariaDB 10.4.10, Relution 4.56, (VirtualBox 6.0.14) |
||
| Attachments: |
|
||||||||||||||||||||||||
| Issue Links: |
|
||||||||||||||||||||||||
| Description |
|
We are still experiencing the issue first reported with Steps for reproduction
Post-Installation steps
Install MariaDB
Configure MariaDB for application
Install application
Create initial database
Minimal application configuration
Start application
You can now start and stop the application, the database is working as expected. Restart MariaDB
Attempt to start application again
|
| Comments |
| Comment by Markus [ 2019-12-19 ] | |||||||||||||||||||||||||||||||||
|
Here's an easier reproduction, using docker: MariaDB 10.4.10
MariaDB 10.4.11
We've just tested MariaDB 10.4.11 and the issue can no longer be observed. | |||||||||||||||||||||||||||||||||
| Comment by Thirunarayanan Balathandayuthapani [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
m.pfeiffer can you please attach the server log file too ? The error messages in the log file could help | |||||||||||||||||||||||||||||||||
| Comment by Markus [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
Log file attached, this should be the relevant line:
| |||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
m.pfeiffer, I see, the server denied access to that table due to corruption. If you used a 10.3 or 10.4 version before If the cause is something else, then we would need a complete test case that would repeat the corruption. We made extensive efforts to chase down the bug around September and October last year, before I finally realized that I think that something else should have changed between 10.4.10 and 10.4.11. Was it with exactly the same database contents? | |||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
This could be related to | |||||||||||||||||||||||||||||||||
| Comment by Markus [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
@marko, I used the Docker setup from #comment-140997 just now to produce the server log file I attached in my previous comment. So this was a "fresh install" of MariaDB 10.4.10 with a brand new database. The issue can be easily reproduced with the steps in the initial description or by using Docker for both MariaDB and our product. I know the initial description is pretty lengthy, but basically it boils down to:
The Docker setup does pretty much the same thing, only a lot faster, since there is no need to install anything. The issue occurs with both MariaDB 10.4.10 and 10.4.9, but does not occur with 10.4.11. Once corrupt, updating the MariaDB version does not fix the issue. So this is a "fresh install" each time, by using "docker-compose down --volumes" between tests to remove the containers and associated volumes. | |||||||||||||||||||||||||||||||||
| Comment by Markus [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
Alright, I've used our service's logs to get the SQL statements issued by Liquibase to set up the database. Using the statements for one of the affected tables, I'm now able to reproduce the issue manually.
Repeating the same steps with MariaDB 10.4.11 does no longer see the same issue.
| |||||||||||||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
m.pfeiffer, it looks like your report could be a duplicate of Can you confirm the character set of the column uuid, for example by SHOW CREATE TABLE corruption.scheduler_task? Free support advice:
| |||||||||||||||||||||||||||||||||
| Comment by Markus [ 2020-03-16 ] | |||||||||||||||||||||||||||||||||
|
It does indeed look like a duplicate of that issue. We use utf8mb4 for our database since we need to support emojis. We rarely drop columns for backwards compatibility reasons, which would explain why <2% of our tables were affected. We use strings for legacy/compatibility reasons, not sure how feasible it is to change that unfortunately. I suppose it might be possible to change the character set of our key columns though. I'll open an internal ticket to investigate that. |