[MDEV-24260] mariabackup and innochecksum detects page faults but all ok in application Created: 2020-11-20 Updated: 2021-02-08 Resolved: 2021-02-08 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Admin statements, Backup, mariabackup, Platform RedHat, Server, Storage Engine - InnoDB |
| Affects Version/s: | 10.5.5, 10.5.8 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Minor |
| Reporter: | Frank Olsen | Assignee: | Vladislav Lesin |
| Resolution: | Incomplete | Votes: | 0 |
| Labels: | need_feedback, need_rr | ||
| Environment: |
Centos 7.8.2003 kernel 3.10.0-1127.19.1.el7.x_-_64 |
||
| Issue Links: |
|
||||||||
| Description |
|
Hi, maribackup fails on multiple tables with: Error: failed to read page after 10 retries. File <fname>.ibd seems to be corrupted. What did work was to import the dump on my Windows 10 Portable computer on 10.5.8. UPDATE: imported on a Centos 7 VM with same redhat-release/kernel: so, I guess this is not a bug but rather something bad in the configuration of the other two VMs. Main difference is that there are two instances. Or maybe some specific configuration that makes it fail. Tried different settings for innodb_checksum_algorithm to no avail. Did not see any message regarding corruption in error log. Seems quite similar to Happens even on empty tables. UPDATES:
Best regards |
| Comments |
| Comment by Vladislav Lesin [ 2020-11-30 ] |
|
The differential characteristic of |
| Comment by Frank Olsen [ 2020-11-30 ] |
|
Hi, If I add another page I also get: And now innochecksum complains about page 5 again. Best regards, |
| Comment by Frank Olsen [ 2020-11-30 ] |
|
Just to tried importing the now OK database that was originally corrupted. On another server. |
| Comment by Frank Olsen [ 2020-12-02 ] |
|
Hi, Another day another test.
Best regards, |
| Comment by Frank Olsen [ 2020-12-02 ] |
|
Next test: There were no errors in the mysql_error.log on the original instance to explain any corruption. A comparison of one of the corruped .ibd files showed that after the FORCE rebuild the followings to lines wen away at the end of the file: Meaning after FORCE I have: So to summarize/conclude at the moment:
It seems that there is some underlying systems issue at either Linux (Centos 7) or ESX level to explain the corruption. Not sure what though. Best regards, |
| Comment by Vladislav Lesin [ 2020-12-09 ] |
|
6tasticMDB, you wrote: > If I add another page I also get: Yes, this looks very similar to > Again what worked was to move it to another file system and do the import. What does it mean "to move it to another file system"? As I understand, you stopped the server, copied data directory to another file system, started the server, and then imported data from some mysqldump file? Is it correct? > There were no errors in the mysql_error.log on the original instance to explain any corruption. That means corrupted page is not reachable from the root of B-tree, the same thing we saw in > Where they really corrupted? select * no problem. Exported without any issue. Nothing in MySQL logs. No application error. For As we don't understand the root case, we decided to add the ability to continue backup if corrupted page is reached (see > Import on another instance on same server and also on another server: corruptions but not on the same tables. Corruptions even on empty tables in some cases. This could help us to find the root case. We would very appreciate you if you would agree to run import under rr, and then let us analyse rr traces on your environment(unfortunately, rr traces in most cases can not be replayed on another environment, besides rr does not work well on virtual machines). This would let us debug the process of pages corruption during import. |
| Comment by Julien Fritsch [ 2021-01-07 ] |
|
6tasticMDB we consider this bug currently more as a duplicate of |