Details
-
Task
-
Status: Closed (View Workflow)
-
Critical
-
Resolution: Fixed
Description
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible.
This is obviously not practical when corruption in one table prevents making backups of the entire server.
Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance?
-----------------
From Julien - Here is an additonal explanaition why this would be important to be done in 10.6 ralf.gebhardt@mariadb.com.
-------------------
From Vlad Lesin - Here is detailed description of the feature from commit message:
The new option --log-innodb-page-corruption is introduced.
When this option is set, backup is not interrupted if innodb corrupted
page is detected. Instead it logs all found corrupted pages in
innodb_corrupted_pages file in backup directory and finishes with error.
For incremental backup corrupted pages are also copied to .delta file,
because we can't do LSN check for such pages during backup,
innodb_corrupted_pages will also be created in incremental backup
directory.
During --prepare, corrupted pages list is read from the file just after
redo log is applied, and each page from the list is checked if it is allocated
in it's tablespace or not. If it is not allocated, then it is zeroed out,
flushed to the tablespace and removed from the list. If all pages are removed
from the list, then --prepare is finished successfully and
innodb_corrupted_pages file is removed from backup directory. Otherwise
--prepare is finished with error message and innodb_corrupted_pages contains
the list of the pages, which are detected as corrupted during backup, and are
allocated in their tablespaces, what means backup directory contains corrupted
innodb pages, and backup can not be considered as consistent.
For incremental --prepare corrupted pages from .delta files are applied
to the base backup, innodb_corrupted_pages is read from both base in
incremental directories, and the same action is proceded for corrupted
pages list as for full --prepare. innodb_corrupted_pages file is
modified or removed only in base directory.
If DDL happens during backup, it is also processed at the end of backup
to have correct tablespace names in innodb_corrupted_pages.
Attachments
Issue Links
- relates to
-
MDEV-21681 Log sequence numbers do not match during backup
-
- Closed
-
-
MDEV-23971 add the ability to fix corrupted pages on --prepare
-
- Closed
-
-
MDEV-24479 Document mariabackup option from MDEV-22929
-
- Closed
-
-
MDEV-21109 Table corruption not detected with CHECK TABLE or innochecksum, only with mariabackup
-
- Closed
-
Activity
Field | Original Value | New Value |
---|---|---|
Link |
This issue relates to |
Description |
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible.
This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding an innodb_focre_recovery=1 option to mariabackup, for instance? |
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible.
This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? |
Assignee | Ralf Gebhardt [ ralf.gebhardt@mariadb.com ] |
Description |
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible.
This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? |
Summary | Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup. | MariaBackup option to report and/or continue when corruption is encountered |
Description |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. |
Description |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. -------------- From Nick, Additional Thoughts: I believe we could achieve this simply by reverting or modifying https://jira.mariadb.org/browse/MDEV-20607. That changed the behavior of backup to instead of reporting errors in the log and then *Completed: Ok* to crash when an error occurs. My thought is that it would be better to continue the backup (or have a flag to allow this) and report* Completed - Errors encountered, check log instead*. This would allow customers to use backup as a corruption detection tool and allow for partial backups in the case of corrupted tables. |
Link | This issue relates to TODO-2507 [ TODO-2507 ] |
Description |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. -------------- From Nick, Additional Thoughts: I believe we could achieve this simply by reverting or modifying https://jira.mariadb.org/browse/MDEV-20607. That changed the behavior of backup to instead of reporting errors in the log and then *Completed: Ok* to crash when an error occurs. My thought is that it would be better to continue the backup (or have a flag to allow this) and report* Completed - Errors encountered, check log instead*. This would allow customers to use backup as a corruption detection tool and allow for partial backups in the case of corrupted tables. |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. |
Link |
This issue relates to |
Assignee | Ralf Gebhardt [ ralf.gebhardt@mariadb.com ] | Vladislav Lesin [ vlad.lesin ] |
Fix Version/s | 10.2 [ 14601 ] | |
Fix Version/s | 10.3 [ 22126 ] | |
Fix Version/s | 10.4 [ 22408 ] | |
Fix Version/s | 10.5 [ 23123 ] | |
Fix Version/s | 10.6 [ 24028 ] |
Status | Open [ 1 ] | In Progress [ 3 ] |
Priority | Major [ 3 ] | Critical [ 2 ] |
Assignee | Vladislav Lesin [ vlad.lesin ] | Vladislav Vaintroub [ wlad ] |
Status | In Progress [ 3 ] | In Review [ 10002 ] |
Description |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. |
Currently Mariabackup aborts when it detects any InnoDB corruption. Needs an option to complete the backup and flag or log the corruption rather than leaving the entire server with no backup.
In situations where Mariabackup detects corruption while taking a backup, it currently aborts where InnoDB would assert, making backing up a corrupted server impossible. This is obviously not practical when corruption in one table prevents making backups of the entire server. Would it be possible to address this need by adding a force option like innodb_focre_recovery=1 to mariabackup, for instance? ----------------- From Julien - Here is an additonal [explanaition |https://jira.mariadb.org/browse/MDEV-21109?focusedCommentId=160067&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-160067]why this would be important to be done in 10.6 [~ralf.gebhardt@mariadb.com]. ------------------- From Vlad Lesin - Here is detailed description of the feature from commit message: The new option --log-innodb-page-corruption is introduced. When this option is set, backup is not interrupted if innodb corrupted page is detected. Instead it logs all found corrupted pages in innodb_corrupted_pages file in backup directory and finishes with error. For incremental backup corrupted pages are also copied to .delta file, because we can't do LSN check for such pages during backup, innodb_corrupted_pages will also be created in incremental backup directory. During --prepare, corrupted pages list is read from the file just after redo log is applied, and each page from the list is checked if it is allocated in it's tablespace or not. If it is not allocated, then it is zeroed out, flushed to the tablespace and removed from the list. If all pages are removed from the list, then --prepare is finished successfully and innodb_corrupted_pages file is removed from backup directory. Otherwise --prepare is finished with error message and innodb_corrupted_pages contains the list of the pages, which are detected as corrupted during backup, and are allocated in their tablespaces, what means backup directory contains corrupted innodb pages, and backup can not be considered as consistent. For incremental --prepare corrupted pages from .delta files are applied to the base backup, innodb_corrupted_pages is read from both base in incremental directories, and the same action is proceded for corrupted pages list as for full --prepare. innodb_corrupted_pages file is modified or removed only in base directory. If DDL happens during backup, it is also processed at the end of backup to have correct tablespace names in innodb_corrupted_pages. |
Assignee | Vladislav Vaintroub [ wlad ] | Vladislav Lesin [ vlad.lesin ] |
Status | In Review [ 10002 ] | Stalled [ 10000 ] |
Fix Version/s | 10.2.37 [ 25112 ] | |
Fix Version/s | 10.3.28 [ 25111 ] | |
Fix Version/s | 10.4.18 [ 25110 ] | |
Fix Version/s | 10.5.9 [ 25109 ] | |
Fix Version/s | 10.6.0 [ 24431 ] | |
Fix Version/s | 10.2 [ 14601 ] | |
Fix Version/s | 10.3 [ 22126 ] | |
Fix Version/s | 10.4 [ 22408 ] | |
Fix Version/s | 10.5 [ 23123 ] | |
Fix Version/s | 10.6 [ 24028 ] | |
Resolution | Fixed [ 1 ] | |
Status | Stalled [ 10000 ] | Closed [ 6 ] |
Link | This issue relates to TODO-2679 [ TODO-2679 ] |
Link |
This issue relates to |
Link |
This issue relates to |
Workflow | MariaDB v3 [ 110171 ] | MariaDB v4 [ 134292 ] |
Zendesk Related Tickets | 110821 185032 126984 |
MDEV-20607is only for innodb initialization. It does not touch the code of page consistency verification, so, no, this issue can not be implemented with justMDEV-20607reverting.