[MDEV-28994] Backup produces garbage when using memory-mapped log (PMEM) Created: 2022-07-01 Updated: 2022-07-05 Resolved: 2022-07-01 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Backup |
| Affects Version/s: | 10.8 |
| Fix Version/s: | 10.8.4, 10.9.2, 10.10.1 |
| Type: | Bug | Priority: | Major |
| Reporter: | Marko Mäkelä | Assignee: | Marko Mäkelä |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | affects-tests, corruption | ||
| Issue Links: |
|
||||||||||||||||
| Description |
|
However, this also exposed us to a (very reasonable) limitation of the rr debugger: it is assumed that the memory-mapped file will not be changed by processes that are not traced by the same rr record instance. Because it may be hard to remember the rule that mariadb-backup --backup must not be traced with rr record when the PMEM interface is being used, we’d better make backup refuse the use of the "fake PMEM" (mmap on redo log located in /dev/shm) when the process is apparently being run under rr record. Normal file system calls should be fine. We can use innodb_use_native_aio=0 as a proxy for detecting rr record, because both the old io_setup (libaio) nor the newer io_uring interface will return ENOSYS under rr record. Access to log files on real persistent memory (mount -o dax) will not be affected. It is assumed that rr record mariadb-backup --backup seldomly needs to be run such that the log is stored in persistent memory. |
| Comments |
| Comment by Marko Mäkelä [ 2022-07-01 ] | |||||||||||||
|
mleich did some more testing. Unfortunately, the problems occur even when not using rr. Both /dev/shm and PMEM are affected. We even tried the following patch to force mariadb-backup --backup to always read the server’s ib_logfile0 via file system calls (pread(2)):
mleich, can you please fill in more details? I am afraid that the only way to fix this is to make the server process provide a copy of the log to the backup process. That would be a subset of MDEV-14992. | |||||||||||||
| Comment by Marko Mäkelä [ 2022-07-01 ] | |||||||||||||
|
I forgot to mention the most important part: The error that was reported by mariadb-backup --prepare was
In the dataset that I analyzed, the ib_logfile0 that had been produced by mariadb-backup --backup looked like garbage. I did not recognize proper mini-transaction boundaries. The checkpoint header pointed to the middle of something that looked a little like updating some FULLTEXT INDEX related fields, while it should have been a FILE_CHECKPOINT record, optionally preceded by a number of FILE_MODIFY records. Because of this mismatch, no mini-transaction was successfully parsed from the log. The end-of-minitransaction byte should always be 1 and never 0 in a log file that is produced by backup, but here the parser reached the byte 0. That triggered the end of file. What could have happened is the following:
Never using mmap for reading the log in mariadb-backup --backup should fix this type of scenario. |