Details

    • Type: New Feature
    • Status: Open (View Workflow)
    • Priority: Critical
    • Resolution: Unresolved
    • Fix Version/s: 12.1
    • Component/s: Backup
    • Labels: None
    • Sprint: Server 12.1 dev sprint

    Description

      The purpose of this work is to improve the current situation around backups by implementing a SQL BACKUP command.

      Interface between client and server

      The BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e. the client issues a BACKUP statement and the server sends a result set back.

      The ResultSet consists of the following fields:

      • VARCHAR(N) filename
      • INT(SET?) flags (OPEN, CLOSE, SPARSE, RENAME, DELETE, etc.)
      • BIGINT offset
      • LONGBLOB data

      The client

      There will be a simple client program which will issue the "BACKUP" statement, read the result set from the server, and apply it to create a directory with a copy of the database. Due to the ability to use the client API (in C, .NET, Java, etc.), it should be relatively easy to write specialized backup applications for special needs (e.g. "streaming to cloud", distributing to multiple machines).
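      A minimal client along these lines could look as follows (a Python sketch; in reality the rows would come from a connector cursor iterating over the BACKUP result set, and the flag values shown are assumptions, not actual protocol constants):

```python
import os

# Hypothetical flag bits -- the actual bit assignments are not specified here.
OPEN, CLOSE, SPARSE = 1, 2, 4

def apply_backup_rows(rows, target_dir):
    """Apply (filename, flags, offset, data) result-set rows to target_dir,
    reconstructing each file chunk by chunk at the given offsets."""
    handles = {}
    for filename, flags, offset, data in rows:
        path = os.path.join(target_dir, filename)
        if flags & OPEN:
            os.makedirs(os.path.dirname(path), exist_ok=True)
            handles[filename] = open(path, "wb")
        f = handles[filename]
        if data:
            f.seek(offset)
            f.write(data)
        if flags & CLOSE:
            f.close()
            del handles[filename]
```

      With a real connector, the rows would be fetched from a cursor after executing the BACKUP statement; the loop body would stay the same.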

      Interface between server and storage engines.

      Most of the actual work is done by the storage engines. The server initiates a backup by calling the storage engine's backup_begin() function. For every file chunk, the storage engine calls the server's backup_file(filename, flags, offset, length, data) function, which streams the file chunk as a result-set row to the client. When the backup is finished, the engine calls backup_end() so that the server knows the backup is done.

      The default implementation of backup_begin() enumerates the engine's own files, reads them, and calls backup_file() for every chunk read. FLUSH TABLES FOR BACKUP must be issued for that.
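      The default flow can be simulated like this (Python used for illustration only; the real hooks would be handler methods inside the server, and the chunk size is an arbitrary choice):

```python
import os

CHUNK_SIZE = 64 * 1024  # assumed chunk size, not mandated by the design

def default_backup_begin(engine_files, backup_file):
    """Default engine implementation: enumerate the engine's own files and
    stream each one to the server's backup_file() callback chunk by chunk."""
    for path in engine_files:
        with open(path, "rb") as f:
            offset = 0
            while True:
                data = f.read(CHUNK_SIZE)
                if not data:
                    break
                # flags left as 0 here; a real engine would set OPEN/CLOSE etc.
                backup_file(os.path.basename(path), 0, offset, len(data), data)
                offset += len(data)
```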

      Engines that support online backup, like InnoDB, would have a much more complicated implementation.

      Incremental and partial backups

      Incremental/differential backups (changes since a specific backup) and partial backups (only some tables) will be supported as well, and the client-server interface does not change much for that. For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta?), and save diffs to be applied to the "base backup" directory later.
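      As an illustration of a minimal home-grown "patch" format (purely hypothetical; a real client would more likely use rdiff, bsdiff, or xdelta), a patch could be a list of (offset, data) records applied over the base file:

```python
def make_patch(base: bytes, new: bytes):
    """Naive byte-wise diff: record every run of bytes that changed.
    Real tools (rdiff, bsdiff, xdelta) produce far more compact diffs."""
    patch, i = [], 0
    while i < len(new):
        if i >= len(base) or base[i] != new[i]:
            j = i
            while j < len(new) and (j >= len(base) or base[j] != new[j]):
                j += 1
            patch.append((i, new[i:j]))
            i = j
        else:
            i += 1
    if len(new) < len(base):
        patch.append(("truncate", len(new)))
    return patch

def apply_patch(base: bytes, patch):
    """Apply the records over a copy of the base file image."""
    buf = bytearray(base)
    for offset, data in patch:
        if offset == "truncate":
            del buf[data:]
            continue
        if offset + len(data) > len(buf):
            buf.extend(b"\0" * (offset + len(data) - len(buf)))
        buf[offset:offset + len(data)] = data
    return bytes(buf)
```

      The same apply step is what a "patch" application for restoring incremental backups would perform over the base backup directory.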

      Backup metadata.

      There is a need to store different kinds of metadata in the backup, usually the binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as a session variable available after the BACKUP command, or sent as an extra result set after the backup one (the protocol does support multi-result sets).

      Restore

      Unlike the current solution with mariabackup or xtrabackup, there won't be any kind of "prepare" or "restore" application.

      The directory created by a full backup should be directly usable as the data directory of a new database.

      Files from partial backups could just be copied to the destination (and maybe "IMPORT TABLESPACE", if we still support that).
      There should still be some kind of "patch" application for applying incremental backups to the base directory. Depending on how our client stores incremental backups, we may be able to do without it, and users would only need a 3rd-party bsdiff, rdiff, or mspatcha.

      Attachments

        Issue Links

          Activity

            Vladislav Vaintroub created issue -
            zhangyuan added a comment -

            Hi, the attached file shows the basic idea of the cross-engine backup which we implemented a year ago.

            Nuno added a comment -

            This would be really great, and would make it more like SQL Server.

            I love the way BACKUP/RESTORE works in SQL Server.

            It would be very nice and convenient if we could do BACKUP/RESTORE for single databases, so I could restore a copy of a database on the same server, to recover the data of a column that got messed up, etc...

            Thank you very much for this.

            Daniel Black added a comment -

            As a within-storage-engine detail, a Linux-only technique that reduces contention is to use ioctl_ficlonerange, where available, to obtain a shallow copy of the key table contents that is immune to changes, and to release the backup locks as soon as possible.

            Marko Mäkelä added a comment - edited

            Server-side backup will prevent errors like MDEV-21255 or MDEV-27551 from occurring, because the server can simply replicate its write-ahead log to the backup stream, and the server knows when it is deleting or renaming files. If needed, the throughput of any write workload that the server is concurrently handling would be slowed down.

            Some years ago, I thought that the most straightforward implementation of a server-side InnoDB backup would be to (re)write changed pages from the buffer pool to a result set stream.

            • Advantage: Whatever tool restores the stream can simply restore all data files directly.
            • Disadvantage: The stream basically becomes a log that stores full page images even for tiny changes (such as updating one byte on the page).

            With MDEV-12353 and MDEV-14425, I think that the best format for streaming InnoDB changes would be the InnoDB log file. The server-side backup could dump InnoDB data file images as well as an append-only copy of the server’s log, written as an ib_logfile0 that can be applied by the recipient. This log file would have a similar format to the one that is currently created by mariadb-backup --backup.

            For incremental backups, it could be easiest to implement redo log archiving, say, by duplicating the circular ib_logfile0 with append-only file(s) that might be named ib_logfile.%llu and start at the LSN identified in the header or in the file name. An incremental backup client would request all log records from the LSN where it last left off, and the server would either say "sorry, our log is not that old" or construct a corresponding log file (starting with an archived log file that is at least as old as that LSN). In the MDEV-14425 format, cutting some LSN from the beginning of an append-only log file is trivial: just rewrite the header so that the log payload area starts at the desired LSN, and then copy everything starting from the byte offset that represents that LSN.
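            The log-trimming step described in the last sentence can be sketched as follows (purely illustrative; the 16-byte header layout here is invented, not the actual MDEV-14425 format, and one LSN unit is assumed to map to one payload byte):

```python
import struct

HEADER_SIZE = 16  # invented header: start LSN + payload length, 8 bytes each

def make_log(start_lsn: int, payload: bytes) -> bytes:
    """Build an append-only log file image with the invented header."""
    return struct.pack("<QQ", start_lsn, len(payload)) + payload

def trim_log(log: bytes, new_start_lsn: int) -> bytes:
    """Cut everything before new_start_lsn: rewrite the header and copy
    the payload from the byte offset that represents that LSN."""
    start_lsn, length = struct.unpack_from("<QQ", log, 0)
    assert start_lsn <= new_start_lsn <= start_lsn + length
    skip = new_start_lsn - start_lsn  # assumption: 1 byte of log per LSN unit
    payload = log[HEADER_SIZE + skip:]
    return struct.pack("<QQ", new_start_lsn, len(payload)) + payload
```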

            marko Marko Mäkelä added a comment - - edited

            The suggested "Interface between client and server" would basically replace the internal datasink.h interface that is being used in mariadb-backup, and it would be similar to the datasink_xbstream implementation.

            For InnoDB, the server-side backup would have the benefit of using the buffer pool as a cache. Asynchronous read-ahead can load pages into the buffer pool.

            The algorithm for a full backup could be as follows:

            1. Initiate a log checkpoint and start streaming log records directly from log_sys.buf.
            2. Enumerate all persistent tablespaces. (They may be dropped at any time because we are not holding locks; we must prevent bugs like MDEV-27551.)
            3. For each tablespace that we found, acquire U-lock on every page (based on the current size of the tablespace) and dump it to the stream.
              • Skip any pages that do not exist in the buffer pool and are marked as free in the allocation bitmap page.
              • If a page that we read to the buffer pool is corrupted, issue an error message, and skip the page. (Backup must not cause a server crash like MDEV-13542.)
              • Skip any pages that are marked as free in the page descriptor, similar to MDEV-15528.
              • Skip any pages that are marked as newly initialized in the page descriptor. They will be covered by the redo log, similar to MDEV-19738.
              • If access_time==0 && oldest_modification() <= 1, the page must have been only accessed due to backup, and we might want to evict it from the buffer pool, to reduce cache pollution.
              • Switch to the next tablespace if the tablespace was flagged for deletion.
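            The page-filtering rules of step 3 could be sketched like this. Page and all of its fields are hypothetical stand-ins for InnoDB's real buffer-pool and page-descriptor structures; the sketch only illustrates which pages would be streamed.

```python
from dataclasses import dataclass

@dataclass
class Page:
    page_no: int
    in_buffer_pool: bool = True
    free_in_bitmap: bool = False        # free in the allocation bitmap page
    free_in_descriptor: bool = False    # free in the page descriptor
    freshly_initialized: bool = False   # newly initialized; redo log covers it
    corrupted: bool = False

def pages_to_dump(pages, report_error=print):
    """Yield the pages that a full backup would actually stream."""
    for page in pages:
        if not page.in_buffer_pool and page.free_in_bitmap:
            continue                    # free page, never read it from disk
        if page.corrupted:
            report_error(f'skipping corrupted page {page.page_no}')
            continue                    # must not crash, unlike MDEV-13542
        if page.free_in_descriptor:
            continue                    # similar to MDEV-15528
        if page.freshly_initialized:
            continue                    # covered by the redo log (MDEV-19738)
        yield page
```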

            Observations:

            • If files were deleted during the backup, the stream may contain some ‘garbage’ pages belonging to deleted files, as well as FILE_DELETE records in the copied log.
            • If files were renamed during the backup, the stream would contain the original file names, as well as FILE_RENAME records in the copied log.
            • If files were created during the backup, all changes will be covered by the copied log; see MDEV-24626.
            • If files were extended during the backup, in the copied log there will be records to update the FSP_SIZE and to initialize any added pages.
            • If any pages were modified after they were copied, in the copied log there will be records to cover those updates.

            For incremental backups, my best idea remains that the log be archived on the server along with a log of all checkpoints. We would simply stream the log from the latest checkpoint that is at or before the requested LSN. We would probably have to keep a directory of checkpoint LSNs in the archived log, because the log header only stores the 2 latest ones. A simple way could be to switch to a new archived log file on every checkpoint. The checkpoint LSN of the archived log could be available in the file name.
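            Picking the archived log file for a requested LSN would then be a simple scan of the file names. The ib_logfile.&lt;LSN&gt; naming below follows the suggestion above and is otherwise hypothetical.

```python
import re

def pick_archived_log(file_names, requested_lsn):
    """Return the archived log file whose checkpoint LSN is the largest one
    at or below requested_lsn, or None when the log is not that old
    ("sorry, our log is not that old")."""
    best = None
    for name in file_names:
        m = re.fullmatch(r'ib_logfile\.(\d+)', name)
        if m:
            lsn = int(m.group(1))
            if lsn <= requested_lsn and (best is None or lsn > best[0]):
                best = (lsn, name)
    return best[1] if best else None
```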

            All functionality of mariadb-backup --prepare would have to be integrated in the normal InnoDB startup.

            marko Marko Mäkelä added a comment -

            Bugs like MDEV-27424 would be impossible, or they would affect the server and backup alike, if all pages for the backup were read from the server’s buffer pool. There would be no race condition between two processes reading and writing files concurrently, and no need to re-read corrupted-looking pages because of that.

            marko Marko Mäkelä added a comment -
            marko Marko Mäkelä added a comment - - edited

            MDEV-27621 is a case where backup cannot keep up with the server that is writing log. If the server process itself streamed the log like proposed in this task, it could trivially slow down the write workload as necessary.


            marko, I am not sure it is a good idea for a strained backup to slow down the server to accommodate its needs. We want backup to have as little impact as possible, and this idea goes in the opposite direction.

            rpizzi Rick Pizzi (Inactive) added a comment -

            The DS_TYPE_LOCAL interface of mariadb-backup could be demoted to a fallback of something that would invoke the Linux system call copy_file_range. That function call would not only allow files to be copied with less context switching, but also allow instantaneous ‘copying’ on file systems that support snapshots via copy-on-write (such as xfs and btrfs).

            Similarly, the DS_TYPE_STDOUT interface could be demoted to a fallback of the Linux system call splice or the nonstandard but widely implemented system call sendfile.
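            As a sketch of the proposed "demote to a fallback" layering, here is the same idea using Python's binding of the system call (os.copy_file_range, Linux-only since Python 3.8), with a plain read/write loop as the portable fallback. The function name is illustrative, not an actual datasink interface.

```python
import os

def copy_file(src_path, dst_path, chunk=1 << 20):
    """Copy a file via copy_file_range() when available, else via read/write."""
    with open(src_path, 'rb') as src, open(dst_path, 'wb') as dst:
        if hasattr(os, 'copy_file_range'):
            try:
                # copy_file_range() advances both file offsets and returns the
                # number of bytes copied; 0 signals end of file.
                while os.copy_file_range(src.fileno(), dst.fileno(), chunk):
                    pass
                return
            except OSError:
                # e.g. cross-device copy on an older kernel: restart cleanly
                src.seek(0)
                dst.seek(0)
                dst.truncate()
        while data := src.read(chunk):
            dst.write(data)
```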

            marko Marko Mäkelä added a comment -

            rpizzi, the "redo log archiving" feature of MySQL 8.0 implements the idea of server-assisted copying of the log in a restricted form: writing a copy of the log to a file system that is directly accessible at the server.

            What would you choose if the choices are the following?

            1. Some slowdown of the server during backup (because the server would be responsible for replicating the log)
            2. Extreme I/O load during backup (MDEV-28772), or context switching load on the kernel (inter-process communication via the file system)
            3. A failed backup (MDEV-27621)
            marko Marko Mäkelä added a comment -

            If there are enough resources, I would go with #2. Obviously the issue is when there are not enough resources for both the workload AND the backup. In this situation we want the workload to take precedence, but we also want the backup to complete, albeit taking longer if there is resource starvation.

            rpizzi Rick Pizzi (Inactive) added a comment -

            marko, reading the blog article I am somewhat puzzled: a heavily used system will already write redo log at a very high pace, so do we (does LeFred) really think that it is a good idea to duplicate that heavy I/O in these circumstances?

            To avoid redo log overrun, it should be sufficient to make the logs large enough.

            rpizzi Rick Pizzi (Inactive) added a comment -

            Marko, both 1 and 2 seem like good ideas, perhaps configurable with a variable such as innodb_log_file_backup_method.

            manjot Manjot Singh (Inactive) added a comment -

            +1 for Manjot's request. Among other motivations, xfs snapshots can be problematic, as they freeze I/O.

            f_razzoli Federico Razzoli added a comment -
            h1. Backup metadata.
            There is a need to store different kinds of metadata in the backup, usually the binlog position, and some kind of timestamp (LSN) for incremental backups. This information can be stored in a session variable available after the BACKUP command, or sent as an extra result set after the backup one (the protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kind of "prepare" or "restore" application.

            The directory that was made when the full backup was stored should be usable for a new database.

            Files from partial backups could just be copied to the destination (and maybe "IMPORT TABLESPACE", if we still support that). There should still be some kind of "patch" application for applying incremental backups to the base directory. Depending on how our client stores incremental backups, we may be able to do without it, and users would only need to use third-party bsdiff, rdiff, or mspatcha.

            I believe that a reasonable subgoal of this task would be to remove the need to execute mariadb-backup --prepare, that is, try to make it so that the server can be started directly on backed-up files. With the InnoDB log file format change of MDEV-14425, that should be even easier:

            1. Incremental backups can be restored by treating the initial ib_logfile0 from the full backup and the incremental snippets as if they had been concatenated together.
              • In the MDEV-14425 format, an InnoDB log block is a mini-transaction; there is no padding to 512-byte log blocks anymore.
              • As an option, incremental backup could simply append new log records to the ib_logfile0 that was updated by the last incremental or full backup.
            2. Some adjustment due to MDEV-18184 will be needed.
              • Maybe, conduct some performance tests to verify if the ‘optimization’ is useful at all? A server-side backup can avoid copying newly initialized data files altogether!
              • The server could look for a backup metadata file and validate its contents, like mariadb-backup --prepare currently does.
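            The "simply append" option above could look like this. The sketch assumes the MDEV-14425 premise stated above, namely that there is no 512-byte block padding, so plain byte concatenation of the base ib_logfile0 and the incremental snippets yields a log the server can recover from directly.

```python
def append_incremental(base_log_path, snippet_paths):
    """Append incremental log snippets onto the base backup's ib_logfile0,
    treating base + snippets as one concatenated log."""
    with open(base_log_path, 'ab') as base:
        for path in snippet_paths:
            with open(path, 'rb') as snippet:
                while chunk := snippet.read(1 << 20):
                    base.write(chunk)
```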
            marko Marko Mäkelä added a comment -

            I forgot that there are two usage scenarios of incremental backup:

            • With no gaps in the redo log: If writes are infrequent, or if we implemented and enabled log archiving on the server, we can simply copy and apply the additional section of the log.
            • If the log from the last backup LSN is missing, we must copy all data files that were changed since the last backup (based on the FIL_PAGE_LSN of each page).

            In the latter case, the additional preparation might be too complex to be implemented as part of the normal crash recovery.
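            The decision between the two scenarios could be sketched as follows; the function name and the page representation are illustrative, not real server interfaces.

```python
def plan_incremental(last_backup_lsn, oldest_archived_lsn, pages):
    """Return ('apply-log', None) when the archived log reaches back to the
    last backup LSN, else ('copy-pages', changed_pages) selected by each
    page's FIL_PAGE_LSN (here a plain dict entry)."""
    if oldest_archived_lsn <= last_backup_lsn:
        # No gap: stream and apply the additional section of the log.
        return ('apply-log', None)
    # Gap in the log: copy every page changed since the last backup.
    changed = [p for p in pages if p['fil_page_lsn'] > last_backup_lsn]
    return ('copy-pages', changed)
```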

            marko Marko Mäkelä added a comment -
            serg Sergei Golubchik made changes -
            Fix Version/s 11.2 [ 28603 ]
            serg Sergei Golubchik made changes -
            Fix Version/s 11.3 [ 28565 ]
            julien.fritsch Julien Fritsch made changes -
            Issue Type Task [ 3 ] New Feature [ 2 ]
            julien.fritsch Julien Fritsch made changes -
            Fix Version/s 11.7 [ 29815 ]
            marko Marko Mäkelä added a comment - edited

            There is a Linux system call ioctl(FICLONE) (applicable to XFS, btrfs, bcachefs) as well as macOS fclonefileat() (APFS) and something for the Microsoft Windows ReFS that would allow more efficient copying of files, similar to GNU cp --reflink=auto. I think that we should consider an option to employ that for copying most files. MyRocks backups are probably best served by regular hard links.

            Edit: MDEV-23947 has been filed for this idea.
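The reflink idea above could look roughly like the following sketch, mirroring what GNU cp --reflink=auto does: try ioctl(FICLONE) so that the filesystem (XFS, btrfs, bcachefs) shares extents instead of duplicating data, and fall back to a plain byte copy where reflinks are unsupported. The function name `copy_reflink_auto` is invented for illustration:

```c
#include <fcntl.h>
#include <unistd.h>
#include <sys/ioctl.h>
#ifdef __linux__
#include <linux/fs.h>   /* FICLONE */
#endif

/* Copy src to dst, sharing extents via FICLONE when the filesystem
 * supports reflinks; otherwise fall back to an ordinary byte copy.
 * Returns 0 on success, -1 on error. */
int copy_reflink_auto(const char *src, const char *dst)
{
    int in = open(src, O_RDONLY);
    if (in < 0)
        return -1;
    int out = open(dst, O_WRONLY | O_CREAT | O_TRUNC, 0600);
    if (out < 0) {
        close(in);
        return -1;
    }
    int ret = 0;
#ifdef FICLONE
    if (ioctl(out, FICLONE, in) == 0)
        goto done;              /* extents shared; no data was copied */
#endif
    {
        char buf[65536];
        ssize_t n;
        while ((n = read(in, buf, sizeof buf)) > 0)
            if (write(out, buf, n) != n) {
                ret = -1;
                break;
            }
        if (n < 0)
            ret = -1;
    }
done:
    close(in);
    close(out);
    return ret;
}
```

On filesystems without reflink support the ioctl fails (typically with EOPNOTSUPP) and the fallback path behaves like a normal file copy, so the same code path can serve all backends.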


            I think that we need to implement some prototype of this, or MDEV-21105 (or MDEV-7502).

            marko Marko Mäkelä made changes -
            Assignee Marko Mäkelä [ marko ] Debarun Banerjee [ JIRAUSER54513 ]
            serg Sergei Golubchik made changes -
            Fix Version/s 11.8 [ 29921 ]
            Fix Version/s 11.7 [ 29815 ]
            serg Sergei Golubchik made changes -
            Fix Version/s 11.9 [ 29945 ]
            Fix Version/s 11.8 [ 29921 ]
            serg Sergei Golubchik made changes -
            Fix Version/s 12.1 [ 29992 ]
            Fix Version/s 12.0 [ 29945 ]
            julien.fritsch Julien Fritsch made changes -
            Sprint Server 12.1 dev sprint [ 793 ]

            People

              Assignee: debarun Debarun Banerjee
              Reporter: wlad Vladislav Vaintroub
              Votes: 16
              Watchers: 37
