Details

    • Type: New Feature
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Fix Version: 12.1
    • Component: Backup
    • Labels: None

    Description

      The purpose of this work is to improve the current situation around backups by implementing a SQL BACKUP command.

      Interface between client and server

      The BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e. the client issues a BACKUP statement, and the server sends a result set back.

      The ResultSet consists of the following fields:

      • VARCHAR(N) filename
      • INT(SET?) flags (OPEN, CLOSE, SPARSE, RENAME, DELETE, etc.)
      • BIGINT offset
      • LONGBLOB data
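      The flags column maps naturally onto a bit set. A minimal sketch in Python — the flag names come from the list above, but the bit values are pure assumptions:

```python
from enum import IntFlag

class BackupFlag(IntFlag):
    # bit values are illustrative assumptions, not a real protocol definition
    OPEN = 1
    CLOSE = 2
    SPARSE = 4
    RENAME = 8
    DELETE = 16

# one result-set row: (filename, flags, offset, data)
row = ("ibdata1", BackupFlag.OPEN | BackupFlag.SPARSE, 0, b"")
```

Encoding the flags as a SET/bit mask lets a single row carry several events (e.g. open a sparse file) without widening the result set.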

      The client

      There will be a simple client program which will issue the BACKUP statement, read the result set from the server, and apply it to create a directory with a copy of the database. Thanks to the ability to use the client API (in C, .NET, Java, etc.), it should be relatively easy to write specialized backup applications for special needs (e.g. streaming to the cloud, distributing to multiple machines).
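      The client's apply loop could look roughly like this sketch — flags are modeled as plain string sets for readability; this illustrates the row semantics described above, not the real client:

```python
import os

def apply_backup_rows(rows, target_dir):
    """Recreate files from a stream of (filename, flags, offset, data) rows."""
    handles = {}
    for filename, flags, offset, data in rows:
        path = os.path.join(target_dir, filename)
        if "OPEN" in flags:
            # first row for a file creates it (and any parent directories)
            os.makedirs(os.path.dirname(path), exist_ok=True)
            handles[filename] = open(path, "wb")
        if data:
            # offset-addressed writes allow out-of-order and sparse chunks
            handles[filename].seek(offset)
            handles[filename].write(data)
        if "CLOSE" in flags:
            handles.pop(filename).close()
```

Because the rows are offset-addressed rather than strictly sequential, the same loop handles sparse files and engines that revisit a file.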

      Interface between server and storage engines

      Most of the actual work is done by the storage engines. The server initiates a backup by calling the storage engine's backup_start() function. For every file chunk, the storage engine calls the server's backup_file(filename, flag, offset, length, data) function, which streams the file chunk to the client as a result set row. When the backup is finished, the engine calls backup_end() so that the server knows the backup is done.
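      The calling contract can be simulated end-to-end. Here server and engine are toy Python stand-ins for the real C interfaces — the function names follow the text above, everything else (flag strings, chunk size) is assumed:

```python
class ServerSide:
    """Collects backup_file() calls the way the server would stream rows."""
    def __init__(self):
        self.rows = []
        self.finished = False

    def backup_file(self, filename, flag, offset, length, data):
        # in the real server this becomes one result-set row to the client
        self.rows.append((filename, flag, offset, length, data))

    def backup_end(self):
        self.finished = True

def engine_backup(server, files):
    """Toy backup_start(): emit every file as OPEN, chunked DATA, CLOSE."""
    for name, content in files.items():
        server.backup_file(name, "OPEN", 0, 0, b"")
        for off in range(0, len(content), 4):   # tiny chunks for illustration
            chunk = content[off:off + 4]
            server.backup_file(name, "DATA", off, len(chunk), chunk)
        server.backup_file(name, "CLOSE", 0, 0, b"")
    server.backup_end()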

      The default implementation of backup_start() enumerates the engine's own files, reads them, and calls backup_file() for every chunk read. FLUSH TABLES FOR BACKUP must be issued for that.
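      That default implementation could be sketched as follows — chunk size and flag names are assumptions, and the files are assumed stable because FLUSH TABLES FOR BACKUP was issued first:

```python
def default_backup_start(paths, backup_file, chunk_size=1 << 20):
    """Enumerate the engine's files and emit one backup_file() call per chunk."""
    for path in paths:
        backup_file(path, "OPEN", 0, 0, b"")
        with open(path, "rb") as f:
            offset = 0
            while chunk := f.read(chunk_size):
                backup_file(path, "DATA", offset, len(chunk), chunk)
                offset += len(chunk)
        backup_file(path, "CLOSE", 0, 0, b"")
```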

      Engines that support online backup, like InnoDB, would have a much more complicated implementation.

      Incremental and partial backups

      Incremental/differential backups (changes since a specific backup) and partial backups (only some tables) will be supported as well, and the client-server interface does not change much for that. For incremental backups, the client application might need to invent some "patch" format, or use a standard for binary diffs (rdiff? bsdiff? xdelta?), and save the diffs to be applied to the "base backup" directory later.
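      A naive block-level stand-in for such a patch format — real tools like bsdiff are far smarter; this only illustrates the save-diffs-then-apply idea:

```python
BLOCK = 4096

def make_patch(base, new, block=BLOCK):
    """Record only the blocks of `new` that differ from `base`."""
    changed = [(off, new[off:off + block])
               for off in range(0, len(new), block)
               if base[off:off + block] != new[off:off + block]]
    return changed, len(new)

def apply_patch(base, patch):
    """Rebuild `new` from the base backup plus the recorded blocks."""
    changed, new_len = patch
    out = bytearray(base[:new_len].ljust(new_len, b"\0"))
    for off, data in changed:
        out[off:off + len(data)] = data
    return bytes(out)
```

Storing the total length alongside the changed blocks lets the patch handle files that grew or shrank since the base backup.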

      Backup metadata

      There is a need to store different kinds of metadata in the backup, usually the binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as a session variable available after the BACKUP command, or sent as an extra result set after the backup one (the protocol does support multiple result sets).
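      The extra-result-set variant could be consumed on the client side like this sketch — the metadata names and values here are hypothetical placeholders:

```python
def split_backup_results(result_sets):
    # assumption: the final result set carries (name, value) metadata rows
    *data_sets, meta_rows = result_sets
    return data_sets, dict(meta_rows)

result_sets = [
    [("ibdata1", "OPEN", 0, b"")],              # backup stream rows
    [("binlog_file", "master-bin.000001"),      # hypothetical metadata rows
     ("binlog_pos", "4"),
     ("lsn", "987654")],
]
streams, meta = split_backup_results(result_sets)
```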

      Restore

      Unlike the current solution with mariabackup or xtrabackup, there won't be any kind of "prepare" or "restore" application.

      The directory created by a full backup should be directly usable for a new database.

      Files from partial backups could just be copied to the destination (and maybe IMPORT TABLESPACE, if we still support that).
      There should still be some kind of "patch" application for applying incremental backups to the base directory. Depending on how our client stores incremental backups, we can do without it, and users would only need third-party bsdiff, rdiff, or mspatcha.

    Attachments

    Issue Links

    Activity

            wlad Vladislav Vaintroub created issue -
            wlad Vladislav Vaintroub made changes -
            Field Original Value New Value
            Component/s Backup [ 13902 ]
            Fix Version/s 10.4 [ 22408 ]
            wlad Vladislav Vaintroub made changes -
            Description The purpose of this work is to improve current situation around backups by impolementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, etc)
            * BIGINT offset
            * BIGINT length
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            wlad Vladislav Vaintroub made changes -
            Description The purpose of this work is to improve current situation around backups by impolementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, etc)
            * BIGINT offset
            * BIGINT length
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            The purpose of this work is to improve current situation around backups by implementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, etc)
            * BIGINT offset
            * BIGINT length
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            wlad Vladislav Vaintroub made changes -
            Description The purpose of this work is to improve current situation around backups by implementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, etc)
            * BIGINT offset
            * BIGINT length
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            The purpose of this work is to improve current situation around backups by implementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, etc)
            * BIGINT offset
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            wlad Vladislav Vaintroub made changes -
            Description The purpose of this work is to improve current situation around backups by implementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, etc)
            * BIGINT offset
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            The purpose of this work is to improve current situation around backups by implementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, RENAME, DELETE, etc)
            * BIGINT offset
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

            Unlike the current solution with mariabackup or xtrabackup, there won't be any kinds of "prepare" or "restore" application,
             
            The directory that was made when full backup was stored should be usable for new database.

            files from partial backups could just be copied to destination (and maybe "IMPORT TABLESPACE", if we still support that).
             There should still be some kind of "patch" application for applying incremental backups to the base directory. Dependend on how our client stores incremental backup, we can do without it , and users would only need to use 3rd party bsdiff, rdiff,, mspatcha.
            zhangyuan zhangyuan made changes -
            Attachment xtrabackup.png [ 45264 ]
            zhangyuan zhangyuan made changes -
            Attachment xtrabackup.png [ 45264 ]
            zhangyuan zhangyuan made changes -
            Attachment xtrabackup.png [ 45265 ]
            zhangyuan zhangyuan made changes -
            Attachment xtrabackup.png [ 45265 ]
            zhangyuan zhangyuan made changes -
            Attachment xtrabackup.png [ 45266 ]
            ralf.gebhardt Ralf Gebhardt made changes -
            Priority Major [ 3 ] Minor [ 4 ]
            ralf.gebhardt Ralf Gebhardt made changes -
            Priority Minor [ 4 ] Major [ 3 ]
            julien.fritsch Julien Fritsch made changes -
            Epic Link PT-75 [ 68556 ]
            julien.fritsch Julien Fritsch made changes -
            Epic Link PT-75 [ 68556 ] PT-77 [ 68558 ]
            ralf.gebhardt Ralf Gebhardt made changes -
            Rank Ranked higher
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Summary SQL command for BACKUP BACKUP: in-server backup
            ralf.gebhardt Ralf Gebhardt made changes -
            Epic Link PT-77 [ 68558 ] PT-92 [ 69417 ]
            ralf.gebhardt Ralf Gebhardt made changes -
            Fix Version/s 10.4 [ 22408 ]
            GeoffMontee Geoff Montee (Inactive) made changes -
            GeoffMontee Geoff Montee (Inactive) made changes -
            ralf.gebhardt Ralf Gebhardt made changes -
            NRE Projects RM_long_term RM_platform RM_long_term RM_platform RM_105_CANDIDATE
            GeoffMontee Geoff Montee (Inactive) made changes -
            GeoffMontee Geoff Montee (Inactive) made changes -
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            vlad.lesin Vladislav Lesin made changes -
            serg Sergei Golubchik made changes -
            Workflow MariaDB v3 [ 84950 ] MariaDB v4 [ 130766 ]
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            markus makela markus makela made changes -
            markus makela markus makela made changes -
            toddstoffel Todd Stoffel (Inactive) made changes -
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            marko Marko Mäkelä made changes -
            AirFocus AirFocus made changes -
            Description The purpose of this work is to improve current situation around backups by implementing a SQL BACKUP command.

            h1. Interface between client and server
            BACKUP command sends the file segments to the client via MySQL's protocol ResultSet, i.e client issues BACKUP statement, server sends a result set back

            The ResultSet consists of following fields
            * VARCHAR(N) filename
            * INT(SET?) flags (OPEN,CLOSE, SPARSE, RENAME, DELETE, etc)
            * BIGINT offset
            * LONGBLOB data

            h2.The client
            There will be a simple client program which will issue "BACKUP" statement ,read the result set from server, and apply it to create a directory with a copy of the database. Due to abiliity to use client API (in C, .NET, Java, etc), it should be relatively easy to write specialized backup applications for special needs (e.g "streaming to clould", distributing to multiple machines)

            h1. Interface between server and storage engines.
            Most of the actual work is done by the storage engines, server initiates backup by calling storage engine's _backup()_begin_ function. For every file chunk, storage engine calls server's _backup_file(filename, flag, offset, length, data)_ function, that streams file chink as result set row to client. When backup is finished, engine calls `backup_end()` so that server would know when backup is finally done.

            There default implementation of _backup_start()_ enumerates engines own files, reads them, and signals _backup_file()_ for every chunk read. _FLUSH TABLES FOR BACKUP_, must be issued for that.

            Engines that support online backup, like Innodb would have much more complicated implementation.

            h1.Incremental and partial backups
            Incremental/differential (changes since specific backup) and partial backups( only some tables) will be supported as well, and the client-server interface does not change much for that . For incremental backup, the client application might need to invent some "patch" format, or use some standard for binary diffs (rdiff? bsdiff? xdelta? ), and save diffs to be applied to "base backup" directory later.

            h1. Backup metadata.
            There is a need to store different kind of metadata in the backup, usually binlog position, and some kind of timestamp (LSN) for incremental backup. This information can be stored as session variable available after BACKUP command, or send as extra result set after the backup one (protocol does support multi-result sets).

            h1. Restore

Unlike the current solution with mariabackup or xtrabackup, there won't be any kind of "prepare" or "restore" application.
             
The directory created by a full backup should be usable as-is as the data directory of a new database instance.

Files from partial backups could simply be copied to the destination (possibly followed by "IMPORT TABLESPACE", if we still support that).
There may still need to be some kind of "patch" application for applying incremental backups to the base directory. Depending on how our client stores incremental backups, we may be able to do without one, in which case users would only need third-party tools such as bsdiff, rdiff, or mspatcha.
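If the client stores incremental backups as per-file (offset, data) chunks (one hypothetical storage choice, not mandated by this spec), applying them to the base directory could be as simple as:

```python
import os

def apply_incremental(base_dir, patches):
    """patches: {filename: [(offset, data), ...]} as produced by a hypothetical
    incremental-backup client; rewrites the base backup's files in place."""
    for filename, chunks in patches.items():
        path = os.path.join(base_dir, filename)
        with open(path, 'r+b') as f:     # file must already exist in the base backup
            for offset, data in chunks:
                f.seek(offset)
                f.write(data)
```

A diff-based storage format (bsdiff, rdiff, xdelta) would instead require running the corresponding patch tool per file.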

People

  Assignee: Debarun Banerjee
  Reporter: Vladislav Vaintroub