[MDEV-21952] ibdata1 file size growing in MariaDB - Jira

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Major
Resolution: Fixed
Affects Version/s: 10.2.25
Fix Version/s: 10.5.13, 10.6.5
Component/s: Storage Engine - InnoDB
Labels:
None
Environment:
MariaDB Galera 3 Node DR

Description

We are using DATETIME Interval in most of our MariaDB Tables where we have automated Data Purging Process to delete data at Interval 1 DAY , INTERVAL 1 HOUR and we have Optimize Table script which runs once in day ( ALTER TABLE <table> ENGINE=InnoDB )

It is observed ibdata1 file in mysql folder has grown to 122 GB

Do we have Tuning options or Space recovery options that we can some have workaround on ibdata1 file ?

Attachments

Issue Links

is duplicated by

MDEV-32414 [mariadb-galera] There is one node in a 3-node cluster, ibdata1, with a large file

Closed

is part of

MDEV-26445 innodb_undo_log_truncate is unnecessarily slow

Closed

relates to

MDEV-14795 InnoDB system tablespace cannot be shrunk

Closed

MDEV-19229 Allow innodb_undo_tablespaces to be changed after database creation

Closed

MDEV-26450 Corruption due to innodb_undo_log_truncate

Closed

MDEV-27734 Set innodb_change_buffering=none by default

Closed

MDEV-28187 mariadb-backup doesn't utilise innodb-undo-log-directory (if specified as a relative path) during copy-back operation

Closed

MDEV-29593 Purge misses a chance to free not-yet-reused undo pages

Closed

MDEV-29983 Deprecate innodb_file_per_table

Closed

MDEV-29986 Set innodb_undo_tablespaces=3 by default

Closed

MDEV-13013 InnoDB unnecessarily extends data files

Closed

MDEV-15528 Avoid writing freed InnoDB pages

Closed

MDEV-27153 ibdata1 file Leaking ? (errno 135) - using innodb_undo_tablespaces

Closed

mentioned in: Page Loading...

(8 relates to, 1 mentioned in)

Activity

Ascending order - Click to sort in descending order

Marko Mäkelä added a comment - 2020-03-16 09:38

The system tablespace should grow either due to the change buffer or due to undo logs. ALTER TABLE operations should not write excessive amounts of undo log in the reported version, because the ~~MDEV-11415~~ fix (affecting ALGORITHM=COPY) is already present. Native ALTER TABLE should not exercise the change buffer, but ALGORITHM=COPY could do that.

It seems to me that your question should best be answered by our support staff.

Marko Mäkelä added a comment - 2020-03-16 09:38 The system tablespace should grow either due to the change buffer or due to undo logs. ALTER TABLE operations should not write excessive amounts of undo log in the reported version, because the MDEV-11415 fix (affecting ALGORITHM=COPY ) is already present. Native ALTER TABLE should not exercise the change buffer, but ALGORITHM=COPY could do that. It seems to me that your question should best be answered by our support staff.

Tarun added a comment - 2020-03-16 15:40

I have enabled innodb_file_per_table =1 in my.cnf file , for each table *.ibd file is generated .

We are loading Data into MariaDB Galera DR using Apache Kakfa and sometime by Load ifile option

Tarun added a comment - 2020-03-16 15:40 I have enabled innodb_file_per_table =1 in my.cnf file , for each table *.ibd file is generated . We are loading Data into MariaDB Galera DR using Apache Kakfa and sometime by Load ifile option

Marko Mäkelä added a comment - 2020-03-16 16:13

You might want to consider acquiring a support contract.

Marko Mäkelä added a comment - 2020-03-16 16:13 You might want to consider acquiring a support contract .

Marko Mäkelä added a comment - 2021-08-20 06:31

During the development of ~~MDEV-15528~~, I mentioned to thiru that the mechanism of shrinking undo tablespaces that was implemented in ~~MDEV-13564~~ could be applied to any tablespace.

Undo tablespace truncation is a special case: the entire tablespace will be rewritten. But, we could use a similar mechanism for shrinking any tablespace. If the last pages of the tablespace are freed, we could shrink the file.

However, it is not trivial to shrink an arbitrary InnoDB tablespace. Even if the last pages in a tablespace are unallocated, we would want some hysteresis so that files would not be repeatedly extended (~~MDEV-13013~~) and shrunk.

Ultimately, I would like to remove the system tablespace at some point (MDEV-11633). It stores the following:

the doublewrite buffer (128 data pages at fixed location; MDEV-11659)
the TRX_SYS page that points to undo log headers and the doublewrite buffer
the DICT_HDR page that contains some global information
data dictionary tables that store the metadata of persistent InnoDB tables
the change buffer (not used if innodb_change_buffering=none)
undo logs (unless innodb_undo_tablespaces are being used)
InnoDB tables created with innodb_file_per_table=0

I think that it is easiest to configure the server so that nothing will unnecessarily be created in the system tablespace:

Disable the change buffer.
Make sure that undo logs will be stored in separate tablespaces starting with undo001.
Make sure that nothing is changing innodb_file_per_table from its default value ON.

This should prevent most fragmentation of the system tablespace. Theoretically, the data dictionary tables could still cause trouble, but I would be surprised if it were the case.

Would it be possible to defragment or pack the system tablespace?

The 128 doublewrite buffer pages as well as a few other pages are at a fixed location and must be left alone.
Undo log pages are virtually impossible to relocate, due to DB_ROLL_PTR pointing to them. But, we do support truncation of dedicated undo tablespaces.
BLOB pages are virtually impossible to relocate, because undo log records may point to them. Also, there may be multiple clustered index records pointing to the same BLOB, in case the PRIMARY KEY of a record that contains BLOBs was updated. The entire singly-linked list of BLOB pages would have to be moved in an atomic mini-transaction. Obeying the latching order while keeping all pointers to the BLOB pages synchronized could be challenging. If the BLOB is larger than the buffer pool, it could be outright impossible without changing the file format in some way.

I would like to know if this issue remains after implementing the mitigation measures: disable the change buffer, use dedicated undo tablespaces, and never use innodb_file_per_table=0. Any remaining growth should only be caused by the data dictionary tables. Those tables should not contain any BLOBs, so theoretically it should be possible to relocate pages of dictionary tables closer to the start, so that the system tablespace could be shrunk.

Note: It could be that the defragmentation feature (~~MDEV-5834~~, MDEV-18385) is only doing something about B-tree fill factor, and possibly not even attempting to move the pages to the start of the tablespace. And that feature definitely does not operate on any dictionary tables.

Marko Mäkelä added a comment - 2021-08-20 06:31 During the development of MDEV-15528 , I mentioned to thiru that the mechanism of shrinking undo tablespaces that was implemented in MDEV-13564 could be applied to any tablespace. Undo tablespace truncation is a special case: the entire tablespace will be rewritten. But, we could use a similar mechanism for shrinking any tablespace. If the last pages of the tablespace are freed, we could shrink the file. However, it is not trivial to shrink an arbitrary InnoDB tablespace. Even if the last pages in a tablespace are unallocated, we would want some hysteresis so that files would not be repeatedly extended ( MDEV-13013 ) and shrunk. Ultimately, I would like to remove the system tablespace at some point ( MDEV-11633 ). It stores the following: the doublewrite buffer (128 data pages at fixed location; MDEV-11659 ) the TRX_SYS page that points to undo log headers and the doublewrite buffer the DICT_HDR page that contains some global information data dictionary tables that store the metadata of persistent InnoDB tables the change buffer (not used if innodb_change_buffering=none ) undo logs (unless innodb_undo_tablespaces are being used) InnoDB tables created with innodb_file_per_table=0 I think that it is easiest to configure the server so that nothing will unnecessarily be created in the system tablespace: Disable the change buffer. Make sure that undo logs will be stored in separate tablespaces starting with undo001 . Make sure that nothing is changing innodb_file_per_table from its default value ON. This should prevent most fragmentation of the system tablespace. Theoretically, the data dictionary tables could still cause trouble, but I would be surprised if it were the case. Would it be possible to defragment or pack the system tablespace? The 128 doublewrite buffer pages as well as a few other pages are at a fixed location and must be left alone. Undo log pages are virtually impossible to relocate, due to DB_ROLL_PTR pointing to them. But, we do support truncation of dedicated undo tablespaces. BLOB pages are virtually impossible to relocate, because undo log records may point to them. Also, there may be multiple clustered index records pointing to the same BLOB, in case the PRIMARY KEY of a record that contains BLOBs was updated. The entire singly-linked list of BLOB pages would have to be moved in an atomic mini-transaction. Obeying the latching order while keeping all pointers to the BLOB pages synchronized could be challenging. If the BLOB is larger than the buffer pool, it could be outright impossible without changing the file format in some way. I would like to know if this issue remains after implementing the mitigation measures: disable the change buffer, use dedicated undo tablespaces, and never use innodb_file_per_table=0 . Any remaining growth should only be caused by the data dictionary tables. Those tables should not contain any BLOBs, so theoretically it should be possible to relocate pages of dictionary tables closer to the start, so that the system tablespace could be shrunk. Note: It could be that the defragmentation feature ( MDEV-5834 , MDEV-18385 ) is only doing something about B-tree fill factor, and possibly not even attempting to move the pages to the start of the tablespace. And that feature definitely does not operate on any dictionary tables.

Marko Mäkelä added a comment - 2021-09-21 15:22

Once ~~MDEV-26445~~ has been fixed, I think that it should be safe to enable InnoDB undo tablespace truncation (and multiple undo tablespaces). If you additionally disable the change buffer and ensure that no tables are being created in the system tablespace, then the system tablespace should not keep growing.

Marko Mäkelä added a comment - 2021-09-21 15:22 Once MDEV-26445 has been fixed, I think that it should be safe to enable InnoDB undo tablespace truncation (and multiple undo tablespaces). If you additionally disable the change buffer and ensure that no tables are being created in the system tablespace, then the system tablespace should not keep growing.

Marko Mäkelä added a comment - 2022-08-04 07:28

~~MDEV-27734~~ disabled the InnoDB change buffer due to known but irreproducible data corruption issues.

Marko Mäkelä added a comment - 2022-08-04 07:28 MDEV-27734 disabled the InnoDB change buffer due to known but irreproducible data corruption issues.

Marko Mäkelä added a comment - 2022-11-09 10:16

Fixing ~~MDEV-14795~~ will allow the InnoDB system tablespace to be shrunk in place. Until then, the only way to shrink it is to start from the scratch (dump and reload all InnoDB tables).

This ticket was closed because it is possible to avoid the growth of the InnoDB system tablespace by changing some configuration parameters.

Marko Mäkelä added a comment - 2022-11-09 10:16 Fixing MDEV-14795 will allow the InnoDB system tablespace to be shrunk in place. Until then, the only way to shrink it is to start from the scratch (dump and reload all InnoDB tables). This ticket was closed because it is possible to avoid the growth of the InnoDB system tablespace by changing some configuration parameters.

Marko Mäkelä added a comment - 2023-04-25 05:56

A contributing factor to the growth of the system tablespace would be the “caching” (delayed freeing) of undo log pages. The fix of ~~MDEV-29593~~ should reduce the excessive allocation of undo log pages.

That said, I would still recommend setting multiple innodb_undo_tablespaces. ~~MDEV-29986~~ changed the default to 3.

Marko Mäkelä added a comment - 2023-04-25 05:56 A contributing factor to the growth of the system tablespace would be the “caching” (delayed freeing) of undo log pages. The fix of MDEV-29593 should reduce the excessive allocation of undo log pages. That said, I would still recommend setting multiple innodb_undo_tablespaces . MDEV-29986 changed the default to 3.

People

Assignee:: Marko Mäkelä

Reporter:: Tarun

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 2020-03-16 09:16

Updated:: 2023-10-18 17:42

Resolved:: 2021-10-25 10:02

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.

MariaDB Server