[MDEV-14310] Possible corruption by table-rebuilding or index-creating ALTER TABLE…ALGORITHM=INPLACE - Jira

Details

Type: Bug
Status: Closed
Priority: Blocker
Resolution: Fixed
Affects Version/s: 10.2.11, 10.3.3
Fix Version/s: 10.2.11, 10.3.3
Component/s: Storage Engine - InnoDB
Labels:

Description

When I merged MDEV-13328 from 10.1 to 10.2, the innodb.innodb test suddenly started to fail.

The culprit seemed to be a correct-looking conflict resolution in FlushObserver::flush(). As part of the test, innodb.innodb is executing the following:

let $default=`select @@storage_engine`;
set storage_engine=INNODB;
source include/varchar.inc;

This includes the following:

--error ER_DUP_ENTRY
alter table t1 add unique(v);

The duplicate key error triggers the following code at the end of row_merge_build_indexes():

		if (error != DB_SUCCESS) {
			flush_observer->interrupted();
		}

		flush_observer->flush();

This in turn selects a dangerous parameter, BUF_REMOVE_FLUSH_NO_WRITE:

/** Flush dirty pages and wait. */
void
FlushObserver::flush()
{
	buf_remove_t	buf_remove;

	if (m_interrupted) {
		buf_remove = BUF_REMOVE_FLUSH_NO_WRITE;
	} else {
		buf_remove = BUF_REMOVE_FLUSH_WRITE;
	}
…
	/* Flush or remove dirty pages. */
	buf_LRU_flush_or_remove_pages(m_space_id, buf_remove, m_trx);

The danger here is that m_space_id=0, the system tablespace. Removing its dirty pages without writing them can discard unrelated writes to the InnoDB system tablespace, potentially corrupting the whole instance.

We must use the equivalent of BUF_REMOVE_FLUSH_WRITE for the system tablespace (and in MySQL 5.7, for any table that resides in a persistent shared tablespace). Failure to do so caused all sorts of trouble when running innodb.innodb after the merge, especially when using --innodb-buffer-pool-size=5m (the minimum).

Apparently the corruption was caused by the following sequence of events:

1. Some pages in the system tablespace are modified.
2. The ADD UNIQUE INDEX operation runs and fails, and finally removes system tablespace pages from the buf_pool->flush_list without writing them to the file (see above).
3. Some of the affected pages, never written back, are evicted from the buffer pool.
4. Such a page is eventually needed and read back into the buffer pool, with stale or all-zero contents.

Various assertions failed due to a supposedly-initialized page being all-zero.
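
The sequence of events above can be replayed on a toy model. The following is only a sketch for illustration: ToyBufferPool, flush_or_remove(), evict() and replay() are hypothetical names and do not exist in InnoDB, and a page that was never written to the file is assumed to read back as the string "zero".

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <set>
#include <string>

// Hypothetical toy model; none of these types or functions exist in InnoDB.
struct ToyBufferPool {
    std::map<uint32_t, std::string> file;    // page number -> contents on disk
    std::map<uint32_t, std::string> cached;  // pages currently in memory
    std::set<uint32_t> flush_list;           // dirty pages awaiting write-back

    // Read a page through the cache. Assumption: a page that was never
    // written to the file reads back as "zero".
    std::string read(uint32_t page_no) {
        auto it = cached.find(page_no);
        if (it != cached.end()) return it->second;
        auto on_disk = file.find(page_no);
        std::string contents = on_disk == file.end() ? "zero" : on_disk->second;
        cached[page_no] = contents;
        return contents;
    }

    // Modify a page in memory and mark it dirty.
    void write(uint32_t page_no, const std::string& contents) {
        cached[page_no] = contents;
        flush_list.insert(page_no);
    }

    // Empty the flush list, optionally writing the dirty pages back first.
    void flush_or_remove(bool write_back) {
        if (write_back) {
            for (uint32_t page_no : flush_list) file[page_no] = cached[page_no];
        }
        flush_list.clear();
    }

    // Evict a page; only a page that is not on the flush list may be dropped.
    void evict(uint32_t page_no) {
        assert(flush_list.count(page_no) == 0);
        cached.erase(page_no);
    }
};

// Run the four steps and return what the page contains when read back.
std::string replay(bool write_back) {
    ToyBufferPool pool;
    pool.write(7, "committed");        // 1. a shared-tablespace page is modified
    pool.flush_or_remove(write_back);  // 2. the failed ALTER empties the flush list
    pool.evict(7);                     // 3. the page is evicted
    return pool.read(7);               // 4. the page is read back from the file
}
```

In this model, replay(false) corresponds to the BUF_REMOVE_FLUSH_NO_WRITE case and loses the modification, while replay(true) corresponds to BUF_REMOVE_FLUSH_WRITE and preserves it.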

Note: For a failed ALTER TABLE in a tablename.ibd file, it is perfectly OK to discard the entries from the flush_list, and to subsequently mark the pages as freed, or to delete the file (if it was a table-rebuilding ALTER).
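
The rule described here can be summarized in a few lines. This is a minimal sketch of the decision only, not the patch that was committed: FlushMode, kSystemSpaceId and choose_flush_mode() are hypothetical names, and only the system tablespace (space ID 0, as stated above) is treated as shared.

```cpp
#include <cstdint>

// Hypothetical stand-in for InnoDB's buf_remove_t; only the two values
// quoted in this report are modelled.
enum class FlushMode { NoWrite, Write };

// Assumption: space ID 0 denotes the system tablespace, as stated above.
constexpr uint32_t kSystemSpaceId = 0;

// Dirty pages may be discarded only when the operation was interrupted AND
// the pages belong to a tablespace private to the altered table (an .ibd
// file). Pages of a shared tablespace must always be written.
FlushMode choose_flush_mode(uint32_t space_id, bool interrupted) {
    const bool shared = (space_id == kSystemSpaceId);
    if (interrupted && !shared) {
        return FlushMode::NoWrite;
    }
    return FlushMode::Write;
}
```

A fuller version would presumably also have to treat the persistent shared tablespaces of MySQL 5.7 as shared; that case is left out of the sketch.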

Attachments

0001-5.7-Follow-up-fix-to-MDEV-13328.patch (13 kB, 2017-11-20 08:27)
0001-Remove-redundant-function-parameters.patch (5 kB, 2017-11-20 08:28)
0002-MDEV-13328-ALTER-TABLE-DISCARD-TABLESPACE-takes-a-lo.patch (24 kB, 2017-11-20 08:28)
5.7-MDEV-13328.patch (28 kB, 2017-11-20 08:27)

Issue Links

relates to

MDEV-13328 ALTER TABLE ... DISCARD TABLESPACE takes a lot of time with large buffer pool (>128G) (Closed)
MDEV-14317 When ALTER TABLE is aborted, do not write garbage pages to data files (Closed)

People

Assignee: Marko Mäkelä
Reporter: Marko Mäkelä
Votes: 0
Watchers: 3

Dates

Created: 2017-11-07 13:17
Updated: 2017-11-20 15:36
Resolved: 2017-11-20 15:36

Time Tracking

Estimated: Not Specified
Remaining:
Logged: 1.25d
