[MDEV-32588] InnoDB may hang when running out of buffer pool - Jira

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Blocker
Resolution: Fixed
Affects Version/s: 10.6, 10.10(EOL), 10.11, 11.0(EOL), 11.1(EOL), 11.2(EOL)
Fix Version/s: 10.6.16, 10.10.7, 10.11.6, 11.0.4, 11.1.3, 11.2.2
Component/s: Storage Engine - InnoDB
Labels:
- corruption
- crash
- hang
- race
- regression

Description

While running performance tests with a small buffer pool, I encountered an anomaly where InnoDB would hang because of running out of buffer pool. There would be some actually clean blocks at buf_pool.flush_list.start, but these would be skipped by buf_flush_LRU_list_batch(). The buf_pool.flush_list.end is being trimmed by periodic calls to buf_pool.get_oldest_modification(). I suspect that the entire buf_pool.LRU would be skipped because of being buffer-fixed, latched, or registered in buf_pool.flush_list due to the ~~MDEV-26010~~ optimization.

This seems to be a 10.6 regression due to ~~MDEV-26827~~.

~~MDEV-32050~~ would make this problem worse by allowing the purge_coordinator_task to buffer-fix a large number of pages.

Attachments

Issue Links

is caused by

MDEV-26827 Make page flushing even faster

Closed

relates to

MDEV-33613 InnoDB may still hang when temporarily running out of buffer pool

Closed

MDEV-26010 Assertion `lsn > 2' failed in buf_pool_t::get_oldest_modification

Closed

MDEV-32050 UNDO logs still growing for write-intensive workloads

Closed

MDEV-33508 Performance regression due to frequent scan of full buf_pool.flush_list

Closed

Activity

Ascending order - Click to sort in descending order

Marko Mäkelä added a comment - 2023-10-26 10:30

I think that the following patch fixes this.

diff a/storage/innobase/buf/buf0flu.cc b/storage/innobase/buf/buf0flu.cc

--- a/storage/innobase/buf/buf0flu.cc

+++ b/storage/innobase/buf/buf0flu.cc

@@ -1246,16 +1246,14 @@ static void buf_flush_LRU_list_batch(ulint max, bool evict,

     ut_ad(state >= buf_page_t::FREED);

     ut_ad(bpage->in_LRU_list);

-    switch (bpage->oldest_modification()) {

-    case 0:

+    if (!bpage->oldest_modification())

+    {

     evict:

       if (state != buf_page_t::FREED &&

           (state >= buf_page_t::READ_FIX || (~buf_page_t::LRU_MASK & state)))

         continue;

       buf_LRU_free_page(bpage, true);

       ++n->evicted;

-      /* fall through */

-    case 1:

       if (UNIV_LIKELY(scanned & 31))

         continue;

       mysql_mutex_unlock(&buf_pool.mutex);

@@ -1271,7 +1269,11 @@ static void buf_flush_LRU_list_batch(ulint max, bool evict,

       switch (bpage->oldest_modification()) {

       case 1:

         mysql_mutex_lock(&buf_pool.flush_list_mutex);

-        buf_pool.delete_from_flush_list(bpage);

+        if (ut_d(lsn_t lsn=) bpage->oldest_modification())

+        {

+          ut_ad(lsn == 1); /* It must be clean while we hold bpage->lock */

+          buf_pool.delete_from_flush_list(bpage);

+        }

         mysql_mutex_unlock(&buf_pool.flush_list_mutex);

         /* fall through */

       case 0:

Before ~~MDEV-26827~~, we were acting upon oldest_modification==1 while already holding buf_pool.flush_list_mutex. Both regressions were introduced by me in ~~MDEV-26827~~.

Marko Mäkelä added a comment - 2023-10-26 10:30 I think that the following patch fixes this. diff a/storage/innobase/buf/buf0flu.cc b/storage/innobase/buf/buf0flu.cc --- a/storage/innobase/buf/buf0flu.cc +++ b/storage/innobase/buf/buf0flu.cc @@ -1246,16 +1246,14 @@ static void buf_flush_LRU_list_batch(ulint max, bool evict, ut_ad(state >= buf_page_t::FREED); ut_ad(bpage->in_LRU_list); - switch (bpage->oldest_modification()) { - case 0: + if (!bpage->oldest_modification()) + { evict: if (state != buf_page_t::FREED && (state >= buf_page_t::READ_FIX || (~buf_page_t::LRU_MASK & state))) continue; buf_LRU_free_page(bpage, true); ++n->evicted; - /* fall through */ - case 1: if (UNIV_LIKELY(scanned & 31)) continue; mysql_mutex_unlock(&buf_pool.mutex); @@ -1271,7 +1269,11 @@ static void buf_flush_LRU_list_batch(ulint max, bool evict, switch (bpage->oldest_modification()) { case 1: mysql_mutex_lock(&buf_pool.flush_list_mutex); - buf_pool.delete_from_flush_list(bpage); + if (ut_d(lsn_t lsn=) bpage->oldest_modification()) + { + ut_ad(lsn == 1); /* It must be clean while we hold bpage->lock */ + buf_pool.delete_from_flush_list(bpage); + } mysql_mutex_unlock(&buf_pool.flush_list_mutex); /* fall through */ case 0: Before MDEV-26827 , we were acting upon oldest_modification==1 while already holding buf_pool.flush_list_mutex . Both regressions were introduced by me in MDEV-26827 .

Marko Mäkelä added a comment - 2023-10-26 10:35

In my tests on a non-debug build, the race condition that is fixed by the second hunk of the patch caused a shutdown hang as well as a

InnoDB: Failing assertion: list.count > 0

in buf_pool_t::insert_into_flush_list() during a mtr_t::commit().

Marko Mäkelä added a comment - 2023-10-26 10:35 In my tests on a non-debug build, the race condition that is fixed by the second hunk of the patch caused a shutdown hang as well as a InnoDB: Failing assertion: list.count > 0 in buf_pool_t::insert_into_flush_list() during a mtr_t::commit() .

People

Assignee:: Marko Mäkelä

Reporter:: Marko Mäkelä

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 2023-10-26 10:07

Updated:: 2024-08-08 06:37

Resolved:: 2023-10-27 13:05

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.

MariaDB Server