Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-26811

Assertion "log_sys.n_pending_flushes == 1" fails in undo_truncate test, on shutdown

Details

    Description

      http://buildbot.askmonty.org/buildbot/builders/winx64-packages/builds/27562/steps/test/logs/stdio

      Looks like one thread is doing log_write_and_flush(), while another is doing log_write_flush_to_disk_low() . Also and third one is doing close_connections(), so perhaps this could be related to the recent sudden surge of crashes on shutdown in this specific test.

      Attachments

        Issue Links

          Activity

            Thank you. In the stack traces, I see that we have log_checkpoint() executing log_write_flush_to_disk_low() where we have a similar assertion (which did not fail):

            	ut_a(log_sys.n_pending_flushes == 1); /* No other threads here */
             
            	bool	do_flush = srv_file_flush_method != SRV_O_DSYNC;
             
            	if (do_flush) {
            		fil_flush(SRV_LOG_SPACE_FIRST_ID);
            	}
             
             
            	log_mutex_enter();
            

            The thread where the assertion failed in log_write_and_flush() is holding the mutex and thus blocking this thread.

            It seems to me that the == 1 must be removed from both assertions. Invoking fil_flush() concurrently from multiple threads is safe, because it is protected by fil_system.mutex.

            marko Marko Mäkelä added a comment - Thank you. In the stack traces, I see that we have log_checkpoint() executing log_write_flush_to_disk_low() where we have a similar assertion (which did not fail): ut_a(log_sys.n_pending_flushes == 1); /* No other threads here */   bool do_flush = srv_file_flush_method != SRV_O_DSYNC;   if (do_flush) { fil_flush(SRV_LOG_SPACE_FIRST_ID); }     log_mutex_enter(); The thread where the assertion failed in log_write_and_flush() is holding the mutex and thus blocking this thread. It seems to me that the == 1 must be removed from both assertions. Invoking fil_flush() concurrently from multiple threads is safe, because it is protected by fil_system.mutex .

            People

              marko Marko Mäkelä
              wlad Vladislav Vaintroub
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.