Details

    • Type: Bug
    • Status: Stalled
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 10.4(EOL), 10.5, 10.6
    • Fix Version/s: 10.11
    • Component/s: Galera, Tests
    • Labels: None

    Description

      Galera test case cleanup

      • Make sure that every test failure has an open MDEV or MENT.
      • Make sure that disabled.def files are up-to-date.
      • Monitor buildbot and Jenkins tests periodically for failing test cases.
      • Pick one linked here and fix it.

          Activity

            Note: Locally I get even more failures, apparently related to a newer SSL library, like these:

            CURRENT_TEST: galera_sr.MDEV-27615
            mysqltest: At line 47: query 'reap' failed with wrong errno <Unknown> (2026): 'TLS/SSL error: unexpected eof while reading', instead of <Unknown> (2013)...
             
            galera_3nodes.galera_dynamic_protocol 'innodb' w4 [ fail ]  Found warnings/errors in server log file!
                    Test ended at 2022-12-16 11:29:19
            line
            2022-12-16 11:28:58 0 [Warning] WSREP: read_completion_condition(): unexpected eof while reading (167772454: 'error:0A000126:SSL routines::unexpected eof while reading')
            2022-12-16 11:28:58 0 [Warning] WSREP: read_handler(): unexpected eof while reading (167772454: 'error:0A000126:SSL routines::unexpected eof while reading')
            ^ Found warnings in /dev/shm/10.10/mysql-test/var/4/log/mysqld.3.err
            

            While I was looking for the Debian Sid failures on buildbot, I noticed lots of crashes on kvm-deb-sid-x86 like this:

            CURRENT_TEST: galera.galera_bf_abort_ps_bind
            mysqltest: At line 41: query 'update t set j='node2' where i=1' failed: <Unknown> (2013): Lost connection to server during query
            …
            #10 0x00ed8958 in vsnprintf (__ap=0xb0d93a68 "\002", __fmt=0x13d53c4 "wsrep_commit_empty for %llu client_state %s client_mode %s trans_state %s sql %s", __n=128, __s=0xb0d939bc "wsrep_commit_empty for 82112017988583426 client_state high priority client_mode executing trans_state NULL sql ") at /usr/include/i386-linux-gnu/bits/stdio2.h:85
            #11 WSREP_LOG (fun=0xcfa120 <sql_print_information(char const*, ...)>, fmt=0x13d53c4 "wsrep_commit_empty for %llu client_state %s client_mode %s trans_state %s sql %s") at ./sql/wsrep_mysqld.cc:299
            #12 0x00edec83 in wsrep_commit_empty (thd=0xb08006c8, all=true) at ./sql/wsrep_mysqld.cc:3693
            

            The cause of the SIGSEGV is a vsnprintf format string mismatch:

            void wsrep_commit_empty(THD* thd, bool all)
            {
              DBUG_ENTER("wsrep_commit_empty");
              WSREP_DEBUG("wsrep_commit_empty for %llu client_state %s client_mode"
                          " %s trans_state %s sql %s",
                          thd_get_thread_id(thd),
            …
            

            The function thd_get_thread_id() returns unsigned long, not unsigned long long, so the first parameter in the format string should be %lu instead of %llu. No warning is issued at compilation time, due to the lack of MDEV-21978.

            Comment added by Marko Mäkelä (marko).
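The mismatch described in the comment above can be illustrated with a minimal, self-contained sketch. It is not MariaDB code: `demo_log()`, `demo_thread_id()`, `format_thread_id()` and the `PRINTF_CHECK` macro are hypothetical stand-ins for the server's logging helper and for `thd_get_thread_id()`. The sketch assumes a GCC-compatible compiler, where the `format(printf, ...)` function attribute lets `-Wformat` check variadic arguments against the format string. On a 32-bit target such as the i386 builder in the backtrace, `unsigned long` is 32 bits wide while `%llu` consumes 64 bits, so every following `%s` reads the wrong argument.

```c
#include <assert.h>
#include <stdarg.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical macro: enable printf-style argument checking where supported. */
#if defined(__GNUC__) || defined(__clang__)
# define PRINTF_CHECK(fmt_idx, arg_idx) \
  __attribute__((format(printf, fmt_idx, arg_idx)))
#else
# define PRINTF_CHECK(fmt_idx, arg_idx)
#endif

/* Hypothetical logging helper. With the attribute, passing an unsigned long
   for a %llu conversion produces a -Wformat warning at compile time. */
static int demo_log(char *buf, size_t size, const char *fmt, ...)
  PRINTF_CHECK(3, 4);

static int demo_log(char *buf, size_t size, const char *fmt, ...)
{
  va_list ap;
  int n;

  assert(buf != NULL && size > 0);
  va_start(ap, fmt);
  n = vsnprintf(buf, size, fmt, ap);
  va_end(ap);
  return n;
}

/* Hypothetical stand-in for a function that returns unsigned long. */
static unsigned long demo_thread_id(void)
{
  return 42UL;
}

/* Correct pairing: %lu for unsigned long. Using %llu here would be
   undefined behaviour and can shift the later %s arguments. */
int format_thread_id(char *buf, size_t size, const char *state)
{
  return demo_log(buf, size,
                  "wsrep_commit_empty for %lu client_state %s",
                  demo_thread_id(), state);
}
```

An alternative that keeps `%llu` is to cast the argument explicitly to `unsigned long long`; either way the conversion specifier and the argument type must agree.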

            jplindst, there were more commits disabling more tests. Please check them all before closing the bug.

            Comment added by Sergei Golubchik (serg).

            Moreover, many of the disabled tests refer only to this ticket, and hence this ticket should not be closed before all such tests have either been re-enabled or associated with more specific tickets.

            Comment added by Marko Mäkelä (marko).
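Finding the disabled tests that refer only to a given ticket could be partly automated. The sketch below assumes that each `disabled.def` entry has the form `test_name : reason` and that lines starting with `#` are comments. The function name `disabled_line_mentions()` and the ticket id `MDEV-12345` in the usage note are hypothetical placeholders, not taken from the source tree.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/* Returns true if a disabled.def-style line ("test_name : reason") disables
   a test and its reason text mentions the given ticket id.
   Comment lines, blank lines and lines without a colon are ignored.
   Note: this is a plain substring match, so "MDEV-1234" also matches
   "MDEV-12345"; a stricter check would look at the following character. */
bool disabled_line_mentions(const char *line, const char *ticket)
{
  const char *colon;

  assert(line != NULL && ticket != NULL);

  /* Skip leading whitespace. */
  while (*line == ' ' || *line == '\t')
    line++;

  /* Ignore comments and empty lines. */
  if (*line == '#' || *line == '\0' || *line == '\n')
    return false;

  /* The reason is everything after the first colon. */
  colon = strchr(line, ':');
  if (colon == NULL)
    return false;

  return strstr(colon + 1, ticket) != NULL;
}
```

A caller would read each `disabled.def` file line by line (for example with `fgets()`) and report the entries for which `disabled_line_mentions(line, "MDEV-12345")` is true, so that each can be re-enabled or linked to a more specific ticket.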

            The test galera_sr.MDEV-25718 is still disabled in 10.11.

            Comment added by Marko Mäkelä (marko).

            The test galera_sr.MDEV-25718 is still disabled in 10.11. It currently is the only disabled test in that suite.

            Comment added by Marko Mäkelä (marko).

            People

              janlindstrom Jan Lindström
              jplindst Jan Lindström (Inactive)
