MariaDB Server / MDEV-25114

Crash: WSREP: invalid state ROLLED_BACK (FATAL)

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 10.3.28, 10.2(EOL), 10.4(EOL), 10.5
    • Fix Version/s: 10.2.41, 10.3.32, 10.4.22, 10.5.13
    • Component/s: Galera
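
    The four patch releases listed in the Details (10.2.41, 10.3.32, 10.4.22, 10.5.13) read as the per-series releases that carry the fix, and a later comment notes that 10.6.0 was fixed separately by MDEV-24915. As a rough sketch only, a version string could be checked against those numbers as below. The helper name `has_mdev_25114_fix` and the table `FIXED_IN` are hypothetical and not part of MariaDB; treating every release from 10.6 onward as fixed is an assumption drawn from that comment.

```python
from typing import Optional

# Hypothetical table built from the versions listed in this issue:
# (major, minor) series -> first patch release that carries the fix.
FIXED_IN = {(10, 2): 41, (10, 3): 32, (10, 4): 22, (10, 5): 13}


def has_mdev_25114_fix(version: str) -> Optional[bool]:
    """Return True/False for a known series, or None if the issue does not say.

    Accepts strings such as "10.3.28" or "10.3.32-MariaDB-log".
    """
    # Drop any suffix such as "-MariaDB-log", then split into numbers.
    numeric = version.split("-", 1)[0]
    major, minor, patch = (int(part) for part in numeric.split(".")[:3])

    # Assumption: 10.6.0 and everything after it contains the MDEV-24915 fix.
    if (major, minor) >= (10, 6):
        return True

    first_fixed_patch = FIXED_IN.get((major, minor))
    if first_fixed_patch is None:
        # Series not mentioned in this issue (for example 10.1).
        return None
    return patch >= first_fixed_patch


if __name__ == "__main__":
    for v in ("10.3.28", "10.3.32-MariaDB", "10.6.0", "10.1.48"):
        print(v, has_mdev_25114_fix(v))
```

    A result of None means this issue gives no information about that series, not that the series is safe.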

    Description

      About 29 hours after updating a previously very stable three-server MariaDB Galera cluster from 10.3.27 to 10.3.28, one of the nodes crashed with the following message:

      2021-03-10 18:22:48 0 [ERROR] WSREP: invalid state ROLLED_BACK (FATAL)
               at /home/buildbot/buildbot/build/galera/src/replicator_smm.cpp:abort_trx():735
      2021-03-10 18:22:48 0 [ERROR] WSREP: cancel commit bad exit: 7 33792346039
      210310 18:22:48 [ERROR] mysqld got signal 6 ;
      

      Attached is a bit more of the log (the server happened to be running with conflict logging enabled), but I don't think there's much more I can provide right now. I'm filing this anyway because the cluster had been running very stably for months before the update to 10.3.28, and this could be somehow related to MDEV-25111, which we also first encountered right after updating to 10.3.28.

          Activity

            jplindst Jan Lindström (Inactive) added a comment - edited

            serg Can you please review the following:

              • 10.2 branch: bb-10.2-KILL-as-TOI-galera, commit: https://github.com/MariaDB/server/commit/6d0c1f3ae12593470f2a556c33e586bc08c677e8
              • 10.3 branch: bb-10.3-KILL-as-TOI-galera, commit: https://github.com/MariaDB/server/commit/ed85dc379fe686ee2bd8707826864e3d7c62877c

            jplindst Jan Lindström (Inactive) added a comment - edited

            Final versions:

              • branch: bb-10.2-KILL-as-TOI-galera, commit: https://github.com/MariaDB/server/commit/f3e5e3d897c90dcf04586e57c6bec5ef6fabb8e5
              • branch: bb-10.3-KILL-as-TOI-galera, commit: https://github.com/MariaDB/server/commit/002192b92cf3c9e206d711ca36cdf6694a457a8f
              • branch: bb-10.4-KILL-as-TOI-galera, commit: https://github.com/MariaDB/server/commit/778d88ea2d9ec9b2083a6eaadea5d31f9571ff14
              • branch: bb-10.5-KILL-as-TOI-galera, commit: https://github.com/MariaDB/server/commit/2c8f52ea53d7532e395fa92c3b0e9c5dfb619403

            jplindst Jan Lindström (Inactive) added a comment

            rebase, testing and push 10.5
            Baroti Steve Baroti added a comment

            OMG, we have been waiting for this fix, thank you! After September's 10.3.24 -> 10.3.31 upgrade, our three-node dev cluster (10.3.31) has had the mariadb service crashing on each node every one or two hours. We wanted to revert, but we have too much data to restore. A festival of restarting mariadb nodes! Thanks again for your great efforts! Fingers crossed!

            marko Marko Mäkelä added a comment

            In 10.6.0, this was fixed in a simpler way by MDEV-24915.

            People

              Assignee: jplindst Jan Lindström (Inactive)
              Reporter: emaijala Ere Maijala
              Votes: 16
              Watchers: 37
