  MariaDB Server / MDEV-13906

Crash during WSREP recovery



    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 10.1.25, 10.1.26
    • Fix Version/s: N/A
    • Component/s: wsrep
    • Labels: None
    • Environment: Ubuntu 16.04.2, MariaDB 10.1.25 and 10.1.26


      We have a 3-node Galera Cluster, and this weekend we rebooted each node, one at a time, for an EC2 instance upgrade. When the servers came back online, each one crashed with signal 11 while trying to rejoin the cluster.

      The crash occurs during WSREP recovery. If I set wsrep_on=OFF, MySQL starts up without crashing, but it crashes again when I dynamically set wsrep_on=ON.

      Nothing shows up in the other two nodes' logs while the joining node is starting up, before it crashes, and all ports are open between the Galera nodes. Each node is running MariaDB 10.1.25. I upgraded one node to 10.1.26 to see whether the problem was fixed there, and it exhibited the same behavior. The only way I was able to get the nodes to rejoin the cluster was to force an SST. However, the data directory is 1.8 TB, so that is far from ideal for every node restart.

      I've attached the wsrep_recovery log and the apport crash file, but for some reason the crash file doesn't contain the core dump. I've also uploaded my.cnf and a mariadb.cnf config file containing the Galera Cluster related config options.
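
      For anyone triaging a similar crash, here is a minimal sketch for checking whether a wsrep recovery log recorded a position at all. It assumes the log contains the usual "WSREP: Recovered position: <uuid>:<seqno>" line when recovery succeeds. The function name recovered_position is hypothetical and is not part of MariaDB or Galera. It has not been run against the log attached to this issue.

```python
import re
from typing import Optional, Tuple

# Assumed log format: "WSREP: Recovered position: <uuid>:<seqno>".
# The UUID is 36 characters (hex digits and dashes); seqno may be -1.
_RECOVERED = re.compile(
    r"WSREP: Recovered position:\s*([0-9a-fA-F-]{36}):(-?\d+)"
)


def recovered_position(log_text: str) -> Optional[Tuple[str, int]]:
    """Return (uuid, seqno) from the last recovered-position line.

    Returns None when no such line is present, which would suggest that
    recovery stopped (for example, crashed) before reporting a position.
    """
    matches = _RECOVERED.findall(log_text)
    if not matches:
        return None
    uuid, seqno = matches[-1]
    return uuid, int(seqno)
```

      Reading the attached log file into a string and passing it to recovered_position would show whether the crash happened before or after the position was reported. A None result only indicates that the line is missing; it does not identify the cause.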


        1. _usr_sbin_mysqld.0.crash (17 kB)
        2. mariadb.cnf (0.9 kB)
        3. my.cnf (2 kB)
        4. wsrep_recovery.KbkcqG (5 kB)

              Assignee: jplindst Jan Lindström (Inactive)
              Reporter: btraywick Bryan Traywick