Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-9083

Slave IO thread does not handle autoreconnect to restarting Galera Cluster node

Details

    Description

      Hello,

      Considering the following architecture:

      • N-nodes MariaDB Galera Cluster
      • Standalone MariaDB server replicating from any above cluster node

      In the case where the master node is restarted, the slave stops with this error:
      1593 The slave I/O thread stops because a fatal error is encountered when it tried to SELECT @master_binlog_checksum. Error: WSREP has not yet prepared node for application use

      It has to be restarted manually (START SLAVE) in order to reconnect

      As Galera replication implies that, when a node is restarted, there is a short time when the node is available on port 3306, but the node cannot accept commands because it is processing an Incremental State Transfer request, I propose to have the slave thread autoreconnect instead of stopping when it encounters this particular error.

      Attachments

        Issue Links

          Activity

            Transition Time In Source Status Execution Times
            Nirbhay Choubey (Inactive) made transition -
            Open Confirmed
            218d 3h 28m 1
            Nirbhay Choubey (Inactive) made transition -
            Confirmed In Progress
            3d 1h 39m 1
            Nirbhay Choubey (Inactive) made transition -
            In Progress Closed
            19s 1

            People

              nirbhay_c Nirbhay Choubey (Inactive)
              tanj Guillaume Lefranc
              Votes:
              1 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.