Uploaded image for project: 'MariaDB MaxScale'
  1. MariaDB MaxScale
  2. MXS-6050

MaxScale does not log the root cause when a switchover/failover fails

    XMLWordPrintable

Details

    • Bug
    • Status: Closed (View Workflow)
    • Major
    • Resolution: Duplicate
    • None
    • 23.08.12
    • mariadbmon
    • None
    • MXS-SPRINT-251, MXS-SPRINT-252

    Description

      We are requesting an enhancement to the mariadbmon logging in MaxScale. Currently, when a switchover or failover event fails, the error message provides no diagnostic information regarding why the operation failed.

      Current Behavior: The log simply reports a generic failure: error :

      [mariadbmon] Switchover test-mariadb-0 -> test-mariadb-2 failed.
      

      Desired Behavior: The error log should include specific details about the failure condition. For example:

      • Timeout waiting for replication catch-up
      • Connection refused on target node
      • Permission denied
      • Specific SQL error returned during the promotion process

      Business Impact / Motivation: Without detailed error messages, it is impossible to determine the root cause of high-availability failures (e.g., distinguishing between a MariaDB bug, a network issue, or a configuration error) without reproducing the issue with debug logging enabled. This increases Time to Resolution (TTR) for critical production outages.

      Attachments

        Issue Links

          Activity

            People

              esa.korhonen Esa Korhonen
              kyle.hutchinson Kyle Hutchinson
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.