  1. MariaDB ColumnStore
  2. MCOL-5293

Replication not working after failover -restart pm1

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • None
    • 22.08.7
    • cmapi
    • 2022-22

    Description

      Build tested: 22.08.2, latest build from drone (#5838)

      Steps:

      1. Create a 3-PM Docker cluster
      2. Check cluster status on PM1. master=PM1, slave=PM2, PM3
      3. Create a database and a table, and insert a row on PM1
      4. Verify table gets replicated to PM2
      5. Execute "docker container stop mcs1", wait 90 seconds
      6. Execute "docker container start mcs1", wait 60 seconds
      7. Check cluster status on PM1. master=PM1, slave=PM2, PM3
      Yesterday, I noticed that MaxScale had PM2 set as the master.
      Today, MaxScale also has PM1 set as the master (I don't know why the behavior differs today).

      For this test, PM2 was expected to take over as the master.

      8. Create another table and insert a row on PM1
      9. The table did not get replicated to PM2 or PM3
      10. "show slave status" on PM2 returned nothing
      11. "show slave status" on PM3 did return status, and no error
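
      Steps 5, 6, 10 and 11 above can be sketched as a small shell script. This is only an illustration, not the script used for the report: the container name mcs1 comes from step 5, while mcs2 and mcs3, the DRY_RUN switch, the run helper, and the assumption that the mariadb client can log in without credentials inside each container are all hypothetical. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of steps 5, 6, 10 and 11. Nothing is executed unless DRY_RUN=0.
set -u

DRY_RUN="${DRY_RUN:-1}"   # illustrative switch: 1 = print only, 0 = execute

run() {
    # Print the command; execute it only when DRY_RUN=0.
    printf '%s\n' "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

restart_pm1() {
    # Steps 5 and 6: stop PM1, wait 90 seconds, start it, wait 60 seconds.
    run docker container stop mcs1
    run sleep 90
    run docker container start mcs1
    run sleep 60
}

check_replicas() {
    # Steps 10 and 11: query slave status on PM2 and PM3.
    # Container names mcs2/mcs3 and password-less access are assumptions.
    for node in mcs2 mcs3; do
        run docker exec "$node" mariadb -e "SHOW SLAVE STATUS\G"
    done
}

restart_pm1
check_replicas
```

      Running it with DRY_RUN=0 would execute the commands against a live cluster; an empty result from a node corresponds to the PM2 symptom in step 10.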

      PM3 slave status showed master log mariadb-bin.000002, position 4684
      PM1 master status showed master log mariadb-bin.000004, position 568
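
      The two coordinates above can be compared mechanically. The following is a minimal sketch, assuming PM3 replicates directly from PM1 and that binlog file names end in a numeric sequence; the function names binlog_seq and binlog_compare are made up for this example.

```shell
#!/bin/sh
# binlog_seq FILE
# Print the numeric suffix of a binlog file name without leading zeros,
# e.g. mariadb-bin.000002 -> 2.
binlog_seq() {
    seq_num=$(printf '%s' "${1##*.}" | sed 's/^0*//')
    printf '%s\n' "${seq_num:-0}"
}

# binlog_compare FILE_A POS_A FILE_B POS_B
# Print "behind", "equal" or "ahead" for coordinate A relative to B.
# The file sequence is compared first; the position only breaks ties.
binlog_compare() {
    seq_a=$(binlog_seq "$1")
    seq_b=$(binlog_seq "$3")
    if [ "$seq_a" -lt "$seq_b" ]; then
        echo behind
    elif [ "$seq_a" -gt "$seq_b" ]; then
        echo ahead
    elif [ "$2" -lt "$4" ]; then
        echo behind
    elif [ "$2" -gt "$4" ]; then
        echo ahead
    else
        echo equal
    fi
}

# Values from the report: PM3 slave status vs. PM1 master status.
binlog_compare mariadb-bin.000002 4684 mariadb-bin.000004 568
```

      With the reported values this prints "behind": PM3 is still reading binlog file 000002 while PM1 is already writing 000004, which is consistent with the new table not reaching PM3.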

          Activity

            dleeyh Daniel Lee (Inactive) created issue -
            dleeyh Daniel Lee (Inactive) made changes -
            Field Original Value New Value
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked lower
            toddstoffel Todd Stoffel (Inactive) made changes -
            Priority Major [ 3 ] Blocker [ 1 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Assignee Alan Mologorsky [ JIRAUSER49150 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2022-22 [ 672 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked lower
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 22.11.01 [ 28458 ]
            alan.mologorsky Alan Mologorsky made changes -
            Status Open [ 1 ] In Progress [ 3 ]
            alan.mologorsky Alan Mologorsky made changes -
            Status In Progress [ 3 ] In Review [ 10002 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Summary "Replication not working after failover" "Replication not working after failover -restart pm1"
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Fix Version/s 22.11.1 [ 28458 ]
            Resolution Duplicate [ 3 ]
            Status In Review [ 10002 ] Closed [ 6 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Resolution Duplicate [ 3 ]
            Status Closed [ 6 ] Stalled [ 10000 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Assignee Alan Mologorsky [ JIRAUSER49150 ] Todd Stoffel [ toddstoffel ]
            ccalender Chris Calender (Inactive) made changes -
            Fix Version/s 22.11.1 [ 28458 ]
            ccalender Chris Calender (Inactive) made changes -
            Assignee Todd Stoffel [ toddstoffel ] Roman [ drrtuy ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Fix Version/s 23.02 [ 28209 ]
            Fix Version/s 23.03.1 [ 28458 ]
            drrtuy Roman added a comment -

            The scenario David.Hall mentioned should be re-tested when MCOL-5306 is tested.

            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 22.08.6 [ 28542 ]
            Fix Version/s 23.02 [ 28209 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Attachment yes_mxs.log [ 67353 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Assigned for Testing Daniel Lee [ dleeyh ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Status Stalled [ 10000 ] In Progress [ 3 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Status In Progress [ 3 ] In Testing [ 10301 ]

            dleeyh Daniel Lee (Inactive) added a comment -

            Build verified: 22.08.7

            engine: e243a5332b8613ce0e370a503461990fefc24fce
            server: d3049350bb5c61340f5a7518b155d3c9dacdcb33
            buildNo: 6202

            Executed test case in mustest, test advance.a000_failover.test

            Steps performed.

                echo Checking MaxScale status......
                echo Checking ColumnStore status on mcs1......
                echo Running sanity test on mcs1......
                echo Checking ColumnStore status on mcs1......
                echo Stopping node mcs1......
                echo Checking MaxScale status......
                echo Checking ColumnStore status on mcs2......
                echo Starting node mcs1......
                echo Checking MaxScale status......
                echo Checking ColumnStore status on mcs1......
                echo Create a 1g DBT2 database on mcs2......
                echo Check row counts on mcs1 for replication......
                echo Drop test database......
                echo Ending of test.

            Test result, output from the test, has been attached.

            dleeyh Daniel Lee (Inactive) made changes -
            Resolution Fixed [ 1 ]
            Status In Testing [ 10301 ] Closed [ 6 ]
            dleeyh Daniel Lee (Inactive) made changes -
            Attachment a0001_failover.result [ 67439 ]
            David.Hall David Hall (Inactive) made changes -
            Labels cluster

            People

              Assignee: drrtuy Roman
              Reporter: dleeyh Daniel Lee (Inactive)
              Assigned for Testing: Daniel Lee (Inactive)