Uploaded image for project: 'MariaDB MaxScale'
  1. MariaDB MaxScale
  2. MXS-3481

Complete test plan for transaction replay against XPAND Direct

    XMLWordPrintable

Details

    • Task
    • Status: Closed (View Workflow)
    • Major
    • Resolution: Done
    • None
    • 6.0.0
    • xpandmon
    • None

    Description

      Prologue
      There are multiple dimensions in the process, with lots of possible permutations of events which may occur at different times. Group Change may be very fast or rather slow. Sessions may be idle, or have transactions. Transactions may be quiet or running a statement. Group change can be initiated by flex-down or flex-up; or by node crash. Crashed node can return into the cluster without causing group change (10 minutes), or be considered dead (>10 min). There may be more.

      It is safe to assume that by now MaxScale has these algorithms fully debugged for normal read-write splits. The new and unusual thing about XPAND is that Group Change makes entire cluster inaccessible in a special way. It is worth validating the behaviors empirically.

      Given that the number of possible permutations is close to infinite, In this project we will select only a number of scenarios, trying to achieve the highest likelihood of overall success.

      Plan
      We concentrate on long group changes only.

      Part 1. Actions on the departing node

        Flex Down Crash and restore
      in 1 minute
      Crash and no restore
      first 1 minute (before GC)
      existing idle connections      
      existing connections
      with outstanding idle transactions
           
      existing connections
      with active transactions
           

      Part 2. Actions on the nodes not affected by group change

        Flex Down Crash and restore
      in 1 minute
      Crash and no restore
      first 1 minute (before GC)
      Flex Up
      existing idle connections        
      existing connections
      with outstanding idle transactions
             
      existing connections
      with active transactions
             
      new connections and transactions
      hitting maxscale
             

      Note Later on we would try to observe what happens to idle and active sessions and transactions living on a node which crashed and is not coming back when the 10 minutes interval is about to expire (i.e. when an activity is attempted while XPAND is still hoping for the node to come back, and then went into group change). But this is in due time. Let's get the fundamentals verified first.

      Attachments

        Issue Links

          Activity

            People

              rahul.joshi@mariadb.com Rahul Joshi (Inactive)
              gdorman Gregory Dorman (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.