[MCOL-5286] HA Failing when losing a node - restart secondary - pm3 - Jira

XML

Word

Printable

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Blocker
Resolution: Fixed
Affects Version/s: None
Fix Version/s: cmapi-22.08.2
Component/s: cmapi
Labels:
- triage
Environment:
EC2 AWS 3 node cluster

Sprint:
2022-22

Description

In the latest version cs 22.08.x with cmpai 22.08.x
When losing a pm, selects continue fine on two nodes but the cluster doesnt shuffle to a 2 node cluster. Create table statments do not work during this lost pm timeframe.

Furthermore, when the 3rd PM does come back online, no selects work as the 3rd node is still part of the cluster but no subprocesses are online. until a manual mcsShutdown/mcsStart occurs.

steps to reproduce

Used ansible to setup 3 node nfs cluste  with CS 22.08.X and cmapi 22.08.x

create database test;

use test;

create table t1 ( a int) engine=columnstore;

insert into t1 values(1);

insert into t1 values(2);

insert into t1 values(3);

select * from t1;

exit

mcsStatus

# now shutdown node 3

mcsStatus # notice it doesnt work

  "error": "Got an error retrieving status from node ip-172-31-27-124.us-west-2.compute.internal"

mariadb test -e "select * from t1;"; # notice it works

mariadb test -e "create table t2 ( a int) engine=columnstore;" # notice it doesnt work

# start up node 3 node

mcsStatus

# notice cs subprocesses are offline on node 3

mariadb test -e "select * from t1;"; # notice it fails and errors

Attachments

Issue Links

relates to

MCOL-5293 Replication not working after failover -restart pm1

Closed

mentioned in: Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...

(5 mentioned in)

Activity

People

Assignee:: Alan Mologorsky

Reporter:: Allen Herrera

Assigned for Testing:: Todd Stoffel (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 2022-10-31 21:00

Updated:: 2024-08-20 15:33

Resolved:: 2022-11-15 20:22

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.