[MCOL-929] switchparentoammodule cores when pm2 is active on a 1um/2pm Data Redundancy system Created: 2017-09-18  Updated: 2023-10-26  Resolved: 2017-10-26

Status: Closed
Project: MariaDB ColumnStore
Component/s: ?
Affects Version/s: 1.1.0
Fix Version/s: 1.1.1

Type: Bug Priority: Major
Reporter: David Hill (Inactive) Assignee: David Hill (Inactive)
Resolution: Cannot Reproduce Votes: 0
Labels: None

Sprint: 2017-20, 2017-21

 Description   

after a failover where pm2 becomes active. Ran the command to active back to pm1 and it reported core file

Component Status Last Status Change
------------ -------------------------- ------------------------
System ACTIVE Mon Sep 18 20:00:01 2017

Module um1 ACTIVE Mon Sep 18 19:44:18 2017
Module pm1 ACTIVE Mon Sep 18 19:59:45 2017
Module pm2 ACTIVE Mon Sep 18 19:51:26 2017

Active Parent OAM Performance Module is 'pm2'
MariaDB ColumnStore Replication Feature is enabled

MariaDB ColumnStore Process statuses

Process Module Status Last Status Change Process ID
------------------ ------ --------------- ------------------------ ----------
ProcessMonitor um1 ACTIVE Mon Sep 18 19:43:33 2017 1385
ServerMonitor um1 ACTIVE Mon Sep 18 19:43:58 2017 1987
DBRMWorkerNode um1 ACTIVE Mon Sep 18 19:44:00 2017 2017
ExeMgr um1 ACTIVE Mon Sep 18 19:59:49 2017 4193
DDLProc um1 ACTIVE Mon Sep 18 19:59:55 2017 4214
DMLProc um1 ACTIVE Mon Sep 18 20:00:01 2017 4232
mysqld um1 ACTIVE Mon Sep 18 19:51:28 2017

ProcessMonitor pm1 ACTIVE Mon Sep 18 19:59:28 2017 1415
ProcessManager pm1 HOT_STANDBY Mon Sep 18 19:59:35 2017 2055
DBRMControllerNode pm1 COLD_STANDBY Mon Sep 18 19:59:35 2017
ServerMonitor pm1 ACTIVE Mon Sep 18 19:59:38 2017 2125
DBRMWorkerNode pm1 ACTIVE Mon Sep 18 19:59:40 2017 2153
DecomSvr pm1 ACTIVE Mon Sep 18 19:59:43 2017 2170
PrimProc pm1 ACTIVE Mon Sep 18 19:59:46 2017 2181
WriteEngineServer pm1 ACTIVE Mon Sep 18 19:59:47 2017 2192

ProcessMonitor pm2 ACTIVE Mon Sep 18 19:43:41 2017 1371
ProcessManager pm2 ACTIVE Mon Sep 18 19:50:30 2017 2235
DBRMControllerNode pm2 ACTIVE Mon Sep 18 19:51:18 2017 3029
ServerMonitor pm2 ACTIVE Mon Sep 18 19:51:20 2017 3050
DBRMWorkerNode pm2 ACTIVE Mon Sep 18 19:51:20 2017 3084
DecomSvr pm2 ACTIVE Mon Sep 18 19:51:24 2017 3122
PrimProc pm2 ACTIVE Mon Sep 18 19:51:26 2017 3152
WriteEngineServer pm2 ACTIVE Mon Sep 18 19:59:19 2017 6200

Active Alarm Counts: Critical = 0, Major = 0, Minor = 0, Warning = 0, Info = 0
mcsadmin> getst
getstorageconfig Mon Sep 18 20:00:43 2017

System Storage Configuration

Performance Module (DBRoot) Storage Type = DataRedundancy
System Assigned DBRoot Count = 2
DBRoot IDs assigned to 'pm1' = 2
DBRoot IDs assigned to 'pm2' = 1

mcsadmin> switch y
switchparentoammodule Mon Sep 18 20:14:15 2017

Switching to the Hot-Standby Parent OAM Module 'pm1'

MariaDB ColumnStore Replication is enabled, is there a 'MariaDB ColumnStore' Password configured in /root/.my.cnf (y,n):
Please enter: n

Check for active transactions

Switch Active Parent OAM Module starting...
Segmentation fault (core dumped)



 Comments   
Comment by David Hill (Inactive) [ 2017-09-18 ]

same core if you try a stopsystem first then switch..

Comment by David Hill (Inactive) [ 2017-10-26 ]

Ok, tested both root and no-root installs and couldn't reproduce the issue on latest builds.. so it must have been fixed with some change..

Generated at Thu Feb 08 02:24:52 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.