[MCOL-1138] pm1 failover testing - didnt leave a HOT_STANDBY ProcMgr on remainng node Created: 2018-01-05 Updated: 2023-10-26 Resolved: 2018-01-25 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | ? |
| Affects Version/s: | 1.1.2 |
| Fix Version/s: | 1.1.3 |
| Type: | Bug | Priority: | Major |
| Reporter: | David Hill (Inactive) | Assignee: | Daniel Lee (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
non-root amazon ami with EBS 3pm combo system |
||
| Sprint: | 2018-02 |
| Description |
|
started with pm1 as the active/master node after install. stopped pm1 instance, PM3 took over as master, but PM2 ProcMgr didnt go HOT_STANDBY [mariadb-user@ip-172-30-0-204 ~]$ ma getsystemi System 1.1.2 System and Module statuses Component Status Last Status Change Module pm1 ACTIVE Fri Jan 5 15:43:48 2018 Active Parent OAM Performance Module is 'pm1' MariaDB ColumnStore Process statuses Process Module Status Last Status Change Process ID ProcessMonitor pm2 ACTIVE Fri Jan 5 15:43:07 2018 15334 ProcessMonitor pm3 ACTIVE Fri Jan 5 15:43:08 2018 14322 Active Alarm Counts: Critical = 0, Major = 0, Minor = 0, Warning = 0, Info = 0 STOP PM1 System 1.1.2 System and Module statuses Component Status Last Status Change Module pm1 AUTO_DISABLED/DEGRADED Thu Jan 4 21:35:01 2018 Active Parent OAM Performance Module is 'pm3' MariaDB ColumnStore Process statuses Process Module Status Last Status Change Process ID ProcessMonitor pm2 ACTIVE Thu Jan 4 21:19:18 2018 3458 ProcessMonitor pm3 ACTIVE Thu Jan 4 21:19:19 2018 3457 Active Alarm Counts: Critical = 3, Major = 1, Minor = 0, Warning = 0, Info = 0 |
| Comments |
| Comment by David Hill (Inactive) [ 2018-01-05 ] |
|
pm2 status MariaDB [(none)]> show master status\G;
ERROR: No query specified MariaDB [(none)]> show slave status\G;
ERROR: No query specified MariaDB [(none)]> |
| Comment by David Hill (Inactive) [ 2018-01-22 ] |
|
fixed 2 issues 1. fix issue where no HOT_STANDBY procmgr existed after pm1 outage HOW TO TEST.. 1. on a 3 combo pm system with storage,remove pm1 and make sure you are left with an HOST_STANDBY ProcMgr. |
| Comment by David Hill (Inactive) [ 2018-01-22 ] |
|
https://github.com/mariadb-corporation/mariadb-columnstore-engine/pull/379 |
| Comment by Ben Thompson (Inactive) [ 2018-01-22 ] |
|
Reviewed / Merged |
| Comment by Daniel Lee (Inactive) [ 2018-01-25 ] |
|
Build verified: 1.1.3-1 created on 01/24/2018, ami-99b40be1 Verified this ticket. mcsadmin> getprocessstatus MariaDB ColumnStore Process statuses Process Module Status Last Status Change Process ID ProcessMonitor pm2 ACTIVE Thu Jan 25 16:50:30 2018 3362 ProcessMonitor pm3 ACTIVE Thu Jan 25 16:50:31 2018 3370 |