[MCOL-3341] PM ExeMgr doesnt restart on User Module failure w/ local query enabled Created: 2019-05-30 Updated: 2023-10-26 Resolved: 2020-04-15 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | ? |
| Affects Version/s: | 1.2.3 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | David Hill (Inactive) | Assignee: | Ben Thompson (Inactive) |
| Resolution: | Won't Fix | Votes: | 0 |
| Labels: | None | ||
| Environment: |
2um 2pm with local query enabled |
||
| Description |
|
Reported by customer and reproduced: System with multiple UMs and local query enabled, if UM1 goes down the ExeMgrs are all stopped and started as part of the recovery process. The ExeMgrs fail to start leaving the system in this state: System BUSY_INIT Thu May 30 14:45:51 2019 Module um1 AUTO_DISABLED/DEGRADED Thu May 30 14:45:57 2019 Active Parent OAM Performance Module is 'pm1' MariaDB ColumnStore Process statuses Process Module Status Last Status Change Process ID ProcessMonitor um2 ACTIVE Thu May 30 14:42:22 2019 7059 ProcessMonitor pm1 ACTIVE Thu May 30 14:41:30 2019 9303 ProcessMonitor pm2 ACTIVE Thu May 30 14:42:32 2019 7669 From pm1 logs when ExeMgr is trying to start back up May 30 14:46:47 ip-172-31-38-221 ProcessMonitor[9303]: 47.487022 |0|0|0| E 18 CAL0000: Process location: not found I think the issue is that in the a separate system install, the ExeMgr Process Configuration shows its running on UM. So the reason for the error above. Looks like there needs to be additional code to handle the local query option. Process #7 Configuration information |
| Comments |
| Comment by Todd Stoffel (Inactive) [ 2020-04-15 ] |
|
OAM is being deprecated and replaced by an enhanced API and the MaxScale orchestration project. |