==> warning.log <==
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 messagequeue[8766]: 10.738349 |0|0|0| W 31 CAL0000: MessageQueueClient::write: error writing 2741 bytes to IOSocket: sd: 57 inet: 127.0.0.1 port: 8601. Socket error was InetStreamSocket::write error: Broken pipe -- write from InetStreamSocket: sd: 57 inet: 127.0.0.1 port: 8601

==> debug.log <==
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ExeMgr[10765]: 10.775769 |4|0|0| D 16 CAL0041: Start SQL statement: select k.value as make, count(*) from study_response_1 s left join keymap k on k.key = s.col25 and k.keymap_group_id = 1 group by k.value; |mtab|
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.894383 |0|0|0| D 18 CAL0000: STOPPING Process: ExeMgr
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.894468 |0|0|0| D 18 CAL0000: StatusUpdate of Process ExeMgr State = 1 PID = 0
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcMon[2442]: 10.894453 |0|0|0| D 18 CAL0000: Send SET Alarm ID 13 on device ExeMgr
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.897010 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = AUTO_OFFLINE
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.897247 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = AUTO_OFFLINE PID = 0

==> alarm.log <==
13 MAJOR ALARM PROCESS_DOWN_AUTO Thu Jan 25 21:40:10 2018 1516916410 pm1 ProcessMonitor ExeMgr

==> debug.log <==
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.898767 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 25 on device ExeMgr
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.900764 |0|0|0| D 18 CAL0000: Send SET Alarm ID 13 on device ExeMgr
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.903530 |0|0|0| D 18 CAL0000: Pkill Process just to make sure: ExeMgr*
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.903600 |0|0|0| D 18 CAL0000: STARTING Process: ExeMgr
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.903641 |0|0|0| D 18 CAL0000: Process location: /usr/local/mariadb/columnstore/bin/ExeMgr
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.904065 |0|0|0| D 18 CAL0000: Dependent process of PrimProc/pm1 is 4
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.905340 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 27 on device DBRM
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.912290 |0|0|0| D 18 CAL0000: Pkill Process just to make sure: ExeMgr*
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.912354 |0|0|0| D 18 CAL0000: StatusUpdate of Process ExeMgr State = 3 PID = 0
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.912790 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = AUTO_INIT
Jan 25 21:40:10 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 10.912847 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = AUTO_INIT PID = 0
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.920052 |0|0|0| D 18 CAL0000: StatusUpdate of Process ExeMgr State = 21 PID = 11362
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.920951 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = PID_UPDATE
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.921236 |0|0|0| D 18 CAL0000: ExeMgr PID is 11362
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.921455 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = PID_UPDATE PID = 11362
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.921689 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 13 on device ExeMgr

==> alarm.log <==
13 CLEARED MAJOR ALARM PROCESS_DOWN_AUTO Thu Jan 25 21:40:11 2018 1516916411 pm1 ProcessMonitor ExeMgr

==> debug.log <==
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.923081 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 25 on device ExeMgr
Jan 25 21:40:11 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 11.924461 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 21 on device ExeMgr
Jan 25 21:40:12 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 12.938875 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = BUSY_INIT
Jan 25 21:40:12 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 12.938953 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = BUSY_INIT PID = 11362
Jan 25 21:40:12 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 12.943169 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = ACTIVE
Jan 25 21:40:12 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 12.943224 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = ACTIVE PID = 11362
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 14.921789 |0|0|0| D 18 CAL0000: Inform Process Mgr that process was restarted: ExeMgr

==> info.log <==
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 14.922249 |0|0|0| I 18 CAL0000: Calpont Process ExeMgr restarted successfully!!
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.924776 |0|0|0| I 17 CAL0000: MSG RECEIVED: Process Restarted on pm1/ExeMgr

==> debug.log <==
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.925140 |0|0|0| D 17 CAL0000: setQuerySystemState = 0
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.925287 |0|0|0| D 17 CAL0000: setQuerySystemState successful
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.925514 |0|0|0| D 17 CAL0000: Set System State = BUSY_INIT
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 14.925939 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set System State = BUSY_INIT
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.927525 |0|0|0| D 17 CAL0000: reinitProcessType: ReInit all cpimport
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.929036 |0|0|0| D 17 CAL0000: sendMsgProcMon: Process module pm1
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.929277 |0|0|0| D 17 CAL0000: cpimport process is reinited by request.
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.929504 |0|0|0| D 17 CAL0000: reinitProcessType: ACK received from Process-Monitor, return status = 0

==> info.log <==
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 14.929318 |0|0|0| I 18 CAL0000: MSG RECEIVED: Re-Init process request on: cpimport

==> debug.log <==
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.931626 |0|0|0| D 17 CAL0000: setQuerySystemState = 1
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.931747 |0|0|0| D 17 CAL0000: setQuerySystemState successful
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessManager[2565]: 14.931958 |0|0|0| D 17 CAL0000: Set System State = ACTIVE
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 14.932370 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set System State = ACTIVE

==> info.log <==
Jan 25 21:40:14 s_columnstore@ip-172-31-47-54 ProcessMonitor[2442]: 14.937128 |0|0|0| I 18 CAL0000: PROCREINITPROCESS: completed, no ack to ProcMgr