[MCOL-3278] ProcMon and ProcMgr crashed - Signal: 6 - libmessageqcpp.so.1 Created: 2019-04-23 Updated: 2023-10-26 Resolved: 2020-04-15 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | ? |
| Affects Version/s: | 1.1.6 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | David Hill (Inactive) | Assignee: | Patrick LeBlanc (Inactive) |
| Resolution: | Won't Fix | Votes: | 0 |
| Labels: | None | ||
| Environment: |
3 um 5 pm Amazon EBS system |
||
| Description |
|
Customer reported: The problem was that a colxml error prevented cpimport from running. It appears to me to be a temporary communication error.

Found this in the support report: ProcMon restarted on pm1, causing the errors on um1, which is why colxml and cpimport failed. They couldn't get the correct status of the DBROOTs. ProcMgr restarted 18 minutes later.

The only thing to note is that ProcMon and ProcMgr both crashed with similar errors. Based on the logs, I am not sure why ProcMon restarted and ProcMgr followed a bit later. I will open a new BUG.

Um1 logs

Apr 23 04:59:53 mcs1-um1 joblist[125849]: 53.777357 |2147483648|0|0| C 05 CAL0000: IDB-2034: At least one DBRoot required for that query is offline.
Apr 23 04:59:53 mcs1-um1 oamcpp[125849]: 53.714950 |0|0|0| E 08 CAL0000: OamCache::checkReload exception while getModuleStatus pm1 Invalid Parameter passed in getModuleStatus API
Apr 23 04:59:53 mcs1-um1 writeengine[125849]: 53.833765 |0|0|0| E 19 CAL0087: BulkLoad Error: colxml runtime exception: Error reading columns for table canary.future_bigsum_tmp: IDB-2043: An internal error occurred. Check the error log file & contact support.

Pm1

Apr 23 04:59:49 mcs1-pm1 messagequeue[113607]: 49.076723 |0|0|0| W 31 CAL0000: Client read close socket for InetStreamSocket::readToMagic(): I/O error1: rc-1; poll signal interrupt ( POLLHUP POLLERR )

Date/time: 2019-04-23 04:59:47
/usr/local/mariadb/columnstore/bin/ProcMon(_Z12fatalHandleri+0x150)[0x5557d5baf8c0]

And ProcMgr restarted 18 minutes after ProcMon.

Apr 23 05:17:14 mcs1-pm1 ProcessMonitor[84163]: 14.581682 |0|0|0| C 18 CAL0000: *****MariaDB ColumnStore Process Restarting: ProcessManager, old PID = 113607

This trace was from an earlier ProcMgr crash. Didn't see one for this one.

Date/time: 2019-04-22 00:02:48
[0x55da18a03c70] |
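For anyone lining up the events in a similar support report, the sketch below splits the syslog-style lines quoted above into fields and measures the gap between the ProcMon crash trace and the ProcMgr restart message. It is only an illustration: the regular expression is inferred from the lines in this ticket, the helper name `parse_line` and the record keys are hypothetical, and the year is supplied by hand because the syslog prefix does not carry one.

```python
import re
from datetime import datetime

# Pattern inferred from the log lines quoted in this ticket (an assumption,
# not an official format description). The three |n|n|n| fields and the
# two-digit number after the severity letter are matched but not interpreted.
LOG_LINE = re.compile(
    r"^(?P<stamp>[A-Z][a-z]{2} +\d{1,2} \d{2}:\d{2}:\d{2}) "
    r"(?P<host>\S+) (?P<process>[\w-]+)\[(?P<pid>\d+)\]: "
    r"\S+ \|\d+\|\d+\|\d+\| "
    r"(?P<severity>[A-Z]) \d+ (?P<code>CAL\d+): (?P<message>.*)$"
)


def parse_line(line, year=2019):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["pid"] = int(record["pid"])
    # Syslog timestamps omit the year, so the caller has to provide it.
    stamp = record.pop("stamp")
    record["time"] = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
    return record


# "Date/time" value from the ProcMon crash trace in the description.
procmon_crash = datetime(2019, 4, 23, 4, 59, 47)

restart = parse_line(
    "Apr 23 05:17:14 mcs1-pm1 ProcessMonitor[84163]: 14.581682 |0|0|0| C 18 "
    "CAL0000: *****MariaDB ColumnStore Process Restarting: ProcessManager, "
    "old PID = 113607"
)

gap = restart["time"] - procmon_crash
print(gap)  # 0:17:27
```

The computed gap of 17 minutes 27 seconds matches the "18 minutes" quoted in the description, and the old PID in the restart message (113607) is the same PID that logged the socket warning at 04:59:49.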
| Comments |
| Comment by Todd Stoffel (Inactive) [ 2020-04-15 ] |
|
OAM is being deprecated and replaced by an enhanced API and the MaxScale orchestration project. |