[MCOL-4798] ExeMgr hit cpu , cluster in read only and reported PrimProc error reading file ,Error reading compression header. Created: 2021-07-05 Updated: 2023-11-17 Resolved: 2022-07-27 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | ExeMgr |
| Affects Version/s: | 5.5.2 |
| Fix Version/s: | 22.08.1 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Massimo | Assignee: | David Hall (Inactive) |
| Resolution: | Duplicate | Votes: | 1 |
| Labels: | None | ||
| Issue Links: |
|
||||||||||||
| Sprint: | 2021-16, 2021-17 | ||||||||||||
| Description |
|
Customer report CPU alert by ExcMgr process. Once we check the status of the cluster was in READ-ONLY , reporting the same error: Jul 5 08:24:49 pixid-csx2 messagequeue[1366]: 49.012334 |0|0|0| W 31 CAL0000: MessageQueueClient::write: error writing 4790 bytes to IOSocket: sd: 100 inet: 10.10.1.92 port: 8601. Socket error was InetStreamSocket::write error: Broken pipe – write from InetStreamSocket: sd: 100 inet: 10.10.1.92 port: 8601 looking back to the log, looks like there were many error before Jul 5 06:40:03 pixid-csx2 IDBFile[7950]: 03.159455 |0|0|0| D 35 CAL0002: Failed to open file: (dbroot 3 offline)/000.dir/000.dir/014.dir/099.dir/000.dir/FILE000.cdf, exception: unable to open Unbuffered file We need to restart all the cluster, which fix the issue. |
| Comments |
| Comment by Massimo [ 2021-07-16 ] |
|
manjot toddstoffel we do not have the possibilities to reproduce a case, that s why we collect all logs you request and plsu /var/log/messages. First they dont have a test env, or we dont have access. everything is on the log, which should tell the problem |
| Comment by Massimo [ 2021-09-01 ] |
|
drrtuy toddstoffel any update on this? |
| Comment by David Hall (Inactive) [ 2022-03-04 ] |
|
We can't reproduce |