[MCOL-833] could not open file for OID after a outage recover from pm2 PrimProc Created: 2017-07-25 Updated: 2023-10-26 Resolved: 2017-09-01 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | ?, ExeMgr |
| Affects Version/s: | 1.0.9 |
| Fix Version/s: | 1.1.0 |
| Type: | Bug | Priority: | Major |
| Reporter: | David Hill (Inactive) | Assignee: | Daniel Lee (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Description |
|
Had customer report errors with PrimProc could not open file for OID for unknown reasons, Was able to reproduce this error by doing the following. Wasnt sure if same problem with 801/804, so opened a new BUG. 1. setup a 1um / 2 pm system with 50gb tpch1 database 3. Did a pkill on pm2 PrimProc pm1 errors logs soon started after the recovery was performed: Jul 14 16:24:46 ip-172-30-0-176 PrimProc[93531]: 46.550644 |0|0|0| W 28 CAL0000: IDB-2039: Data file does not exist, please contact your system administrator for more information. This file exist on pm2, so ExeMgr is sending the request to the wrong pm1/PrimProc data2]# ll 000.dir/000.dir/012.dir/012.dir/000.dir/FILE002.cdf |
| Comments |
| Comment by David Hill (Inactive) [ 2017-07-26 ] |
|
commit 842838e5cec49d42e209cd8a9284ac4699a53d99 dbcon/joblist/distributedenginecomm.cpp | 2 +- commit 26ac4aa31c98bb937c01ae9d0287df7e027c4ec3 mcol-833 - merge code from 1.0 for missing file fix dbcon/joblist/distributedenginecomm.cpp | 8 ++++---- test scenarios in |
| Comment by Daniel Lee (Inactive) [ 2017-09-01 ] |
|
Build verified: 1.1.0 GitHub source /root/columnstore/mariadb-columnstore-server /root/columnstore/mariadb-columnstore-server/mariadb-columnstore-engine |