[MCOL-781] Crash on SystemState and BRMShmImpl after Update to 1.09 Created: 2017-06-21 Updated: 2017-08-09 Resolved: 2017-08-09 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | PrimProc |
| Affects Version/s: | None |
| Fix Version/s: | Icebox |
| Type: | Bug | Priority: | Major |
| Reporter: | Christian2 | Assignee: | David Hill (Inactive) |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Red Hat Enterprise Linux Server release 7.3, single Server CS 1.09 |
||
| Attachments: |
|
| Description |
|
Hello, we are facing a lot of prblems because of DBRM errors with a columunstore made on 1.07. After teh current update from 1.07 over 1.08 (start an stop with no issue, incl. boost 1.57 installation) to 1.09 the system was available and good to answer some querries. AS weel creating a table. With import from exiting table (cpimport) the system crashed in the same manner as version 1.07 before. Part of the log: Jun 21 13:03:56 kmodekarlsap001 controllernode[64512]: 56.857546 |0|0|0| E 29 CAL0000: DBRM: error: SessionManager::getSystemState() failed (network) The support output is attached as well. Thanks a lot in advance |
| Comments |
| Comment by David Hill (Inactive) [ 2017-06-21 ] |
|
For the logs, it does look like there is some DBRM file problem. 1. Try one of the local copies and see if one of those works better. So here the dbrm files and its currently using the A version that is shown by the current file shown below. By date, the A version is the latest. B is an older version but its the same size and the other is even older and a different size. That is the version you could try to bring the system up with and see if that resolves things. total 7584
/usr/local/mariadb/columnstore/data1/systemFiles/dbrm/BRM_savesA You can try these steps to get it work with the different sized version. Hopefully that version good to use.
Try this to see if it its stabilize. 2. If you do maintenance backups where the dbrm and all the data files are backed up, you would need to reinstall those back on. These are the 2 options as being able to recover from what looks like to be a DBRM files issue. |