Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-976

System in DBRM_READ_ONLY mode after Non-parent PM recovery under DataRedundancy

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0.11, 1.1.0
    • Fix Version/s: 1.0.12, 1.1.2
    • Component/s: None
    • Labels:
    • Sprint:
      2017-21, 2017-22, 2017-23

      Description

      Setup a system with DataRedundancy and shutdown a non-parent PM module. Wait for system to report Active. Restart the shutdown module. After returning to Active the system will show an alarm:

      AlarmID = 31
      Brief Description = DBRM_READ_ONLY
      Alarm Severity = CRITICAL
      Time Issued = Fri Oct 13 17:20:15 2017
      Reporting Module = pm1
      Reporting Process = DBRMControllerNode
      Reported Device = System

      Oct 13 17:19:54 testPM1 joblist[14318]: 54.472863 |0|0|0| E 05 CAL0000: /home/test/mariadb/centOS7/mariadb-columnstore-engine/writeengine/client/we_clients.cpp @ 268 Could not connect to pm3_WriteEngineServer: InetStreamSocket::connect: connect() error: Connection timed out to: InetStreamSocket: sd: 13 inet: 192.168.56.213 port: 8630
      Oct 13 17:19:54 testPM1 joblist[14318]: 54.598050 |0|0|0| E 05 CAL0000: /home/test/mariadb/centOS7/mariadb-columnstore-engine/writeengine/client/we_clients.cpp @ 268 Could not connect to pm3_WriteEngineServer: InetStreamSocket::connect: connect() error: Connection timed out to: InetStreamSocket: sd: 15 inet: 192.168.56.213 port: 8630
      Oct 13 17:19:55 testPM1 oamcpp[14318]: 55.499143 |0|0|0| E 08 CAL0000: OamCache::checkReload shows state for pm3 as AUTO_DISABLED
      Oct 13 17:19:55 testPM1 DMLProc[14318]: 55.502492 |0|0|0| I 20 CAL0002: DMLProc will rollback 0 tables.
      Oct 13 17:20:15 testPM1 controllernode[30650]: 15.525003 |0|0|0| C 29 CAL0000: DBRM Controller: network error distributing command to worker 3
      Oct 13 17:20:35 testPM1 controllernode[30650]: 35.536485 |0|0|0| C 29 CAL0000: DBRM Controller: undo(): warning, could not contact worker number 3
      Oct 13 17:20:35 testPM1 controllernode[30650]: 35.536577 |0|0|0| C 29 CAL0000: DBRM Controller: Caught network error. Sending command 17, length 1. Setting read-only mode.
      Oct 13 17:20:35 testPM1 DMLProc[14318]: 35.539316 |0|0|0| I 20 CAL0002: DMLProc finished rollbackAll.
      Oct 13 17:20:35 testPM1 ProcessMonitor[4463]: 35.623039 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/DMLProc State = ACTIVE
      Oct 13 17:20:35 testPM1 ProcessMonitor[4463]: 35.623154 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/DMLProc State = ACTIVE PID = 14318
      Oct 13 17:20:35 testPM1 ProcessMonitor[4463]: 35.623202 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set System State = ACTIVE

        Attachments

          Activity

            People

            Assignee:
            hill David Hill (Inactive)
            Reporter:
            ben.thompson Ben Thompson
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved:

                Git Integration

                Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.