Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-804

UM1 looses connection to PMs

    XMLWordPrintable

Details

    • Bug
    • Status: Closed (View Workflow)
    • Critical
    • Resolution: Not a Bug
    • 1.0.8
    • Icebox
    • DDLProc, DMLProc, PrimProc, ProcMgr
    • None
    • Centos7, 4 Performance Modules, 1 User Module

    Description

      We had several issues with Infinidb 4.6 where DML, DDL, PrimProc, ExeMgr will not run or looses connection so the system has been upgraded to MCS-1.8 and we still face the same issue. UM1 frequently looses connection with PMs or one of the PMs goes out of the cluster. here are some logs from UM1.

      Jul 5 17:42:52 ip- joblist[93803]: 52.693608 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 318 Could not get a ExeMgr connection.
      Jul 5 17:42:52 ip- joblist[93803]: 52.693658 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 146 /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp: Could not get a connection to a ExeMgr
      Jul 5 17:43:13 ip- joblist[93803]: 13.719869 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 318 Could not get a ExeMgr connection.
      Jul 6 03:52:57 ip- joblist[47893]: 57.817911 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 318 Could not get a ExeMgr connection.
      Jul 6 03:52:57 ip- joblist[47893]: 57.817962 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 146 /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp: Could not get a connection to a ExeMgr
      Jul 6 03:52:58 ip- joblist[66076]: 58.901732 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/joblist/distributedenginecomm.cpp @ 382 DEC: lost connection to pm1
      Jul 6 03:52:59 ip- joblist[66076]: 59.281593 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/joblist/distributedenginecomm.cpp @ 382 DEC: lost connection to pm2
      Jul 6 03:52:59 ip- joblist[66076]: 59.325477 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/joblist/distributedenginecomm.cpp @ 382 DEC: lost connection to pm3
      Jul 6 03:52:59 ip- joblist[66076]: 59.555777 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/joblist/distributedenginecomm.cpp @ 382 DEC: lost connection to pm4
      Jul 6 03:53:21 ip- joblist[47893]: 21.153464 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 318 Could not get a ExeMgr connection.
      Jul 6 03:53:21 ip- joblist[47893]: 21.153500 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 318 Could not get a ExeMgr connection.
      Jul 6 03:53:21 ip- mysqld[47893]: 21.153521 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 146 /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp: Could not get a connection to a ExeMgr
      Jul 6 03:53:45 ip- joblist[47893]: 45.174945 |0|0|0| C 05 CAL0000: /home/builder/mariadb-columnstore-server/mariadb-columnstore-engine/dbcon/execplan/clientrotator.cpp @ 318 Could not get a ExeMgr connection.
      Jul 6 03:54:19 ip- joblist[66076]: 19.532358 |2147483648|0|0| C 05 CAL0000: IDB-2023: PrimProc is not running (or connection to PrimProc dropped).
      Jul 6 03:54:19 ip- joblist[66076]: 19.532434 |2147483648|0|0| C 05 CAL0000: IDB-2023: PrimProc is not running (or connection to PrimProc dropped).
      Jul 6 03:54:19 ip- joblist[66076]: 19.533035 |2147483648|0|0| C 05 CAL0000: IDB-2023: PrimProc is not running (or connection to PrimProc dropped).
      Jul 6 03:54:19 ip- joblist[66076]: 19.533362 |2147483648|0|0| C 05 CAL0000: IDB-2023: PrimProc is not running (or connection to PrimProc dropped).
      Jul 6 03:54:19 ip- joblist[66076]: 19.533428 |2147483648|0|0| C 05 CAL0000: IDB-2023: PrimProc is not running (or connection to PrimProc dropped).

      Thanks in advance.

      Attachments

        1. columnstoreSupportReport1.tar.gz
          7.49 MB
          Abhinav santi
        2. debug.log.tar.gz
          7.86 MB
          Abhinav santi

        Activity

          People

            Unassigned Unassigned
            abhinav.santi Abhinav santi
            Votes:
            1 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Git Integration

                Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.