Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-219

DBT3 query #11 caused ExeMgr to restart

    XMLWordPrintable

Details

    • Bug
    • Status: Closed (View Workflow)
    • Critical
    • Resolution: Duplicate
    • 1.0.1
    • 1.0.2
    • ExeMgr
    • None

    Description

      Build tested: alpha 1.0.1

      mscadmin> getsoft
      getsoftwareinfo Tue Jun 28 20:06:51 2016

      Name : mariadb-columnstore-platform Relocations: (not relocatable)
      Version : 1.0 Vendor: MariaDB Corporation Ab
      Release : 1 Build Date: Mon 13 Jun 2016 06:50:26 PM CDT
      Install Date: Fri 17 Jun 2016 10:53:22 AM CDT Build Host: srvbuilder

      Tested on a 1gb DBT3 (TPCH) database.

      Msg in the debug.log indicated that ExeMgr was being stopped by ProcessMonitor. I am not sure what caused this to happen. My VM has 32gb memory and I allocated 16gb for TotalUMMemory. This does not seem to be a memory problem.

      MariaDB [testdb]> select
      -> ps_partkey,
      -> sum(ps_supplycost * ps_availqty) as value
      -> from
      -> partsupp,
      -> supplier,
      -> nation
      -> where
      -> ps_suppkey = s_suppkey
      -> and s_nationkey = n_nationkey
      -> and n_name = 'UNITED STATES'
      -> group by
      -> ps_partkey having
      -> sum(ps_supplycost * ps_availqty) > (
      -> select
      -> sum(ps_supplycost * ps_availqty) * 0.0001000000
      -> from
      -> partsupp,
      -> supplier,
      -> nation
      -> where
      -> ps_suppkey = s_suppkey
      -> and s_nationkey = n_nationkey
      -> and n_name = 'UNITED STATES'
      -> )
      -> order by
      -> value desc;
      ERROR 1815 (HY000): Internal error: Lost connection to ExeMgr. Please contact your administrator

      Entry in crit.log
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.940892 |0|0|0| C 18 CAL0000: *****Calpont Process Restarting: ExeMgr, old PID = 24815

      entry in info.log
      Jun 28 20:21:09 columnStore ProcessMonitor[2464]: 09.011743 |0|0|0| I 18 CAL0000: Calpont Process ExeMgr restarted successfully!!
      Jun 28 20:21:09 columnStore ProcessManager[2742]: 09.059576 |0|0|0| I 17 CAL0000: MSG RECEIVED: Process Restarted on pm1/ExeMgr

      Entry in debug.log
      Jun 28 20:21:02 columnStore ExeMgr[24815]: 02.232592 |147|0|0| D 16 CAL0041: Start SQL statement: select ps_partkey, sum(ps_supplycost * ps_availqty) as value from partsupp, supplier, nation where ps_suppkey = s_suppkey and s_nationkey = n_nationkey and n_name = 'UNITED STATES' group by ps_partkey having sum(ps_supplycost * ps_availqty) > ( select sum(ps_supplycost * ps_availqty) * 0.0001000000 from partsupp, supplier, nation where ps_suppkey = s_suppkey and s_nationkey = n_nationkey and n_name = 'UNITED STATES' ) order by value desc; |testdb|
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.941432 |0|0|0| D 18 CAL0000: STOPPING Process: ExeMgr
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.941797 |0|0|0| D 18 CAL0000: Send SET Alarm ID 13 on device ExeMgr
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.941893 |0|0|0| D 18 CAL0000: StatusUpdate of Process ExeMgr State = 1 PID = 0
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.953145 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = AUTO_OFFLINE
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.953315 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = AUTO_OFFLINE PID = 0
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.969862 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 25 on device ExeMgr
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.984033 |0|0|0| D 18 CAL0000: Pkill Process just to make sure: ExeMgr*
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.984160 |0|0|0| D 18 CAL0000: STARTING Process: ExeMgr
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.984200 |0|0|0| D 18 CAL0000: Process location: /usr/local/mariadb/columnstore/bin/ExeMgr
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.984813 |0|0|0| D 18 CAL0000: Dependent process of PrimProc/pm1 is 4
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.990144 |0|0|0| D 18 CAL0000: Send SET Alarm ID 13 on device ExeMgr
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.994028 |0|0|0| D 18 CAL0000: Pkill Process just to make sure: ExeMgr*
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.994102 |0|0|0| D 18 CAL0000: StatusUpdate of Process ExeMgr State = 3 PID = 0
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.994544 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = AUTO_INIT
      Jun 28 20:21:04 columnStore ProcessMonitor[2464]: 04.994587 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = AUTO_INIT PID = 0
      Jun 28 20:21:05 columnStore ProcessMonitor[2464]: 05.006422 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 27 on device DBRM
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.009339 |0|0|0| D 18 CAL0000: StatusUpdate of Process ExeMgr State = 21 PID = 39209
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.009952 |0|0|0| D 18 CAL0000: ExeMgr PID is 39209
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.009956 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = PID_UPDATE
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.010061 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = PID_UPDATE PID = 39209
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.010589 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 21 on device ExeMgr
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.029837 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 25 on device ExeMgr
      Jun 28 20:21:06 columnStore ProcessMonitor[2464]: 06.047830 |0|0|0| D 18 CAL0000: Send CLEAR Alarm ID 13 on device ExeMgr
      Jun 28 20:21:07 columnStore ProcessMonitor[2464]: 07.039883 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = BUSY_INIT
      Jun 28 20:21:07 columnStore ProcessMonitor[2464]: 07.039920 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = BUSY_INIT PID = 39209
      Jun 28 20:21:07 columnStore ProcessMonitor[2464]: 07.042495 |0|0|0| D 18 CAL0000: statusControl: REQUEST RECEIVED: Set Process pm1/ExeMgr State = ACTIVE
      Jun 28 20:21:07 columnStore ProcessMonitor[2464]: 07.042547 |0|0|0| D 18 CAL0000: statusControl: Set Process pm1/ExeMgr State = ACTIVE PID = 39209
      Jun 28 20:21:09 columnStore ProcessMonitor[2464]: 09.011016 |0|0|0| D 18 CAL0000: Inform Process Mgr that process was restarted: ExeMgr

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              dleeyh Daniel Lee (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.