Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-6159

All nodes are offline when creating cluster on Debian

    XMLWordPrintable

Details

    • Bug
    • Status: Confirmed (View Workflow)
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None
    • 2025-8, 25.08 - 3, 2025-9

    Description

      When trying to deploy a cluster of 3 nodes with columnstore-ansible-aws script
      on debian 12 cluster cannot start properly, all nodes are offline:

      admin@ip-10-0-1-64:~$ sudo mcs status
      {
        "timestamp": "2025-08-28 18:16:15.794504",
        "ip-10-0-1-64.us-west-2.compute.internal": {
          "timestamp": "2025-08-28 18:16:15.800390",
          "uptime": 427,
          "dbrm_mode": "offline",
          "cluster_mode": "offline",
          "dbroots": [
            "1"
          ],
          "module_id": 1,
          "services": []
        },
        "ip-10-0-1-223.us-west-2.compute.internal": {
          "timestamp": "2025-08-28 18:16:15.844568",
          "uptime": 427,
          "dbrm_mode": "offline",
          "cluster_mode": "offline",
          "dbroots": [
            "2"
          ],
          "module_id": 2,
          "services": []
        },
        "ip-10-0-1-201.us-west-2.compute.internal": {
          "timestamp": "2025-08-28 18:16:15.881698",
          "uptime": 426,
          "dbrm_mode": "offline",
          "cluster_mode": "offline",
          "dbroots": [
            "3"
          ],
          "module_id": 3,
          "services": []
        },
        "num_nodes": 3
      }
      

      there is ConnectionRefusedError in logs:
      https://mariadb-foundation.sentry.io/issues/59954515/?query=&referrer=issue-stream

      28/Aug/2025 18:13:18 [ERROR] (root)

      {CP Server Thread-5} Cannot establish or use DBRM connection.
      Traceback (most recent call last):
      File "/usr/share/columnstore/cmapi/mcs_node_control/models/node_status.py", line 38, in get_cluster_mode
      with DBRM() as dbrm:
      File "/usr/share/columnstore/cmapi/mcs_node_control/models/dbrm.py", line 54, in _enter_
      self.connect()
      File "/usr/share/columnstore/cmapi/mcs_node_control/models/dbrm.py", line 48, in connect
      self.dbrm_socket.connect(dbrm_host, dbrm_port)
      File "/usr/share/columnstore/cmapi/mcs_node_control/models/dbrm_socket.py", line 208, in connect
      self._socket.connect((host, port))
      ConnectionRefusedError: [Errno 111] Connection refused
      28/Aug/2025 18:13:19 [DEBUG] (cmapi_server) {CP Server Thread-5}

      get_status returns

      {'timestamp': '2025-08-28 18:13:18.967212', 'uptime': 250, 'dbrm_mode': 'slave', 'cluster_mode': 'readonly', 'dbroots': ['1'], 'module_id': 1, 'services': []}

      28/Aug/2025 18:13:19 [INFO] (access_logger)

      {CP Server Thread-5}

      Finished processing incoming GET request from "10.0.1.64" to "/cmapi/0.4.0/node/status" in 0.0371 seconds. uid: 9af8c2a9-3939-424d-acee-3f2bbffd3ee3

      Attachments

        1. screenshot-1.png
          655 kB
          Aleksei Bukhalov
        2. screenshot-2.png
          631 kB
          Aleksei Bukhalov

        Issue Links

          Activity

            People

              AlexanderPresniakov Alexander Presniakov
              abukhalov Aleksei Bukhalov
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.