Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-5749

CMAPI | ConnectionRefusedError unhandled exception

Details

    • Bug
    • Status: Closed (View Workflow)
    • Major
    • Resolution: Fixed
    • 23.10.1, 23.02.8
    • 23.02.9, 23.10.2
    • None
    • 2024-1

    Description

      The newest version of cmapi change an exception type to ConnectionRefusedError from socket.timeout. Essentially triggering workernode/ loadbrm to reload every 10-20 seconds.
      During this time, on large S3 installations with slow networking, DBRMs, specially the extent map might not be downloaded/processed in time so the map can be truncated at run time share memory.

      The solution is to avoid ConnectionRefusedError error during startup polling to cause this restart/reloading.

      /usr/share/columnstore/cmapi/cmapi_server/managers/process.py : 173

      173                 except (socket.timeout,ConnectionRefusedError) :
          174                     logging.debug(
          175                         f'"{name}" {workernodes[name]["IPAddr"]}:'
          176                         f'{workernodes[name]["Port"]} not started yet.'
          177                     )
          178                 else:
          179                     # delete started workernode from workernodes dict
      

      Attachments

        Activity

          allen.herrera Allen Herrera created issue -
          julien.fritsch Julien Fritsch made changes -
          Field Original Value New Value
          Assignee Leonid Fedorov [ JIRAUSER48443 ]
          drrtuy Roman made changes -
          Assignee Leonid Fedorov [ JIRAUSER48443 ] Roman [ drrtuy ]
          drrtuy Roman made changes -
          Status Open [ 1 ] In Progress [ 3 ]
          drrtuy Roman made changes -
          Assigned for Review Leonid Fedorov [ JIRAUSER48443 ]
          Fix Version/s 23.02.9 [ 29805 ]
          Fix Version/s 23.10.2 [ 29807 ]
          Sprint 2024-1 [ 755 ]
          Affects Version/s 23.02.8 [ 29635 ]
          Affects Version/s 23.10.1 [ 29105 ]
          drrtuy Roman made changes -
          Status In Progress [ 3 ] In Review [ 10002 ]
          leonid.fedorov Leonid Fedorov made changes -
          Status In Review [ 10002 ] In Testing [ 10301 ]
          kirill.perov@mariadb.com Kirill Perov (Inactive) added a comment - - edited

          drrtuy, allen.herrera, leonid.fedorov, I don't understand how can I test it
          Please provide info.

          kirill.perov@mariadb.com Kirill Perov (Inactive) added a comment - - edited drrtuy , allen.herrera , leonid.fedorov , I don't understand how can I test it Please provide info.

          We have the prove this simple changeset works for our customer in fields, my suggestion is to avoid wasting QA power for this particular case

          leonid.fedorov Leonid Fedorov added a comment - We have the prove this simple changeset works for our customer in fields, my suggestion is to avoid wasting QA power for this particular case
          leonid.fedorov Leonid Fedorov made changes -
          Resolution Fixed [ 1 ]
          Status In Testing [ 10301 ] Closed [ 6 ]
          AirFocus AirFocus made changes -
          Labels triage
          mariadb-jira-automation Jira Automation (IT) made changes -
          Zendesk Related Tickets 201918
          Zendesk active tickets 201918

          People

            drrtuy Roman
            allen.herrera Allen Herrera
            Leonid Fedorov Leonid Fedorov
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Git Integration

                Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.