Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-5643

Make disk joins faster and more memory efficient

    XMLWordPrintable

Details

    • New Feature
    • Status: Open (View Workflow)
    • Major
    • Resolution: Unresolved
    • None
    • 26.10
    • None
    • None

    Description

      This is a followup to MCOL-5627.

      MCOL-5627 fix introduces a performance degradation in disk joins. This is due to less precise splitting of data because of memory constraints. We keep more of "small side" and more of corresponding "large side" data in single file and this makes hash join to behave more like nested loop join.

      MCOL-5627 contains a hint on how to solve this problem. We can prepend RGData with a vector of uin32_t hashes for each row in the RowGroup and use these to filter out large side RGData's that are not needed for currently processed small side.

      Attachments

        Issue Links

          Activity

            People

              sergey.zefirov Sergey Zefirov
              sergey.zefirov Sergey Zefirov
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - 60d
                  60d
                  Remaining:
                  Remaining Estimate - 60d
                  60d
                  Logged:
                  Time Spent - Not Specified
                  Not Specified

                  Git Integration

                    Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.