Details

    • 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11

    Description

      As of 22.08.01 MCS does DISTINCT processing TupleAnnexStep. This step leverages hashmap for the purpose. This solution is simple but it:

      • lacks scalability
      • can't leverage disk-based capabilities of RowStorage class used by GROUP BY
      • ResourceManager that accounts RAM consumption doesn't counts the hashmap

      This issue is about a new DISTINCT implementation(presumably based on RowStorage) that:

      • can do external DISTINCT spilling on disk if necessary,
      • ResourceManager counts the implemenation RAM consumption
      • scales(this might be tricky since DISTINCT processing overlaps with ORDER BY)

      Attachments

        Issue Links

          Activity

            drrtuy Roman created issue -
            drrtuy Roman made changes -
            Field Original Value New Value
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Fix Version/s 22.11.01 [ 28458 ]
            Fix Version/s 22.08.3 [ 28456 ]
            alexey.antipovsky Alexey Antipovsky made changes -
            Status Open [ 1 ] In Progress [ 3 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17 [ 614 ] 2021-17, 2021-18 [ 614, 672 ]
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Fix Version/s 23.02 [ 28209 ]
            Fix Version/s 23.03.1 [ 28458 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22 [ 614, 672 ] 2021-17, 2022-22, 2022-23 [ 614, 672, 686 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Assignee Alexey Antipovsky [ JIRAUSER47594 ] Roman [ drrtuy ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23 [ 614, 672, 686 ] 2021-17, 2022-22, 2022-23, 2022-24 [ 614, 672, 686, 698 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4 [ 614, 672, 686, 698 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5 [ 614, 672, 686, 698, 702 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5 [ 614, 672, 686, 698, 702 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6 [ 614, 672, 686, 698, 702, 706 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 23.08 [ 28540 ]
            Fix Version/s 23.02 [ 28209 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6 [ 614, 672, 686, 698, 702, 706 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7 [ 614, 672, 686, 698, 702, 706, 726 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7 [ 614, 672, 686, 698, 702, 706, 726 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8 [ 614, 672, 686, 698, 702, 706, 726, 728 ]
            drrtuy Roman made changes -
            Status In Progress [ 3 ] Stalled [ 10000 ]
            Andrey Andrey Piskunov (Inactive) made changes -
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Labels rf_big_data
            alexey.vorovich alexey vorovich (Inactive) made changes -
            Labels rf_big_data rm_big_data
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8 [ 614, 672, 686, 698, 702, 706, 726, 728 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-9 [ 614, 672, 686, 698, 702, 706, 726, 728, 733 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-9 [ 614, 672, 686, 698, 702, 706, 726, 728, 733 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10 [ 614, 672, 686, 698, 702, 706, 726, 728, 734 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10 [ 614, 672, 686, 698, 702, 706, 726, 728, 734 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737 ]
            leonid.fedorov Leonid Fedorov made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11, 2023-13 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737, 748 ]
            leonid.fedorov Leonid Fedorov made changes -
            Sprint 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11, 2023-12 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737, 748 ] 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737 ]
            julien.fritsch Julien Fritsch made changes -
            Labels rm_big_data rm_big_data triage
            drrtuy Roman made changes -
            julien.fritsch Julien Fritsch made changes -
            Labels rm_big_data triage rm_big_data

            People

              drrtuy Roman
              drrtuy Roman
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.