Details
-
New Feature
-
Status: Stalled (View Workflow)
-
Major
-
Resolution: Unresolved
-
22.08.1
-
2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11
Description
As of 22.08.01 MCS does DISTINCT processing TupleAnnexStep. This step leverages hashmap for the purpose. This solution is simple but it:
- lacks scalability
- can't leverage disk-based capabilities of RowStorage class used by GROUP BY
- ResourceManager that accounts RAM consumption doesn't counts the hashmap
This issue is about a new DISTINCT implementation(presumably based on RowStorage) that:
- can do external DISTINCT spilling on disk if necessary,
- ResourceManager counts the implemenation RAM consumption
- scales(this might be tricky since DISTINCT processing overlaps with ORDER BY)
Attachments
Issue Links
Activity
Fix Version/s | 22.11.01 [ 28458 ] | |
Fix Version/s | 22.08.3 [ 28456 ] |
Status | Open [ 1 ] | In Progress [ 3 ] |
Rank | Ranked higher |
Rank | Ranked higher |
Rank | Ranked higher |
Sprint | 2021-17 [ 614 ] | 2021-17, 2021-18 [ 614, 672 ] |
Fix Version/s | 23.02 [ 28209 ] | |
Fix Version/s | 23.03.1 [ 28458 ] |
Sprint | 2021-17, 2022-22 [ 614, 672 ] | 2021-17, 2022-22, 2022-23 [ 614, 672, 686 ] |
Assignee | Alexey Antipovsky [ JIRAUSER47594 ] | Roman [ drrtuy ] |
Sprint | 2021-17, 2022-22, 2022-23 [ 614, 672, 686 ] | 2021-17, 2022-22, 2022-23, 2022-24 [ 614, 672, 686, 698 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4 [ 614, 672, 686, 698 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5 [ 614, 672, 686, 698, 702 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5 [ 614, 672, 686, 698, 702 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6 [ 614, 672, 686, 698, 702, 706 ] |
Fix Version/s | 23.08 [ 28540 ] | |
Fix Version/s | 23.02 [ 28209 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6 [ 614, 672, 686, 698, 702, 706 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7 [ 614, 672, 686, 698, 702, 706, 726 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7 [ 614, 672, 686, 698, 702, 706, 726 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8 [ 614, 672, 686, 698, 702, 706, 726, 728 ] |
Status | In Progress [ 3 ] | Stalled [ 10000 ] |
Labels | rf_big_data |
Labels | rf_big_data | rm_big_data |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8 [ 614, 672, 686, 698, 702, 706, 726, 728 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-9 [ 614, 672, 686, 698, 702, 706, 726, 728, 733 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-9 [ 614, 672, 686, 698, 702, 706, 726, 728, 733 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10 [ 614, 672, 686, 698, 702, 706, 726, 728, 734 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10 [ 614, 672, 686, 698, 702, 706, 726, 728, 734 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11, 2023-13 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737, 748 ] |
Sprint | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11, 2023-12 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737, 748 ] | 2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11 [ 614, 672, 686, 698, 702, 706, 726, 728, 734, 737 ] |
Labels | rm_big_data | rm_big_data triage |
Labels | rm_big_data triage | rm_big_data |