Uploaded image for project: 'MariaDB ColumnStore'
  1. MariaDB ColumnStore
  2. MCOL-2223

Engine independent statistics for Columnstore

Details

    Description

      CS now has a very rudimentary query optimization capabilities and we want to improve the situtation. We consider to use Server's optimizer for the purpose but the Server needs statistics namely values distribution histograms and Number of Distinct Values distribution histograms.
      There are different levels of complexity for the task:

      • implement standalone segment files reader that in the end populates both mysql.column_stats and mysql.table_stats using out of band mariadb client connection
      • implement ANALYZE TABLE functionality for Columnstore engine
      • implement ANALYZE TABLE and Histograms with equal-width bins for values distribution histograms(similar to MDEV-12313) together with NDV histograms to decrease I/O

      We expect to have both unit and regression tests but this is optional.

      Attachments

        Issue Links

          Activity

            drrtuy Roman created issue -
            drrtuy Roman made changes -
            Field Original Value New Value
            Labels gsoc19
            serg Sergei Golubchik made changes -
            Summary Engine independant statistics for Columnstore Engine independent statistics for Columnstore
            drrtuy Roman made changes -
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s N/A [ 22302 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 1.6 [ 23712 ]
            Fix Version/s N/A [ 22302 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked lower
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked lower
            cvicentiu Vicențiu Ciorbaru made changes -
            Labels gsoc19 gsoc19 gsoc20
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s Icebox [ 22302 ]
            Fix Version/s 1.6 [ 23712 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 1.6 [ 24715 ]
            Fix Version/s Icebox [ 22302 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Epic Link MCOL-1049 [ 64464 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 1.5.4 [ 24413 ]
            Fix Version/s 1.6 [ 24715 ]
            yitanz Yitan Ze made changes -
            Comment [ Hello,
            I'm applying for GSoP, and this task got my attention. I've done a similar project in a big data search/analytics platform too. On that platform, I implemented 40+ search commands including calculating individual/cumulative statistical value for each line. I've heard about MariaDB for a long time and I hope to contribute to this community. Are there still any available spots for this position? If yes, I would like to know more about this task. My email is yitanze@gmail.com. ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Labels gsoc19 gsoc20 Compatibility
            David.Hall David Hall (Inactive) made changes -
            Fix Version/s 1.5.6 [ 24716 ]
            Fix Version/s 1.5.4 [ 24413 ]
            David.Hall David Hall (Inactive) made changes -
            Fix Version/s 1.5.4 [ 24413 ]
            Fix Version/s 1.5.6 [ 24716 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked lower
            David.Hall David Hall (Inactive) made changes -
            Fix Version/s 1.5.5 [ 24414 ]
            Fix Version/s 1.5.4 [ 24413 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s 5.5.1 [ 25030 ]
            Fix Version/s 5.5.1 [ 25030 ]
            Fix Version/s 1.5.5 [ 24414 ]
            David.Hall David Hall (Inactive) made changes -
            Fix Version/s 5.5.1 [ 25030 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Fix Version/s Icebox [ 22302 ]
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked lower
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            toddstoffel Todd Stoffel (Inactive) made changes -
            Rank Ranked higher
            drrtuy Roman made changes -
            Resolution Won't Do [ 10201 ]
            Status Open [ 1 ] Closed [ 6 ]

            People

              drrtuy Roman
              drrtuy Roman
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.