Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-21131

Histograms: Most-Common-Values histograms

    XMLWordPrintable

    Details

    • Type: Task
    • Status: Open (View Workflow)
    • Priority: Major
    • Resolution: Unresolved
    • Fix Version/s: None
    • Component/s: None
    • Labels:

      Description

      Often, columns have a few values that are very frequent.
      For this case, one can use a histogram that is a collection of (value, frequency) pairs.

      There are [approximate] algorithms that allow to find most common values while using a limited amount of memory and/or basing on sample of the table.

      Another important property is that most-common-value collection/storage can be generalized to tuples of multiple columns.

        Attachments

          Issue Links

            Activity

              People

              Assignee:
              Unassigned Unassigned
              Reporter:
              psergei Sergei Petrunia
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Dates

                Created:
                Updated:

                  Git Integration