Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-26892

JSON histograms become invalid with a specific (corrupt) value in table

Details

    • Bug
    • Status: Closed (View Workflow)
    • Minor
    • Resolution: Fixed
    • N/A
    • 10.7.1
    • Optimizer
    • None

    Description

      create or replace table t (a varchar(32)) DEFAULT CHARSET=cp1257;
      set histogram_type= JSON_HB, histogram_size= 1;
      insert into t values ('foo'),(unhex('9C'));
      analyze table t persistent for all;
       
      select * from mysql.column_stats where table_name = 't';
       
      select * from t;
       
      # Cleanup
      drop table t;
      

      preview-10.7-MDEV-26519-json-histograms 508f5f3f1

      MariaDB [test]> select * from mysql.column_stats where table_name = 't';
      +---------+------------+-------------+-----------+-----------+-------------+------------+---------------+-----------+-----------+---------------------------------------------------------------+
      | db_name | table_name | column_name | min_value | max_value | nulls_ratio | avg_length | avg_frequency | hist_size | hist_type | histogram                                                     |
      +---------+------------+-------------+-----------+-----------+-------------+------------+---------------+-----------+-----------+---------------------------------------------------------------+
      | test    | t          | a           | foo       | �          |      0.0000 |     2.0000 |        1.0000 |         0 | JSON_HB   | {
        "histogram_hb_v2": [
          {
            "start": "foo"
          ]
        } |
      +---------+------------+-------------+-----------+-----------+-------------+------------+---------------+-----------+-----------+---------------------------------------------------------------+
      1 row in set (0.000 sec)
       
      MariaDB [test]> 
      MariaDB [test]> select * from t;
      ERROR 4183 (HY000): Failed to parse histogram: Root JSON element must be a JSON object at offset 0.
      

      The original test didn't insert UNHEX-ed values. I didn't investigate how exactly the symbol ended up in the table, most likely through value truncation in non-strict mode, as we have seen before.

      Between the invalid value in the column and the meaningless histogram size, I consider it as much a corner case as it gets. It is still worth investigation though, at least to make sure it is really only limited to histogram size 1.

      Attachments

        Issue Links

          Activity

            elenst Elena Stepanova created issue -
            elenst Elena Stepanova made changes -
            Field Original Value New Value
            psergei Sergei Petrunia made changes -
            Fix Version/s 10.7.1 [ 26120 ]
            Fix Version/s 10.7 [ 24805 ]
            Resolution Fixed [ 1 ]
            Status Open [ 1 ] Closed [ 6 ]
            serg Sergei Golubchik made changes -
            Workflow MariaDB v3 [ 126601 ] MariaDB v4 [ 159803 ]

            People

              psergei Sergei Petrunia
              elenst Elena Stepanova
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.