Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-19474

Histogram statistics are used even with optimizer_use_condition_selectivity=3

Details

    Description

      Dataset

      create table t1(a int);
      insert into t1 values (1),(2),(2),(3),(4);
      set use_stat_tables='preferably';
      set optimizer_use_condition_selectivity=4;
      set histogram_size= 255;
      analyze table t1;
      

      MariaDB [test]> analyze select * from t1 where a=2;
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      | id   | select_type | table | type | possible_keys | key  | key_len | ref  | rows | r_rows | filtered | r_filtered | Extra       |
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      |    1 | SIMPLE      | t1    | ALL  | NULL          | NULL | NULL    | NULL | 5    | 5.00   |    39.84 |      40.00 | Using where |
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      1 row in set (0.002 sec)
      

      So with optimizer_use_condition_selectivity=4 , we get filtered as 39.84

      MariaDB [test]> set optimizer_use_condition_selectivity=3;
      Query OK, 0 rows affected (0.000 sec)
       
      MariaDB [test]> analyze select * from t1 where a=2;
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      | id   | select_type | table | type | possible_keys | key  | key_len | ref  | rows | r_rows | filtered | r_filtered | Extra       |
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      |    1 | SIMPLE      | t1    | ALL  | NULL          | NULL | NULL    | NULL | 5    | 5.00   |    39.84 |      40.00 | Using where |
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      1 row in set (0.002 sec)
      

      So with optimizer_use_condition_selectivity=3 , we get * filtered as 39.84*

      MariaDB [test]> flush tables;  // Statistics are flushed, need to read again to use them
      Query OK, 0 rows affected (0.002 sec)
       
      MariaDB [test]> analyze select * from t1 where a=2;
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      | id   | select_type | table | type | possible_keys | key  | key_len | ref  | rows | r_rows | filtered | r_filtered | Extra       |
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      |    1 | SIMPLE      | t1    | ALL  | NULL          | NULL | NULL    | NULL | 5    | 5.00   |    25.00 |      40.00 | Using where |
      +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
      1 row in set (0.008 sec)
      

      So the problem here is we don't stop using histogram statistics with optimizer_use_condition_selectivity=3.
      When the statistics are flushed, we don't re-read the histogram statistics and that is why we get the different value for filtered after flushing the EITS

      Attachments

        Activity

          varun Varun Gupta (Inactive) created issue -
          varun Varun Gupta (Inactive) made changes -
          Field Original Value New Value
          Description Dataset

          {code:sql}
          create table t1(a int);
          insert into t1 values (1),(2),(2),(3),(4);
          set use_stat_tables='preferably';
          set optimizer_use_condition_selectivity=4;
          set histogram_size= 255;
          analyze table t1;
          {code}


          {noformat}
          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 39.84 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.002 sec)
          {noformat}

          So with optimizer_use_condition_selectivity=4 , we get * filtered as 39.84*

          {noformat}
          MariaDB [test]> set optimizer_use_condition_selectivity=3;
          Query OK, 0 rows affected (0.000 sec)

          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 39.84 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.002 sec)
          {noformat}

          So with optimizer_use_condition_selectivity=3 , we get * filtered as 39.84*

          {noformat}
          MariaDB [test]> flush tables; // Statistics are flushed, need to read again to use them
          Query OK, 0 rows affected (0.002 sec)

          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 25.00 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.008 sec)
          {noformat}

          So the problem here is we don't stop using histogram statistics with optimizer_use_condition_selectivity=3.
          When the statistics are flushed, we don't re-read the histogram statistics and that is why we get the different value for filtered after flushing the EITS

          varun Varun Gupta (Inactive) made changes -
          Status Open [ 1 ] Confirmed [ 10101 ]
          varun Varun Gupta (Inactive) made changes -
          Description Dataset

          {code:sql}
          create table t1(a int);
          insert into t1 values (1),(2),(2),(3),(4);
          set use_stat_tables='preferably';
          set optimizer_use_condition_selectivity=4;
          set histogram_size= 255;
          analyze table t1;
          {code}


          {noformat}
          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 39.84 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.002 sec)
          {noformat}

          So with optimizer_use_condition_selectivity=4 , we get * filtered as 39.84*

          {noformat}
          MariaDB [test]> set optimizer_use_condition_selectivity=3;
          Query OK, 0 rows affected (0.000 sec)

          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 39.84 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.002 sec)
          {noformat}

          So with optimizer_use_condition_selectivity=3 , we get * filtered as 39.84*

          {noformat}
          MariaDB [test]> flush tables; // Statistics are flushed, need to read again to use them
          Query OK, 0 rows affected (0.002 sec)

          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 25.00 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.008 sec)
          {noformat}

          So the problem here is we don't stop using histogram statistics with optimizer_use_condition_selectivity=3.
          When the statistics are flushed, we don't re-read the histogram statistics and that is why we get the different value for filtered after flushing the EITS

          Dataset

          {code:sql}
          create table t1(a int);
          insert into t1 values (1),(2),(2),(3),(4);
          set use_stat_tables='preferably';
          set optimizer_use_condition_selectivity=4;
          set histogram_size= 255;
          analyze table t1;
          {code}


          {noformat}
          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 39.84 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.002 sec)
          {noformat}

          So with optimizer_use_condition_selectivity=4 , we get *filtered as 39.84*

          {noformat}
          MariaDB [test]> set optimizer_use_condition_selectivity=3;
          Query OK, 0 rows affected (0.000 sec)

          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 39.84 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.002 sec)
          {noformat}

          So with optimizer_use_condition_selectivity=3 , we get * filtered as 39.84*

          {noformat}
          MariaDB [test]> flush tables; // Statistics are flushed, need to read again to use them
          Query OK, 0 rows affected (0.002 sec)

          MariaDB [test]> analyze select * from t1 where a=2;
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | id | select_type | table | type | possible_keys | key | key_len | ref | rows | r_rows | filtered | r_filtered | Extra |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          | 1 | SIMPLE | t1 | ALL | NULL | NULL | NULL | NULL | 5 | 5.00 | 25.00 | 40.00 | Using where |
          +------+-------------+-------+------+---------------+------+---------+------+------+--------+----------+------------+-------------+
          1 row in set (0.008 sec)
          {noformat}

          So the problem here is we don't stop using histogram statistics with optimizer_use_condition_selectivity=3.
          When the statistics are flushed, we don't re-read the histogram statistics and that is why we get the different value for filtered after flushing the EITS

          varun Varun Gupta (Inactive) made changes -
          Status Confirmed [ 10101 ] In Progress [ 3 ]
          varun Varun Gupta (Inactive) made changes -
          Assignee Varun Gupta [ varun ] Igor Babaev [ igor ]
          Status In Progress [ 3 ] In Review [ 10002 ]
          julien.fritsch Julien Fritsch made changes -
          Fix Version/s 10.1 [ 16100 ]
          varun Varun Gupta (Inactive) made changes -
          Assignee Igor Babaev [ igor ] Michael Widenius [ monty ]
          varun Varun Gupta (Inactive) made changes -
          Assignee Igor Babaev [ igor ] Michael Widenius [ monty ]
          monty Michael Widenius made changes -
          Status In Review [ 10002 ] Stalled [ 10000 ]
          monty Michael Widenius made changes -
          Assignee Michael Widenius [ monty ] Varun Gupta [ varun ]
          varun Varun Gupta (Inactive) made changes -
          Status Stalled [ 10000 ] In Progress [ 3 ]
          varun Varun Gupta (Inactive) made changes -
          Fix Version/s 10.2.37 [ 25112 ]
          Fix Version/s 10.3.28 [ 25111 ]
          Fix Version/s 10.4.18 [ 25110 ]
          Fix Version/s 10.5.9 [ 25109 ]
          Fix Version/s 10.2 [ 14601 ]
          Fix Version/s 10.3 [ 22126 ]
          Fix Version/s 10.4 [ 22408 ]
          Resolution Fixed [ 1 ]
          Status In Progress [ 3 ] Closed [ 6 ]
          dbart Daniel Bartholomew made changes -
          Fix Version/s 10.2.38 [ 25207 ]
          Fix Version/s 10.2.37 [ 25112 ]
          psergei Sergei Petrunia made changes -
          Fix Version/s 10.5.10 [ 25204 ]
          Fix Version/s 10.4.19 [ 25205 ]
          Fix Version/s 10.3.29 [ 25206 ]
          Fix Version/s 10.5.9 [ 25109 ]
          Fix Version/s 10.4.18 [ 25110 ]
          Fix Version/s 10.3.28 [ 25111 ]
          serg Sergei Golubchik made changes -
          Workflow MariaDB v3 [ 96729 ] MariaDB v4 [ 156192 ]

          People

            varun Varun Gupta (Inactive)
            varun Varun Gupta (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Git Integration

                Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.