Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-28846

Poor performance when rowid filter contains no elements

Details

    Description

      See details in the attached test case. If we create the right spread of values in the key and pick the range just right, when rowid_filter optimizer switch is enabled, the optimizer will sacrifice a part of the key for the rowid_filter. As a result we use only the u_delete_user and end up examining 5K+ rows out of 10K in the table. If we disable rowid_filter, the user_name constraint is being used, and we examine only 1 row.

      Attachments

        Issue Links

          Activity

            spachev Sasha Pachev created issue -
            spachev Sasha Pachev made changes -
            Field Original Value New Value
            Affects Version/s 10.4 [ 22408 ]
            alice Alice Sherepa made changes -
            Fix Version/s 10.4 [ 22408 ]
            alice Alice Sherepa made changes -
            Assignee Sergei Petrunia [ psergey ]
            alice Alice Sherepa made changes -
            Status Open [ 1 ] Confirmed [ 10101 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            igor Igor Babaev (Inactive) made changes -
            Summary rowid_filter optimization results in failure to use the full length of the key Poor performance when rowid filter contains no elements
            psergei Sergei Petrunia made changes -
            Component/s Optimizer [ 10200 ]
            igor Igor Babaev (Inactive) made changes -
            Assignee Sergei Petrunia [ psergey ] Igor Babaev [ igor ]
            igor Igor Babaev (Inactive) made changes -
            Status Confirmed [ 10101 ] In Progress [ 3 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            ralf.gebhardt Ralf Gebhardt made changes -
            Priority Major [ 3 ] Critical [ 2 ]
            alice Alice Sherepa made changes -
            Labels not-10.5

            Here' a synthetic test case that demonstrate the problem:

            --source include/have_sequence.inc
             
            create table t1 (
              pk int primary key auto_increment,
              nm varchar(32),
              fl1 tinyint default 0,
              fl2 tinyint default 0,
              index idx1(nm, fl1),
              index idx2(fl2)
            ) engine=myisam;
             
            create table name (
              pk int primary key auto_increment,
              nm bigint
            ) engine=myisam;
             
            create table flag2 (
              pk int primary key auto_increment,
              fl2 tinyint 
            ) engine=myisam;
              
            insert into name(nm) select seq from seq_1_to_1000 order by rand(17);
            insert into flag2(fl2) select seq mod 2 from seq_1_to_1000 order by rand(19);
             
            insert into t1(nm,fl2)
              select nm, fl2 from name, flag2 where name.pk = flag2.pk;
             
            drop table name, flag2;
             
            select * from t1 limit 20;
             
            analyze table t1 persistent for all;
             
            let $a= `select concat((select nm from t1 order by RAND(13) limit 1),'%')`;
            eval select '$a' as a;
             
            set optimizer_switch='rowid_filter=off';
            eval
            explain
            select * from t1  where nm like '$a' AND fl2 = 0;
            show status like 'Last_query_cost';
            eval
            analyze format=json 
            select * from t1  where nm like '$a' AND fl2 = 0;
            eval
            select * from t1  where nm like '$a' AND fl2 = 0;
             
            set optimizer_switch='rowid_filter=on';
            eval
            explain
            select * from t1  where nm like '$a' AND fl2 = 0;
            show status like 'Last_query_cost';
            eval
            analyze format=json 
            select * from t1  where nm like '$a' AND fl2 = 0;
            eval
            select * from t1  where nm like '$a' AND fl2 = 0;
             
            set optimizer_switch='rowid_filter=default';
             
            drop table t1;
            

            When running this test case we see:

            set optimizer_switch='rowid_filter=off';
            explain
            select * from t1  where nm like '683%' AND fl2 = 0;
            id      select_type     table   type    possible_keys   key     key_len ref     rows    Extra
            1       SIMPLE  t1      range   idx1,idx2       idx1    35      NULL    1       Using index condition; Using where
            show status like 'Last_query_cost';
            Variable_name   Value
            Last_query_cost 2.707390
            analyze format=json 
            select * from t1  where nm like '683%' AND fl2 = 0;
            ANALYZE
            {
              "query_block": {
                "select_id": 1,
                "r_loops": 1,
                "r_total_time_ms": 0.0397,
                "table": {
                  "table_name": "t1",
                  "access_type": "range",
                  "possible_keys": ["idx1", "idx2"],
                  "key": "idx1",
                  "key_length": "35",
                  "used_key_parts": ["nm"],
                  "r_loops": 1,
                  "rows": 1,
                  "r_rows": 1,
                  "r_total_time_ms": 0.0257,
                  "filtered": 49.2,
                  "r_filtered": 100,
                  "index_condition": "t1.nm like '683%'",
                  "attached_condition": "t1.fl2 = 0"
                }
              }
            }
            select * from t1  where nm like '683%' AND fl2 = 0;
            pk      nm      fl1     fl2
            232     683     0       0
            set optimizer_switch='rowid_filter=on';
            explain
            select * from t1  where nm like '683%' AND fl2 = 0;
            id      select_type     table   type    possible_keys   key     key_len ref     rows    Extra
            1       SIMPLE  t1      ref|filter      idx1,idx2       idx2|idx1       2|35    const   492 (0%)        Using where; Using rowid filter
            show status like 'Last_query_cost';
            Variable_name   Value
            Last_query_cost 1.431790
            analyze format=json 
            select * from t1  where nm like '683%' AND fl2 = 0;
            ANALYZE
            {
              "query_block": {
                "select_id": 1,
                "r_loops": 1,
                "r_total_time_ms": 0.2755,
                "table": {
                  "table_name": "t1",
                  "access_type": "ref",
                  "possible_keys": ["idx1", "idx2"],
                  "key": "idx2",
                  "key_length": "2",
                  "used_key_parts": ["fl2"],
                  "ref": ["const"],
                  "rowid_filter": {
                    "range": {
                      "key": "idx1",
                      "used_key_parts": ["nm"]
                    },
                    "rows": 1,
                    "selectivity_pct": 0.1,
                    "r_rows": 1,
                    "r_selectivity_pct": 0.2,
                    "r_buffer_size": 0,
                    "r_filling_time_ms": 0.0203
                  },
                  "r_loops": 1,
                  "rows": 492,
                  "r_rows": 1,
                  "r_total_time_ms": 0.2607,
                  "filtered": 0.1,
                  "r_filtered": 100,
                  "attached_condition": "t1.nm like '683%'"
                }
              }
            }
            

            Why does the optimizer choose the plan that employs rowid filter optimization when it's allowed, though the execution by this plan takes much more time than the execution without rowid filter ("r_total_time_ms": 0.2755 vs "r_total_time_ms": 0.0257)? Why is the cost of the plan that uses rowid filter is so cheap?

            igor Igor Babaev (Inactive) added a comment - Here' a synthetic test case that demonstrate the problem: --source include/have_sequence.inc   create table t1 ( pk int primary key auto_increment, nm varchar (32), fl1 tinyint default 0, fl2 tinyint default 0, index idx1(nm, fl1), index idx2(fl2) ) engine=myisam;   create table name ( pk int primary key auto_increment, nm bigint ) engine=myisam;   create table flag2 ( pk int primary key auto_increment, fl2 tinyint ) engine=myisam; insert into name (nm) select seq from seq_1_to_1000 order by rand(17); insert into flag2(fl2) select seq mod 2 from seq_1_to_1000 order by rand(19);   insert into t1(nm,fl2) select nm, fl2 from name , flag2 where name .pk = flag2.pk;   drop table name , flag2;   select * from t1 limit 20;   analyze table t1 persistent for all ;   let $a= ` select concat(( select nm from t1 order by RAND(13) limit 1), '%' )`; eval select '$a' as a;   set optimizer_switch= 'rowid_filter=off' ; eval explain select * from t1 where nm like '$a' AND fl2 = 0; show status like 'Last_query_cost' ; eval analyze format=json select * from t1 where nm like '$a' AND fl2 = 0; eval select * from t1 where nm like '$a' AND fl2 = 0;   set optimizer_switch= 'rowid_filter=on' ; eval explain select * from t1 where nm like '$a' AND fl2 = 0; show status like 'Last_query_cost' ; eval analyze format=json select * from t1 where nm like '$a' AND fl2 = 0; eval select * from t1 where nm like '$a' AND fl2 = 0;   set optimizer_switch= 'rowid_filter=default' ;   drop table t1; When running this test case we see: set optimizer_switch='rowid_filter=off'; explain select * from t1 where nm like '683%' AND fl2 = 0; id select_type table type possible_keys key key_len ref rows Extra 1 SIMPLE t1 range idx1,idx2 idx1 35 NULL 1 Using index condition; Using where show status like 'Last_query_cost'; Variable_name Value Last_query_cost 2.707390 analyze format=json select * from t1 where nm like '683%' AND fl2 = 0; ANALYZE { "query_block": { "select_id": 1, "r_loops": 1, "r_total_time_ms": 0.0397, "table": { "table_name": "t1", "access_type": "range", "possible_keys": ["idx1", "idx2"], "key": "idx1", "key_length": "35", "used_key_parts": ["nm"], "r_loops": 1, "rows": 1, "r_rows": 1, "r_total_time_ms": 0.0257, "filtered": 49.2, "r_filtered": 100, "index_condition": "t1.nm like '683%'", "attached_condition": "t1.fl2 = 0" } } } select * from t1 where nm like '683%' AND fl2 = 0; pk nm fl1 fl2 232 683 0 0 set optimizer_switch='rowid_filter=on'; explain select * from t1 where nm like '683%' AND fl2 = 0; id select_type table type possible_keys key key_len ref rows Extra 1 SIMPLE t1 ref|filter idx1,idx2 idx2|idx1 2|35 const 492 (0%) Using where; Using rowid filter show status like 'Last_query_cost'; Variable_name Value Last_query_cost 1.431790 analyze format=json select * from t1 where nm like '683%' AND fl2 = 0; ANALYZE { "query_block": { "select_id": 1, "r_loops": 1, "r_total_time_ms": 0.2755, "table": { "table_name": "t1", "access_type": "ref", "possible_keys": ["idx1", "idx2"], "key": "idx2", "key_length": "2", "used_key_parts": ["fl2"], "ref": ["const"], "rowid_filter": { "range": { "key": "idx1", "used_key_parts": ["nm"] }, "rows": 1, "selectivity_pct": 0.1, "r_rows": 1, "r_selectivity_pct": 0.2, "r_buffer_size": 0, "r_filling_time_ms": 0.0203 }, "r_loops": 1, "rows": 492, "r_rows": 1, "r_total_time_ms": 0.2607, "filtered": 0.1, "r_filtered": 100, "attached_condition": "t1.nm like '683%'" } } } Why does the optimizer choose the plan that employs rowid filter optimization when it's allowed, though the execution by this plan takes much more time than the execution without rowid filter ("r_total_time_ms": 0.2755 vs "r_total_time_ms": 0.0257)? Why is the cost of the plan that uses rowid filter is so cheap?
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            lizardo Luis Eduardo Oliveira Lizardo (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            marko Marko Mäkelä made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            julien.fritsch Julien Fritsch made changes -
            Status In Progress [ 3 ] Needs Feedback [ 10501 ]
            julien.fritsch Julien Fritsch made changes -
            Assignee Igor Babaev [ igor ] Rob Schwyzer [ JIRAUSER40844 ]
            julien.fritsch Julien Fritsch made changes -
            Assignee Rob Schwyzer [ JIRAUSER40844 ] Igor Babaev [ igor ]
            Status Needs Feedback [ 10501 ] Open [ 1 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            julien.fritsch Julien Fritsch made changes -
            Status Open [ 1 ] Confirmed [ 10101 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            igor Igor Babaev (Inactive) made changes -
            Status Confirmed [ 10101 ] In Progress [ 3 ]
            igor Igor Babaev (Inactive) made changes -
            Status In Progress [ 3 ] In Testing [ 10301 ]
            igor Igor Babaev (Inactive) made changes -
            Status In Testing [ 10301 ] Stalled [ 10000 ]
            igor Igor Babaev (Inactive) made changes -
            Status Stalled [ 10000 ] In Progress [ 3 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            igor Igor Babaev (Inactive) made changes -
            Assignee Igor Babaev [ igor ] Oleksandr Byelkin [ sanja ]
            Status In Progress [ 3 ] In Review [ 10002 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -

            OK to push

            sanja Oleksandr Byelkin added a comment - OK to push
            sanja Oleksandr Byelkin made changes -
            Assignee Oleksandr Byelkin [ sanja ] Igor Babaev [ igor ]
            Status In Review [ 10002 ] Stalled [ 10000 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -

            A fix for this bug was pushed into 10.4. A slightly different patch was applied to 10.5. The latter should be merged upstream.

            igor Igor Babaev (Inactive) added a comment - A fix for this bug was pushed into 10.4. A slightly different patch was applied to 10.5. The latter should be merged upstream.
            igor Igor Babaev (Inactive) made changes -
            Fix Version/s 10.4.27 [ 28405 ]
            Fix Version/s 10.5.18 [ 28421 ]
            Fix Version/s 10.6.11 [ 28441 ]
            Fix Version/s 10.7.7 [ 28442 ]
            Fix Version/s 10.8.6 [ 28443 ]
            Fix Version/s 10.9.4 [ 28444 ]
            Fix Version/s 10.10.2 [ 28410 ]
            Fix Version/s 10.4 [ 22408 ]
            Resolution Fixed [ 1 ]
            Status Stalled [ 10000 ] Closed [ 6 ]
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            rob.schwyzer@mariadb.com Rob Schwyzer (Inactive) made changes -
            psergei Sergei Petrunia made changes -
            Johnston Rex Johnston made changes -
            mariadb-jira-automation Jira Automation (IT) made changes -
            Zendesk Related Tickets 201658 172577
            Zendesk active tickets 201658

            People

              igor Igor Babaev (Inactive)
              spachev Sasha Pachev
              Votes:
              2 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.