Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-23704

Optimizer does not use semijoin optimization for some WHERE (pk1, pk2, pk3) IN ((1,2,3), ...) queries

Details

    Description

      For query like this:

      select * from t where (PK1, PK2, PK3) in ((v1,v2,v3), (v4,v5,v6), ...)
      

      with long enough list of tuples for multi-column PK values there are 3 possible plans presented by quotes from ANALYZE FORMAT=JSON outputs):

      1. Range scan, can be forced in some cases with FORCE INDEX(PRIMARY):

      {
        "query_block": {
          "select_id": 1,
          "r_loops": 1,
          "r_total_time_ms": 19.184,
          "table": {
            "table_name": "t",
            "access_type": "range",
            "possible_keys": ["PRIMARY"],
            "key": "PRIMARY",
            "key_length": "74",
            "used_key_parts": [
              "PK1",
              "PK2",
              "PK3"
            ],
            "r_loops": 1,
            "rows": 1000,
            "r_rows": 1000,
            "r_total_time_ms": 15.165,
            "filtered": 100,
            "r_filtered": 100,
            "attached_condition": "(t.PK1,t.PK2,t.PK3) in (<cache>((349,'*********','01')),<cache>((349,'*********','01')),)"
          }
        }
      }
      

      2. For the same IN list in other environment we end up with full table scan that is very slow:

      {
        "query_block": {
          "select_id": 1,
          "r_loops": 1,
          "r_total_time_ms": 51060,
          "table": {
            "table_name": "t",
            "access_type": "ALL",
            "possible_keys": [
              "PRIMARY",
               ...
            ],
            "r_loops": 1,
            "rows": 7957643,
            "r_rows": 7.96e6,
            "r_total_time_ms": 48324,
            "filtered": 100,
            "r_filtered": 0.0126,
            "attached_condition": "(t.PK1,t.PK2,t.PK3) in (<cache>((***,'*******','01')),<cache>((***,'*******','01')),.......)"
          }
        }
      }
      

      3. Finally in other environment with very similar data for the same IN list we end up with semijoin optimization applied:

      {
        "query_block": {
          "select_id": 1,
          "r_loops": 1,
          "r_total_time_ms": 11.565,
          "duplicates_removal": {
            "table": {
              "table_name": "<derived3>",
              "access_type": "ALL",
              "r_loops": 1,
              "rows": 1000,
              "r_rows": 1000,
              "r_total_time_ms": 0.1065,
              "filtered": 100,
              "r_filtered": 100,
              "materialized": {
                "query_block": {
                  "union_result": {
                    "table_name": "<unit3>",
                    "access_type": "ALL",
                    "r_loops": 0,
                    "r_rows": null,
                    "query_specifications": [
                      {
                        "query_block": {
                          "select_id": 3,
                          "table": {
                            "message": "No tables used"
                          }
                        }
                      }
                    ]
                  }
                }
              }
            },
            "table": {
              "table_name": "t",
              "access_type": "eq_ref",
              "possible_keys": [
                "PRIMARY",
              ...
              ],
              "key": "PRIMARY",
              "key_length": "74",
              "used_key_parts": [
                "PK1",
                "PK2",
                "PK3"
              ],
              "ref": ["tvc_0._col_1", "func", "func"],
              "r_loops": 1000,
              "rows": 1,
              "r_rows": 0.998,
              "r_total_time_ms": 7.7333,
              "filtered": 100,
              "r_filtered": 100,
              "attached_condition": "t.PK2 = convert(tvc_0._col_2 using utf8) and t.PK3 = convert(tvc_0._col_3 using utf8)"
            }
          }
        }
      }
      

      This plan is the fastest even comparing to range join, but there is no way to force it.

      So, I think there is a bug when semijoin optimization is not used even when "range" is extended to full table scan as an alternative. I'd also like to have a way to force the best plan.

      Attachments

        Issue Links

          Activity

            MDEV-21265 is about IN-predicate-to-subquery conversion handling a broader set of column/value datatypes.

            psergei Sergei Petrunia added a comment - MDEV-21265 is about IN-predicate-to-subquery conversion handling a broader set of column/value datatypes.

            Ok, so conversion to IN-subquery didn't work. But why does the optimizer use full table scan?

            Consider this case:

            create table t100 (
              pk1 int,
              pk2 varchar(10) collate utf8mb4_general_ci,
              pk3 varchar(10),
              filler varchar(100),
              PRIMARY KEY (pk1,pk2,pk3),
              KEY pk2 (pk2),
              KEY pk1 (pk1),
              KEY pk3 (pk3)
            ) engine=innodb;
            

            All indexes are essential.
            ... insert 100K rows.

            explain select * from t100
            where (pk1, pk2, pk3) in 
            ((1, '12345', '1'),
            (2, '12345', '1'),
            ...
            (999, '12345', '1'),
            (1000, '12345', '1'));
            

            produces (on 10.3.24):

            id      select_type     table   type    possible_keys   key     key_len ref     rows    Extra
            1       SIMPLE  t100    ALL     PRIMARY,pk2,pk1,pk3     NULL    NULL    NULL    90342   Using where
            

            This happens, because the range optimizer reaches SEL_ARG::MAX_SEL_ARGS (=16000) limitation and aborts.

            A naive logic: 1000-elements IN-list should produce ~3K SEL_ARG objects for the PK and ~1K objects for the 3 other indexes, which gives in total 3+1=4K objects.
            This is not what is happening apparently. I am not sure if this a bug or some fundamental limitation.

            psergei Sergei Petrunia added a comment - Ok, so conversion to IN-subquery didn't work. But why does the optimizer use full table scan? Consider this case: create table t100 ( pk1 int , pk2 varchar (10) collate utf8mb4_general_ci, pk3 varchar (10), filler varchar (100), PRIMARY KEY (pk1,pk2,pk3), KEY pk2 (pk2), KEY pk1 (pk1), KEY pk3 (pk3) ) engine=innodb; All indexes are essential. ... insert 100K rows. explain select * from t100 where (pk1, pk2, pk3) in ((1, '12345' , '1' ), (2, '12345' , '1' ), ... (999, '12345' , '1' ), (1000, '12345' , '1' )); produces (on 10.3.24): id select_type table type possible_keys key key_len ref rows Extra 1 SIMPLE t100 ALL PRIMARY,pk2,pk1,pk3 NULL NULL NULL 90342 Using where This happens, because the range optimizer reaches SEL_ARG::MAX_SEL_ARGS (=16000) limitation and aborts. A naive logic: 1000-elements IN-list should produce ~3K SEL_ARG objects for the PK and ~1K objects for the 3 other indexes, which gives in total 3+1=4K objects. This is not what is happening apparently. I am not sure if this a bug or some fundamental limitation.

            Testcase demonstrating the above: a2.test

            psergei Sergei Petrunia added a comment - Testcase demonstrating the above: a2.test
            psergei Sergei Petrunia added a comment - - edited

            Re-trying with the customer's table definition (which has multiple indexes).

            Number of SEL_ARGS alloced (note this is as counted in PARAM::alloced_sel_args, so it less than the actual number of objects allocated).

            IN-list size	PARAM::alloced_sel_args
            100	2065
            200	4165
            300	6265
            400	8365
            500	10465
            600	12565
            700	14665
            800	16022
            900	16022
            

            It's high but it's linear.

            psergei Sergei Petrunia added a comment - - edited Re-trying with the customer's table definition (which has multiple indexes). Number of SEL_ARGS alloced (note this is as counted in PARAM::alloced_sel_args, so it less than the actual number of objects allocated). IN-list size PARAM::alloced_sel_args 100 2065 200 4165 300 6265 400 8365 500 10465 600 12565 700 14665 800 16022 900 16022 It's high but it's linear.

            This is an optimizer deficiency but I won't consider this as a bug.

            MariaDB's answer to very large IN-lists is IN-to-subquery conversion.

            That conversion is currently limited in datatype support (as described in the linked MDEVs). This can and will be fixed as part of those MDEVs.

            psergei Sergei Petrunia added a comment - This is an optimizer deficiency but I won't consider this as a bug. MariaDB's answer to very large IN-lists is IN-to-subquery conversion. That conversion is currently limited in datatype support (as described in the linked MDEVs). This can and will be fixed as part of those MDEVs.

            People

              psergei Sergei Petrunia
              valerii Valerii Kravchuk
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.