MariaDB Server / MDEV-16402

Support Index Condition Pushdown for clustered PK scans

Details

    Description

      The following test will lock 3 records and their preceding gaps inside InnoDB, even though only the first 2 records match the condition id<=20:

      --source include/have_innodb.inc
      CREATE TABLE t1(id INT PRIMARY KEY, c INT NOT NULL) ENGINE=InnoDB;
      INSERT INTO t1 VALUES(10,10),(20,20),(30,30),(40,40);
      SELECT * FROM t1 WHERE id<=20 FOR UPDATE;
      DROP TABLE t1;
      

      InnoDB cannot be blamed for this, because the SQL layer is not invoking ha_innobase::idx_cond_push(), so prebuilt->idx_cond will be NULL and InnoDB cannot know that the id=30 record does not match the WHERE condition.

      To confirm this:

      break ha_innobase::create
      continue
      finish
      break lock_rec_lock
      break ha_innobase::delete_table
      continue
      

      There will be 3 calls to lock_rec_lock() before the DROP TABLE. There should be only 2 calls, with type=LOCK_ORDINARY to lock the keys id=10,id=20 and the gaps preceding them.

      Another reason to enable index condition pushdown (ICP) for table scans (or PRIMARY KEY scans) is that if the query returns BLOBs, it is much more efficient to avoid copying and then discarding the BLOBs of non-matching rows.
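The locking difference described in this report can be sketched with a small simulation (illustrative Python, not InnoDB code; the scan model and lock bookkeeping are simplified assumptions, and gap locks are ignored): without a pushed condition, the engine must lock every record it reads, including the first non-matching one, while with ICP the condition is evaluated inside the engine before any lock is taken.

```python
# Toy model of a locking ascending scan over the PRIMARY KEY values,
# with and without a pushed index condition. Not InnoDB code.

def scan_for_update(keys, cond, icp=False):
    """Return (matching_keys, locked_keys) for a locking ascending scan.

    The scan stops after the first key that fails the condition,
    mirroring how the server stops fetching rows once a row fails
    a range condition on the scanned index.
    """
    matched, locked = [], []
    for key in sorted(keys):
        if icp and not cond(key):
            # ICP: condition is checked inside the engine, before locking.
            break
        locked.append(key)           # engine acquires a record lock
        if cond(key):
            matched.append(key)      # server-level filter passes
        else:
            break                    # server stops the scan; lock already taken
    return matched, locked

keys = [10, 20, 30, 40]
cond = lambda k: k <= 20

# Without ICP: id=30 is locked even though it does not match.
assert scan_for_update(keys, cond, icp=False) == ([10, 20], [10, 20, 30])
# With ICP: only the two matching records are locked.
assert scan_for_update(keys, cond, icp=True) == ([10, 20], [10, 20])
```

Both variants return the same result set; only the set of acquired locks differs, which is exactly the behaviour observed via lock_rec_lock() above.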

        Activity

            For the record, I also tried the following single-key lookups:

            SELECT * FROM t1 WHERE id=20 FOR UPDATE;
            SELECT * FROM t1 WHERE id=21 FOR UPDATE;
            

            The first one would correctly only acquire a LOCK_REC_NOT_GAP lock on the only matching record id=20, and the second one would only acquire a LOCK_GAP on the gap that protects the non-matching key, attached to the successor record id=30.
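For reference, the three record-lock modes mentioned in this thread can be summarized with an illustrative sketch (not InnoDB code; the supremum record and secondary-index cases are ignored):

```python
# Illustrative mapping of InnoDB record-lock modes to unique-key lookups.
LOCK_ORDINARY = 'next-key: record + preceding gap'
LOCK_REC_NOT_GAP = 'record only'
LOCK_GAP = 'preceding gap only'

def unique_lookup_lock(keys, searched):
    """Lock mode for SELECT ... WHERE pk = searched FOR UPDATE (sketch)."""
    if searched in keys:
        # Match: lock just the record, no gap needed for a unique equality.
        return (searched, LOCK_REC_NOT_GAP)
    # No match: lock the gap before the successor record, attached to it,
    # so that no other transaction can insert the searched key.
    successor = min(k for k in keys if k > searched)
    return (successor, LOCK_GAP)

keys = [10, 20, 30, 40]
assert unique_lookup_lock(keys, 20) == (20, LOCK_REC_NOT_GAP)
assert unique_lookup_lock(keys, 21) == (30, LOCK_GAP)
```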

            I believe that ICP mostly helps with range or table scans.

            With the following test (UNIQUE KEY without NOT NULL), index condition pushdown will be used, and only the secondary index records (id=10,DB_ROW_ID=512),(id=20,DB_ROW_ID=513) (and preceding gaps) will be locked, interleaved with requests to lock the clustered index records (but not preceding gaps) (DB_ROW_ID=512),(DB_ROW_ID=513). There is no request to lock id=30, so the ICP is working as it should.

            --source include/have_innodb.inc
            CREATE TABLE t1(id INT UNIQUE, c INT) ENGINE=InnoDB;
            INSERT INTO t1 VALUES(10,10),(20,20),(30,30),(40,40);
            SELECT * FROM t1 WHERE id<=20 FOR UPDATE;
            DROP TABLE t1;
            

            After the last clustered index record has been locked, there is a redundant call to lock (id=20,DB_ROW_ID=513) and the preceding gap (the lock already exists). That is a minor InnoDB bug.

            marko Marko Mäkelä added a comment

            InnoDB cannot be blamed for this, because the SQL layer is not invoking ha_innobase::idx_cond_push(), so prebuilt->idx_cond will be NULL and InnoDB cannot know that the id=30 record does not match the WHERE condition.

            The SQL layer informs InnoDB that the record with id=30 does not match the WHERE condition by calling ::unlock_row(). This is how it is called for the row with id=30:

              Breakpoint 11, ha_innobase::unlock_row (this=0x7fff8803a028) at /home/psergey/dev-git/10.3-r2/storage/innobase/handler/ha_innodb.cc:9030
            (gdb) x/4xc table->field[0].ptr
              0x7fff8800ea59:	30 '\036'	0 '\000'	0 '\000'	0 '\000'
            

            (I am debugging the SELECT * FROM t1 WHERE id<=20 FOR UPDATE; query.)

            psergei Sergei Petrunia added a comment

            It is not difficult to get the SQL layer to call cond_push() for the clustered PK, too.

            However, if InnoDB acknowledges that index condition pushdown has happened, it must make sure that it does not return records that fail the index condition. marko, will this happen automatically, or does one need to make some changes inside storage/innobase for this to happen?

            psergei Sergei Petrunia added a comment

            Yes, unlock_row() would help a little, but it would be even better to not lock the record at all in the first place.
            Yes, InnoDB would filter every record against the index condition if one is available. But I see that it is unnecessarily locking records before evaluating the condition, and then unlocking them afterwards:

            	switch (row_search_idx_cond_check(buf, prebuilt, rec, offsets)) {
            	case ICP_NO_MATCH:
            		if (did_semi_consistent_read) {
            			row_unlock_for_mysql(prebuilt, TRUE);
            		}
            		goto next_rec;
            

            There are 3 calls to row_search_idx_cond_check() on different code paths: looking up a primary key (after using the adaptive hash index), searching in a secondary index, and the one above for validating a primary key record in the general case. This last call should or could be moved before lock acquisition in a follow-up InnoDB fix to improve performance further.

            marko Marko Mäkelä added a comment
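The suggested reordering can be sketched with a toy model (not the actual row0sel.cc logic; it unconditionally undoes the lock on ICP_NO_MATCH, whereas the real code does so only under the did_semi_consistent_read guard): evaluating the pushed condition before acquiring the record lock removes the lock/unlock churn entirely.

```python
# Toy comparison of lock-then-check vs. check-then-lock. Not InnoDB code.

def scan_lock_then_check(rows, cond):
    """Current order: lock each record, then evaluate the pushed condition."""
    ops = []                           # trace of locking operations
    for r in rows:
        ops.append(('lock', r))        # lock acquired first ...
        if not cond(r):
            ops.append(('unlock', r))  # ... then undone on ICP_NO_MATCH
                                       # (simplification: see lead-in)
    return ops

def scan_check_then_lock(rows, cond):
    """Proposed order: evaluate the pushed condition, lock only on a match."""
    ops = []
    for r in rows:
        if cond(r):
            ops.append(('lock', r))
    return ops

rows, cond = [10, 20, 30, 40], lambda r: r <= 20
assert scan_lock_then_check(rows, cond) == [
    ('lock', 10), ('lock', 20),
    ('lock', 30), ('unlock', 30), ('lock', 40), ('unlock', 40)]
assert scan_check_then_lock(rows, cond) == [('lock', 10), ('lock', 20)]
```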
            marko Marko Mäkelä added a comment - edited

            The index condition pushdown was initially enabled on all indexes in MySQL, but it was disabled for the PRIMARY KEY due to a performance regression in sysbench.

            Several of the queries in sysbench have a WHERE condition that the optimizer uses for executing these queries as range scans. The upper and lower limit of the range scan will ensure that the WHERE condition is fulfilled. […] upper and lower limits ensure that the WHERE condition is fulfilled, the pushed index condition will not filter out any records.

            Unfortunately, InnoDB does not currently make use of the range limits, so it is unnecessarily locking (and possibly later unlocking) too many records.
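The point about range limits can be illustrated with a toy model (assumed behaviour, not server code): when the range endpoints already imply the WHERE condition, the pushed condition is evaluated once per visited row but never filters anything, so it is pure overhead.

```python
# Toy range scan over [lo, hi] with an optional pushed condition.
# Not server code; illustrates why a pushed condition that is implied
# by the range limits costs evaluations without filtering any rows.

def range_scan(keys, lo, hi, pushed_cond=None):
    """Return (rows, number_of_pushed_condition_evaluations)."""
    out, evaluated = [], 0
    for k in sorted(keys):
        if k < lo:
            continue                  # positioned below the range start
        if k > hi:
            break                     # range end: the scan stops here anyway
        if pushed_cond is not None:
            evaluated += 1            # pushed condition checked per row
            if not pushed_cond(k):
                continue              # never taken when the range implies it
        out.append(k)
    return out, evaluated

keys = [10, 20, 30, 40]
# Range limits alone already produce the correct result, zero evaluations.
assert range_scan(keys, 10, 20) == ([10, 20], 0)
# Pushing WHERE 10<=id<=20 adds one evaluation per row, filters nothing.
assert range_scan(keys, 10, 20, lambda k: 10 <= k <= 20) == ([10, 20], 2)
```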

            marko Marko Mäkelä added a comment - edited

            In MDEV-28747, oleg.smirnov reported experimenting with this. The MySQL commit message mentions WL#6061, which is hidden from public view.

            The commit also mentions "several of the queries in sysbench". Which queries might those be? Would it be possible to avoid pushing down WHERE conditions that never actually filter anything? Could MDEV-16188 already serve those queries, making the MySQL commit redundant? If not, could removing some InnoDB prefetch logic in MDEV-16232 fix the performance problem?


            The code change that was mentioned in a MDEV-28747 comment would correspond to the following patch for 10.5 or later:

            diff --git a/sql/opt_index_cond_pushdown.cc b/sql/opt_index_cond_pushdown.cc
            index 6a24fa95b68..c35345be5d5 100644
            --- a/sql/opt_index_cond_pushdown.cc
            +++ b/sql/opt_index_cond_pushdown.cc
            @@ -340,7 +340,7 @@ void push_index_cond(JOIN_TAB *tab, uint keyno)
                   tab->join->thd->lex->sql_command != SQLCOM_UPDATE_MULTI &&
                   tab->join->thd->lex->sql_command != SQLCOM_DELETE_MULTI &&
                   tab->type != JT_CONST && tab->type != JT_SYSTEM &&
            -      !tab->table->file->is_clustering_key(keyno)) // 6
            +      true)
               {
                 DBUG_EXECUTE("where",
                              print_where(tab->select_cond, "full cond", QT_ORDINARY););
            

            marko Marko Mäkelä added a comment

            It makes sense to implement MDEV-16232 first. After that, the only remaining value of this optimization might be that we can avoid fetching BLOBs for rows that do not match the condition.

            marko Marko Mäkelä added a comment

            People

              psergei Sergei Petrunia
              marko Marko Mäkelä
              Votes: 1
              Watchers: 9
