[MDEV-30078] SQL Layer support for: Use fewer InnoDB mini-transactions - Jira

Sergei Petrunia created issue - 2022-11-23 14:00

Sergei Petrunia made changes - 2022-11-23 14:00

Field	Original Value	New Value
Link		This issue relates to MDEV-16232 [ MDEV-16232 ]

Sergei Petrunia made changes - 2022-11-23 14:07

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** In general, do not access other tables, except maybe temp.tables

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
    modify or delete them during one BATCH_OF_OPERATIONS.
{code}

TODO
- Do we need to call end_operations_batch() in case of an error condition?

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** In general, do not access other tables, except maybe temp.tables

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
    modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Implementation considerations

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

* * Can use batch mode for single-table DELETE, provided no other tables are accessed
** the WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** there are no triggers (which could potentially do the same).

* Single-table UPDATEs are the same as DELETEs.

TODO
- Do we need to call end_operations_batch() in case of an error condition?

Sergei Golubchik made changes - 2022-11-25 17:51

Fix Version/s

10.12 [ 28320 ]

Sergei Golubchik made changes - 2022-11-25 17:51

Assignee

Sergei Petrunia [ psergey ]

Sergei Golubchik made changes - 2022-11-25 17:52

Link

This issue blocks MDEV-16232 [ MDEV-16232 ]

Sergei Golubchik made changes - 2022-11-25 17:52

Link

This issue relates to MDEV-16232 [ MDEV-16232 ]

Sergei Petrunia made changes - 2022-11-26 17:16

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** In general, do not access other tables, except maybe temp.tables

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
    modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Implementation considerations

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

* * Can use batch mode for single-table DELETE, provided no other tables are accessed
** the WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** there are no triggers (which could potentially do the same).

* Single-table UPDATEs are the same as DELETEs.

TODO
- Do we need to call end_operations_batch() in case of an error condition?

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

* * Can use batch mode for single-table DELETE, provided no other tables are accessed
** the WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** there are no triggers (which could potentially access the tables).

* Single-table UPDATEs are the same as DELETEs.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

TODO
- Do we need to call end_operations_batch() in case of an error condition?Control

Sergei Petrunia made changes - 2022-11-26 17:24

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

* * Can use batch mode for single-table DELETE, provided no other tables are accessed
** the WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** there are no triggers (which could potentially access the tables).

* Single-table UPDATEs are the same as DELETEs.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

TODO
- Do we need to call end_operations_batch() in case of an error condition?Control

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

* * Can use batch mode for single-table DELETE, provided no other tables are accessed
** the WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** there are no triggers (which could potentially access the tables).

* Single-table UPDATEs are the same as DELETEs.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Todo

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-26 17:49

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using one index.

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

* * Can use batch mode for single-table DELETE, provided no other tables are accessed
** the WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** there are no triggers (which could potentially access the tables).

* Single-table UPDATEs are the same as DELETEs.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Todo

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Todo

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 09:30

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Todo

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Todo

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:16

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Todo

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:20

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:22

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:24

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

* Do not use batch mode if we use index_merge.
* Do not use batch mode if we use Multi-Range-Read.

** The WHERE condition doesn't access other tables (that is, doesn't use subqueries)
** For UPDATE/DELETEs: the modified table has no triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:25

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.
*
h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:26

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.
*
h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:42

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

(Note: current patch implements this. It doesn't have mysql-test coverage, though.)

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. It has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:51

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. It has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-27 16:55

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch is in this branch: {{10.9-MDEV-16232}}, this commit:
{code}
commit 92665b862b479f6582e3a15247bfc812e59e7e34
Author: Sergei Petrunia <sergey@mariadb.com>
Date: Wed Jun 8 15:43:06 2022 +0300

    MDEV-16232: Use fewer mini-transactions

    SQL layer support part #2. The main idea is:
{code}
It implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Sergei Petrunia made changes - 2022-11-29 13:07

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->start_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch is in this branch: {{10.9-MDEV-16232}}, this commit:
{code}
commit 92665b862b479f6582e3a15247bfc812e59e7e34
Author: Sergei Petrunia <sergey@mariadb.com>
Date: Wed Jun 8 15:43:06 2022 +0300

    MDEV-16232: Use fewer mini-transactions

    SQL layer support part #2. The main idea is:
{code}
It implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->end_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch is in this branch: {{10.9-MDEV-16232}}, this commit:
{code}
commit 92665b862b479f6582e3a15247bfc812e59e7e34
Author: Sergei Petrunia <sergey@mariadb.com>
Date: Wed Jun 8 15:43:06 2022 +0300

    MDEV-16232: Use fewer mini-transactions

    SQL layer support part #2. The main idea is:
{code}
It implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Julien Fritsch made changes - 2022-12-05 17:08

Priority

Major [ 3 ]

Critical [ 2 ]

Sergei Petrunia made changes - 2022-12-26 07:27

Assignee

Sergei Petrunia [ psergey ]

Oleg Smirnov [ JIRAUSER50405 ]

Oleg Smirnov made changes - 2022-12-26 13:48

Status

Open [ 1 ]

In Progress [ 3 ]

Michael Widenius made changes - 2023-01-03 16:23

Description

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->end_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

Batching cannot be used if one of the following holds:

* We use index_merge.
* We use Multi-Range-Read.

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch is in this branch: {{10.9-MDEV-16232}}, this commit:
{code}
commit 92665b862b479f6582e3a15247bfc812e59e7e34
Author: Sergei Petrunia <sergey@mariadb.com>
Date: Wed Jun 8 15:43:06 2022 +0300

    MDEV-16232: Use fewer mini-transactions

    SQL layer support part #2. The main idea is:
{code}
It implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

in MDEV-16232, InnoDB is implementing an optimization to cut down on latch acquisitions/releases and other kinds of overhead when the query is about to perform multiple operations on the [index] records on the same page.

In the Storage engine API, this translates into the following calls:

{code}
  h->start_operations_batch(); // (1)
  BATCH_OF_OPERATIONS; // (2)
  h->end_operations_batch(); // (3)
{code}

InnoDB will hold internal page latches (and something else?) between calls (1) and (2),
so the SQL layer must guarantee the following properties of the code that may execute in BATCH_OF_OPERATIONS:

{code}
* Access only one InnoDB table (further "The Table").
** Also, do not access other kinds of tables, except maybe temporary (work) tables.

* Access The Table using only one index
** But non-index-only scan on the secondary index is allowed

* * It is allowed (even intended) the we read row(s) and then
   modify or delete them during one BATCH_OF_OPERATIONS.
{code}

h2. Applicability

The mini-transaction calls can be put around all loops where we call 'read next' repeatedly for just one table.

The places in the select/filesort code where this happens:

* When using only one table InnoDB table (it's ok mixing InnoDB tables and other tables).
**There can be any number of const tables as these are only called once before accessing any other tables.
* The last table in the join tabs.
* * This is calling read-next repeatedly until there are no more matching rows.
* The last table just before a table with a join cache
** This is calling read-next repeatedly until there are no more matching rows. All read rows are put into the join cache (until join cache is full, in which case we have to end the mini transaction).
* When scanning a table in filesort
* In multi_range_read around each range
* In index_merge when finding rowid's for one table

Batching cannot be used if one of the following holds:

* We use index_merge (?).
* We use Multi-Range-Read (?)

* The WHERE condition accesses other tables (that is, uses subqueries)
* For UPDATE/DELETEs: the modified table has triggers (which could potentially access other tables)
* For UPDATE/DELETEs: there are foreign keys that depend on this table.

h2. Where to put the batching calls

h3. Step #1
Single-Table UPDATE and DELETE that satisfy the Applicability criteria should execute the whole query in one batch.
The same goes for single-table SELECT: It can start a batch when select starts and end it when it ends.

This means we will add just two
{code:cpp}
if (table does batching)
{ ... }
{code}
onto the main execution path.

h4. The prototype patch
The prototype patch is in this branch: {{10.9-MDEV-16232}}, this commit:
{code}
commit 92665b862b479f6582e3a15247bfc812e59e7e34
Author: Sergei Petrunia <sergey@mariadb.com>
Date: Wed Jun 8 15:43:06 2022 +0300

    MDEV-16232: Use fewer mini-transactions

    SQL layer support part #2. The main idea is:
{code}
It implements the check for UPDATE and DELETE. It doesn't have the applicability check (it has some hack). Or mysql-test coverage. Because of that it can cause crashes:

https://jira.mariadb.org/browse/MDEV-16232?focusedCommentId=242090&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-242090

The prototype patch has some hack to handle single-table selects.

h3. Step #2
A multi-table select may use batches for the last table in the join. Access to the last table must satisfy the Applicability criteria.

Can we avoid calling "if (table does batching)" for every row examined for any table?
It's tempting to use special functions in READ_RECORD. This will allow to avoid any overhead. (This is for starting the batch, what about ending it? Especially if the SELECT in question is a subquery with LIMIT?)

Note: the prototype patch doesn't handle this at all.

h2. Visibility

Need some way to see if the optimization is applied.

This is probably a too fine detail to show in EXPLAIN output.
Let's have a counter instead. It will count the number batches (that is, number of start_operations_batch() calls).

We should also write about the decision whether batching is to be used into the optimizer trace.

h2. User Control
It's a good idea to be able to switch the optimization on and off.
The switch should be at the SQL layer.

h2. Error handling

- Do we need to call end_operations_batch() in case of an error condition? (A: Yes)

Oleg Smirnov made changes - 2023-02-09 09:31

Status

In Progress [ 3 ]

Stalled [ 10000 ]

Sergei Golubchik made changes - 2023-04-27 12:55

Fix Version/s		11.2 [ 28603 ]
Fix Version/s	11.0 [ 28320 ]

Julien Fritsch made changes - 2023-05-15 08:16

Link

This issue is blocked by MDEV-16232 [ MDEV-16232 ]

Julien Fritsch made changes - 2023-05-15 08:16

Link

This issue blocks MDEV-16232 [ MDEV-16232 ]

Ralf Gebhardt made changes - 2023-07-25 19:44

Fix Version/s		11.3 [ 28565 ]
Fix Version/s	11.2 [ 28603 ]

Oleg Smirnov made changes - 2023-08-15 14:41

Priority

Critical [ 2 ]

Major [ 3 ]

Sergei Golubchik made changes - 2023-09-17 18:03

Fix Version/s		11.4 [ 29301 ]
Fix Version/s	11.3 [ 28565 ]

Julien Fritsch made changes - 2023-11-30 16:29

Issue Type

Task [ 3 ]

New Feature [ 2 ]

Sergei Golubchik made changes - 2023-12-22 17:45

Fix Version/s		11.5 [ 29506 ]
Fix Version/s	11.4 [ 29301 ]

Sergei Golubchik made changes - 2024-03-19 18:32

Fix Version/s		11.6 [ 29515 ]
Fix Version/s	11.5 [ 29506 ]

Ralf Gebhardt made changes - 2024-06-21 09:21

Fix Version/s		11.7 [ 29815 ]
Fix Version/s	11.6 [ 29515 ]

Oleg Smirnov made changes - 2024-09-06 06:19

Priority

Major [ 3 ]

Minor [ 4 ]

Sergei Golubchik made changes - 2024-09-24 13:53

Fix Version/s		11.8 [ 29921 ]
Fix Version/s	11.7 [ 29815 ]

Sergei Golubchik made changes - 2024-11-21 11:27

Fix Version/s

11.8 [ 29921 ]

MariaDB Server

SQL Layer support for: Use fewer InnoDB mini-transactions

Details

Description

Applicability

Where to put the batching calls

Step #1

The prototype patch

Step #2

Visibility

User Control

Error handling

Attachments

Issue Links

Activity

People

Dates

Git Integration