[MDEV-406] ANALYZE $stmt - Jira

Sergei Petrunia created issue - 2012-07-20 13:04

Sergei Petrunia added a comment - 2012-07-20 14:39

test=# explain select count from one_k where ones < 10;
QUERY PLAN
-------------------------------------------------------------
Aggregate (cost=17.53..17.54 rows=1 width=0)
-> Seq Scan on one_k (cost=0.00..17.50 rows=11 width=0)
Filter: (ones < 10)
(3 rows)

test=# explain analyze select count from one_k where ones < 10;
QUERY PLAN
--------------------------------------------------------------------------------------------------------
Aggregate (cost=17.53..17.54 rows=1 width=0) (actual time=1.449..1.451 rows=1 loops=1)
-> Seq Scan on one_k (cost=0.00..17.50 rows=11 width=0) (actual time=0.037..1.406 rows=10 loops=1)
Filter: (ones < 10)
Total runtime: 1.560 ms
(4 rows)

Sergei Petrunia added a comment - 2012-07-20 14:39 test=# explain select count from one_k where ones < 10; QUERY PLAN ------------------------------------------------------------- Aggregate (cost=17.53..17.54 rows=1 width=0) -> Seq Scan on one_k (cost=0.00..17.50 rows=11 width=0) Filter: (ones < 10) (3 rows) test=# explain analyze select count from one_k where ones < 10; QUERY PLAN -------------------------------------------------------------------------------------------------------- Aggregate (cost=17.53..17.54 rows=1 width=0) (actual time=1.449..1.451 rows=1 loops=1) -> Seq Scan on one_k (cost=0.00..17.50 rows=11 width=0) (actual time=0.037..1.406 rows=10 loops=1) Filter: (ones < 10) Total runtime: 1.560 ms (4 rows)

Sergei Petrunia added a comment - 2012-07-20 14:40

^^ Example from PostgreSQL

Sergei Petrunia added a comment - 2012-07-20 14:40 ^^ Example from PostgreSQL

Sergei Petrunia made changes - 2012-07-20 15:21

Field	Original Value	New Value
Description	In other databases, EXPLAIN ANALYZE works as follows: - It runs the select normally, discarding its output - Instead, it produces EXPLAIN's output, but cost/#rows estimates are accompanied with actual numbers measured during execution. SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE.	In other databases, EXPLAIN ANALYZE works as follows: - It runs the select normally, discarding its output - Instead, it produces EXPLAIN's output, but cost/#rows estimates are accompanied with actual numbers measured during execution. SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE. == Interface == - The parser should support EXPLAIN ANALYZE syntax - EXPLAIN will produce extra columns: - after "rows" there will be "real_rows" - after "filtered" there will be "real_filtered" - there will be 'loops' column, which will tell how many times a scan was performed. == Implementation == (TODO: where do we save the flag that this is EXPLAIN_ANALYZE ? UNCACHEABLE_EXPLAIN is a bad one, because it is in every subselect... There's LEX::describe, which is a bitmap DESCRIBE{NORMAL\|EXTENDED\|PARTITIONS}, but it is checked in many places. We need a flag next to LEX::describe) EXPLAIN ANALYZE should be run, generally, like a regular select (and not like an EXPLAIN). The differences are: - need to prevent it from sending SELECT's column list to the client - need to prevent it from sending SELECT's data to the client - need to have individual counters for each table. We've got Handler counters and userstat counters already. Isn't it too much, perhaps we could - have one counter that we increment during the query - at query end, 'distribute' the increments to Handler_xxx, userstat, etc. - need to save each JOIN's plan before it is deleted (we need to save it 'late', so that we can get query plan + actual counter values) TODO: what/how to count for range_checked_for_each_record?

Sergei Petrunia made changes - 2012-07-20 15:24

Description

In other databases, EXPLAIN ANALYZE works as follows:
- It runs the select normally, discarding its output
- Instead, it produces EXPLAIN's output, but cost/#rows estimates are
  accompanied with actual numbers measured during execution.

SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE.

== Interface ==

- The parser should support EXPLAIN ANALYZE syntax

- EXPLAIN will produce extra columns:

- after "rows" there will be "real_rows"
- after "filtered" there will be "real_filtered"
- there will be 'loops' column, which will tell how many times a scan was
   performed.

== Implementation ==

(TODO: where do we save the flag that this is EXPLAIN_ANALYZE ?
UNCACHEABLE_EXPLAIN is a bad one, because it is in every subselect...
There's LEX::describe, which is a bitmap DESCRIBE{NORMAL|EXTENDED|PARTITIONS},
but it is checked in many places. We need a flag next to LEX::describe)

EXPLAIN ANALYZE should be run, generally, like a regular select (and not like
an EXPLAIN). The differences are:

- need to prevent it from sending SELECT's column list to the client
- need to prevent it from sending SELECT's data to the client

- need to have individual counters for each table.
  We've got Handler counters and userstat counters already. Isn't it too much,
  perhaps we could
   - have one counter that we increment during the query
   - at query end, 'distribute' the increments to Handler_xxx, userstat, etc.

- need to save each JOIN's plan before it is deleted (we need to save it
  'late', so that we can get query plan + actual counter values)

TODO: what/how to count for range_checked_for_each_record?

In other databases, EXPLAIN ANALYZE works as follows:
- It runs the select normally, discarding its output
- Instead, it produces EXPLAIN's output, but cost/#rows estimates are
  accompanied with actual numbers measured during execution.

SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE.

== Interface ==

- The parser should support EXPLAIN ANALYZE syntax

- EXPLAIN will produce extra columns:

- after "rows" there will be "real_rows"
- after "filtered" there will be "real_filtered"
- there will be 'loops' column, which will tell how many times a scan was
   performed.

== Implementation ==

(TODO: where do we save the flag that this is EXPLAIN_ANALYZE ?
UNCACHEABLE_EXPLAIN is a bad one, because it is in every subselect...
There's LEX::describe, which is a bitmap DESCRIBE{NORMAL|EXTENDED|PARTITIONS},
but it is checked in many places. We need a flag next to LEX::describe)

EXPLAIN ANALYZE should be run, generally, like a regular select (and not like
an EXPLAIN). The differences are:

- need to prevent it from sending SELECT's column list to the client
- need to prevent it from sending SELECT's data to the client

- need to have individual counters for each table.
  We've got Handler counters and userstat counters already. Isn't it too much,
  perhaps we could
   -- have one counter that we increment during the query
   -- at query end, 'distribute' the increments to Handler_xxx, userstat, etc.

- need to save each JOIN's plan before it is deleted (we need to save it
  'late', so that we can get query plan + actual counter values)

TODO: what/how to count for range_checked_for_each_record?

Sergei Petrunia made changes - 2012-07-20 15:26

Link

This issue relates to ~~MDEV-407~~ [ ~~MDEV-407~~ ]

Sergei Petrunia added a comment - 2012-09-02 12:28

When we have SHOW EXPLAIN, the natural way to save JOIN's plan is to produce a part of EXPLAIN output that describes the join.
The problem is, JOINs are optimized/executed in arbitrary order, and we want EXPLAIN ANALYZE to output to list joins in the same order EXPLAIN would list them (or at least in similar order, with "parents before children" etc).

This means, we need to save what JOIN::print_explain has produced, and then replay it back.

Sergei Petrunia added a comment - 2012-09-02 12:28 When we have SHOW EXPLAIN, the natural way to save JOIN's plan is to produce a part of EXPLAIN output that describes the join. The problem is, JOINs are optimized/executed in arbitrary order, and we want EXPLAIN ANALYZE to output to list joins in the same order EXPLAIN would list them (or at least in similar order, with "parents before children" etc). This means, we need to save what JOIN::print_explain has produced, and then replay it back.

Sergei Petrunia added a comment - 2012-09-02 12:33

The code has Protocol_local, however

that code was added by Kostja when he was backporting something from Online Backup code, and "SP OUT parameters"
currently the code is not used ( running a testsuite with DBUG_ASSERT(0) in Protocol_local::Protocol_local doesn't produce anythting)
When I read the code, I see Protocol_local::store_XXX() methods, which seem to store various types in internal buffers in a certain data format. However, I dont see any code that would try to read the stored data back!

Sergei Petrunia added a comment - 2012-09-02 12:33 The code has Protocol_local, however that code was added by Kostja when he was backporting something from Online Backup code, and "SP OUT parameters" currently the code is not used ( running a testsuite with DBUG_ASSERT(0) in Protocol_local::Protocol_local doesn't produce anythting) When I read the code, I see Protocol_local::store_XXX() methods, which seem to store various types in internal buffers in a certain data format. However, I dont see any code that would try to read the stored data back!

Sergei Petrunia added a comment - 2012-09-02 13:20

Another possible option is to use select_result_explain_buffer from the pre-review SHOW EXPLAIN code. However, that buffer relies on class Protocol to serialize the data

Sergei Petrunia added a comment - 2012-09-02 13:20 Another possible option is to use select_result_explain_buffer from the pre-review SHOW EXPLAIN code. However, that buffer relies on class Protocol to serialize the data

Sergei Petrunia added a comment - 2012-09-02 13:23

.. which is ok, because the real select output is suppressed, so EXPLAIN ANALYZE can use thd->protocol for its own purposes.

Sergei Petrunia added a comment - 2012-09-02 13:23 .. which is ok, because the real select output is suppressed, so EXPLAIN ANALYZE can use thd->protocol for its own purposes.

Sergei Petrunia made changes - 2012-09-03 19:09

Assignee

Sergei Petrunia [ psergey ]

Sergei Petrunia made changes - 2012-09-03 19:09

Status

Open [ 1 ]

In Progress [ 3 ]

Sergei Petrunia made changes - 2012-09-03 19:11

Description

In other databases, EXPLAIN ANALYZE works as follows:
- It runs the select normally, discarding its output
- Instead, it produces EXPLAIN's output, but cost/#rows estimates are
  accompanied with actual numbers measured during execution.

SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE.

== Interface ==

- The parser should support EXPLAIN ANALYZE syntax

- EXPLAIN will produce extra columns:

- after "rows" there will be "real_rows"
- after "filtered" there will be "real_filtered"
- there will be 'loops' column, which will tell how many times a scan was
   performed.

== Implementation ==

(TODO: where do we save the flag that this is EXPLAIN_ANALYZE ?
UNCACHEABLE_EXPLAIN is a bad one, because it is in every subselect...
There's LEX::describe, which is a bitmap DESCRIBE{NORMAL|EXTENDED|PARTITIONS},
but it is checked in many places. We need a flag next to LEX::describe)

EXPLAIN ANALYZE should be run, generally, like a regular select (and not like
an EXPLAIN). The differences are:

- need to prevent it from sending SELECT's column list to the client
- need to prevent it from sending SELECT's data to the client

- need to have individual counters for each table.
  We've got Handler counters and userstat counters already. Isn't it too much,
  perhaps we could
   -- have one counter that we increment during the query
   -- at query end, 'distribute' the increments to Handler_xxx, userstat, etc.

- need to save each JOIN's plan before it is deleted (we need to save it
  'late', so that we can get query plan + actual counter values)

TODO: what/how to count for range_checked_for_each_record?

Documentation is at: http://kb.askmonty.org/en/explain-analyze/

In other databases, EXPLAIN ANALYZE works as follows:
- It runs the select normally, discarding its output
- Instead, it produces EXPLAIN's output, but cost/#rows estimates are
  accompanied with actual numbers measured during execution.

SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE.

== Interface ==

- The parser should support EXPLAIN ANALYZE syntax

- EXPLAIN will produce extra columns:

- after "rows" there will be "real_rows"
- after "filtered" there will be "real_filtered"
- there will be 'loops' column, which will tell how many times a scan was
   performed.

== Implementation ==

(TODO: where do we save the flag that this is EXPLAIN_ANALYZE ?
UNCACHEABLE_EXPLAIN is a bad one, because it is in every subselect...
There's LEX::describe, which is a bitmap DESCRIBE{NORMAL|EXTENDED|PARTITIONS},
but it is checked in many places. We need a flag next to LEX::describe)

EXPLAIN ANALYZE should be run, generally, like a regular select (and not like
an EXPLAIN). The differences are:

- need to prevent it from sending SELECT's column list to the client
- need to prevent it from sending SELECT's data to the client

- need to have individual counters for each table.
  We've got Handler counters and userstat counters already. Isn't it too much,
  perhaps we could
   -- have one counter that we increment during the query
   -- at query end, 'distribute' the increments to Handler_xxx, userstat, etc.

- need to save each JOIN's plan before it is deleted (we need to save it
  'late', so that we can get query plan + actual counter values)

TODO: what/how to count for range_checked_for_each_record?

Sergei Petrunia made changes - 2013-08-17 11:31

Status

In Progress [ 3 ]

Stalled [ 10000 ]

Sergei Golubchik made changes - 2014-01-15 16:50

Fix Version/s

10.1.0 [ 12200 ]

Sergei Golubchik made changes - 2014-04-16 23:35

Labels

optimizer

Sergei Petrunia added a comment - 2014-05-10 14:10

Additional notes from discussion with igor:

It would be nice to show the amounts of time spent accessing the tables.

P_S code has the functionality to do it.
Reusing P_S code is not straightforward
- In P_S, each table has one counter. We need one counter per table use (this is also referred to as "Self-join problem")
- What if ANALYZE EXPLAIN is run while P_S is disabled? (or the needed instruments are disabled?) It could switch the instruments ON for the duration of the query but this could be an unpleasant surprise for those who are doing system-wide collection.

Sergei Petrunia added a comment - 2014-05-10 14:10 Additional notes from discussion with igor : It would be nice to show the amounts of time spent accessing the tables. P_S code has the functionality to do it. Reusing P_S code is not straightforward In P_S, each table has one counter. We need one counter per table use (this is also referred to as "Self-join problem") What if ANALYZE EXPLAIN is run while P_S is disabled? (or the needed instruments are disabled?) It could switch the instruments ON for the duration of the query but this could be an unpleasant surprise for those who are doing system-wide collection.

Sergei Petrunia added a comment - 2014-05-10 15:41

So,

it's better to know table access times
however, tracking this will add a cost.

I would also argue that the most frequent case is the one where we have log_slow_verbosity=explain. In this case, we don't want to add overhead to every query. At the same time, we want EXPLAIN ANALYZE output in the slow query log. This means, we need a way to run EXPLAIN ANALYZE without overhead.

Sergei Petrunia added a comment - 2014-05-10 15:41 So, it's better to know table access times however, tracking this will add a cost. I would also argue that the most frequent case is the one where we have log_slow_verbosity=explain. In this case, we don't want to add overhead to every query. At the same time, we want EXPLAIN ANALYZE output in the slow query log. This means, we need a way to run EXPLAIN ANALYZE without overhead.

Sergei Petrunia made changes - 2014-05-10 15:42

Status

Stalled [ 10000 ]

In Progress [ 3 ]

Sergei Petrunia added a comment - 2014-05-19 17:31 - edited

Notes from discussion with serg and elenst.

Instead of "EXPLAIN ANALYZE $CMD" syntax, we will use "ANALYZE $cmd". "ANALYZE $cmd" will run the command (e.g. ANALYZE UPDATE will do the updates (checked: PG's EXPLAIN ANALYZE DELETE" will do deletes)), but produce output of "EXPLAIN $cmd", amended with information about actual execution.

Notes from discussion with igor: It would be nice to count the number of disk io caused by each table. (counting time is even nicer but may cause overhead).

Sergei Petrunia added a comment - 2014-05-19 17:31 - edited Notes from discussion with serg and elenst . Instead of "EXPLAIN ANALYZE $CMD" syntax, we will use "ANALYZE $cmd". "ANALYZE $cmd" will run the command (e.g. ANALYZE UPDATE will do the updates (checked: PG's EXPLAIN ANALYZE DELETE" will do deletes)), but produce output of "EXPLAIN $cmd", amended with information about actual execution. Notes from discussion with igor : It would be nice to count the number of disk io caused by each table. (counting time is even nicer but may cause overhead).

Sergei Petrunia made changes - 2014-05-20 14:07

Description

Documentation is at: http://kb.askmonty.org/en/explain-analyze/

In other databases, EXPLAIN ANALYZE works as follows:
- It runs the select normally, discarding its output
- Instead, it produces EXPLAIN's output, but cost/#rows estimates are
  accompanied with actual numbers measured during execution.

SHOW EXPLAIN gave us ability to produce query's EXPLAIN at arbitrary point in time. This makes it possible to implement EXPLAIN ANALYZE.

== Interface ==

- The parser should support EXPLAIN ANALYZE syntax

- EXPLAIN will produce extra columns:

- after "rows" there will be "real_rows"
- after "filtered" there will be "real_filtered"
- there will be 'loops' column, which will tell how many times a scan was
   performed.

== Implementation ==

(TODO: where do we save the flag that this is EXPLAIN_ANALYZE ?
UNCACHEABLE_EXPLAIN is a bad one, because it is in every subselect...
There's LEX::describe, which is a bitmap DESCRIBE{NORMAL|EXTENDED|PARTITIONS},
but it is checked in many places. We need a flag next to LEX::describe)

EXPLAIN ANALYZE should be run, generally, like a regular select (and not like
an EXPLAIN). The differences are:

- need to prevent it from sending SELECT's column list to the client
- need to prevent it from sending SELECT's data to the client

- need to have individual counters for each table.
  We've got Handler counters and userstat counters already. Isn't it too much,
  perhaps we could
   -- have one counter that we increment during the query
   -- at query end, 'distribute' the increments to Handler_xxx, userstat, etc.

- need to save each JOIN's plan before it is deleted (we need to save it
  'late', so that we can get query plan + actual counter values)

TODO: what/how to count for range_checked_for_each_record?

(Documentation for previous iteration of the feature is at http://kb.askmonty.org/en/explain-analyze/ )

h2. == SQL syntax ==

The new syntax:

{noformat}
ANALYZE $explainable_stmt
{noformat}

ANALYZE $stmt will run the $stmt, and produce the output that EXPLAIN $stmt would produce, annotated with info about the query execution.

h2. == Adjustments to EXPLAIN output ==

EXPLAIN FORMAT=JSON is easy to extend.

As for tabular EXPLAIN form, the following columns will be added:
- loops ( need this?)
- r_rows
- r_filtered

h2. == Implementation at SQL layer ==

The parser will set LEX::analyze_stmt flag for ANALYZE statements.
There is LEX::describe which stores flags about EXPLAIN EXTENDED|PARTITIONS
but it is used to check whether the query is an EXPLAIN or not, and ANALYZE
command is not an EXPLAIN, because it actually runs the query.

Note: ANALYZE UPDATE statement actually makes the updates. With SBR, we will
have to write the statement into the binlog. The slave must be able to execute
it (I suspect current slave will choke on a statement that produces output).

h2. == Counting ==
We will collect two kinds of counters:

1. Some are counted at SQL level, like filtered%, ICP_filtered, #rows, etc.

2. Some will be counted deeper inside the engine, like number of disk reads per table.

The problems with the latter are
* the counters are global or per-table. We need them to be per-table-instance
(to handle self-join-like queries correctly)
* They may be difficult to get from the SQL layer.

h2. == Getting the counter values ==
This is where the new SHOW EXPLAIN architecture plays against us.

The problem is: at the end of JOIN::optimize(), the plan is saved into an
Explain_select structure, and EXPLAIN output is produced from Explain_select.

Explain_select object has only "explain" information, it has no connection to
objects that participate in query execution (like JOIN_TABs, or handler*, etc).

An apparent solution is to have JOIN::cleanup() save execution data using a
call that is similar to save_explain_data()

Sergei Petrunia added a comment - 2014-05-21 11:41

Tasks

1. Get ANALYZE working for tabular EXPLAIN format
1.1 Adjust the SQL parser
1.2 Add new columns (r_rows, r_filtered)
1.2 Make UPDATE/DELETE collect and print ANALYZE data
1.3 Make SELECT collect and print ANALYZE

After the above, we will have ANALYZE counterpart for the information that we've had in the tabular form of EXPLAIN output.

2. Extras in the tabular form
2.1 One item so far: "Using index condition (X%)"

3. Support ANALYZE FORMAT=JSON and print more data
3.1 Support ANALYZE FORMAT=JSON
3.2 Add more execution data
3.2.1 Join buffer reads
3.2.2 BKA buffer refills
3.2.3 Disk page reads
3.4.4 etc

Sergei Petrunia added a comment - 2014-05-21 11:41 Tasks 1. Get ANALYZE working for tabular EXPLAIN format 1.1 Adjust the SQL parser 1.2 Add new columns (r_rows, r_filtered) 1.2 Make UPDATE/DELETE collect and print ANALYZE data 1.3 Make SELECT collect and print ANALYZE After the above, we will have ANALYZE counterpart for the information that we've had in the tabular form of EXPLAIN output. 2. Extras in the tabular form 2.1 One item so far: "Using index condition (X%)" 3. Support ANALYZE FORMAT=JSON and print more data 3.1 Support ANALYZE FORMAT=JSON 3.2 Add more execution data 3.2.1 Join buffer reads 3.2.2 BKA buffer refills 3.2.3 Disk page reads 3.4.4 etc

Sergei Petrunia made changes - 2014-05-21 14:45

Summary

EXPLAIN ANALYZE

ANALYZE $stmt

Sergei Golubchik made changes - 2014-06-13 15:06

Workflow

defaullt [ 12711 ]

MariaDB v2 [ 43826 ]

Sergei Petrunia added a comment - 2014-06-17 22:09

Got basic things to work.

The design is as follows:

Query plan (aka EXPLAIN data structures) include counters.
Execution code increments them
After the query is finished, we can print them.

Sergei Petrunia added a comment - 2014-06-17 22:09 Got basic things to work. The design is as follows: Query plan (aka EXPLAIN data structures) include counters. Execution code increments them After the query is finished, we can print them.

Sergei Petrunia added a comment - 2014-06-17 22:22

Hit a problem (the example I am looking at is one with subquery, but one could probably hit it without it also).

Currently, [SHOW] EXPLAIN code does:

join->optimize()

join->save_explain_plan()   // (1)

join->exec()

join->save_explain_plan()   // (2)

The need to for call (2) was that because JOIN::exec() mades some last-minute changes to the query plan. I've tried to pull them out and put into JOIN::optimize() when working on SHOW EXPLAIN, but hit a problem that these changes are all over JOIN::exec().

For SHOW EXPLAIN, we could store these last-minute choices with the call (2).

Howeve, call (2) has disastrous consequences for ANALYZE. It overwrites the query plan and destroys the counter values.

So, I removed the call (2). I went through the known last-minute changes in the query plan (made by ORDER/GROUP BY optimizer), and added a call which saves just the changed info.

Running tests after that has shown that there is another gotcha - INFORMATION_SCHEMA. For some reason, a part of I_S optimizations is made very late, right in JOIN::exec(). When we're running EXPLAIN, it goes into JOIN::exec, goes into ##get_all_tables()##, and then that function has " if (lex->describe)

{ return 0; }

# in the middle of it.

Apparently, there is no way to get the right query plan early in the current code.

Sergei Petrunia added a comment - 2014-06-17 22:22 Hit a problem (the example I am looking at is one with subquery, but one could probably hit it without it also). Currently, [SHOW] EXPLAIN code does: join->optimize() join->save_explain_plan() // (1) join->exec() join->save_explain_plan() // (2) The need to for call (2) was that because JOIN::exec() mades some last-minute changes to the query plan. I've tried to pull them out and put into JOIN::optimize() when working on SHOW EXPLAIN, but hit a problem that these changes are all over JOIN::exec(). For SHOW EXPLAIN, we could store these last-minute choices with the call (2). Howeve, call (2) has disastrous consequences for ANALYZE. It overwrites the query plan and destroys the counter values. So, I removed the call (2). I went through the known last-minute changes in the query plan (made by ORDER/GROUP BY optimizer), and added a call which saves just the changed info. Running tests after that has shown that there is another gotcha - INFORMATION_SCHEMA. For some reason, a part of I_S optimizations is made very late, right in JOIN::exec(). When we're running EXPLAIN, it goes into JOIN::exec, goes into ##get_all_tables()##, and then that function has " if (lex->describe) { return 0; } # in the middle of it. Apparently, there is no way to get the right query plan early in the current code.

Sergei Petrunia added a comment - 2014-06-17 22:23

Possible options:

move a part of get_all_tables() into JOIN::optimize.
put another "modify the query plan" call into get_all_tables().

Sergei Petrunia added a comment - 2014-06-17 22:23 Possible options: move a part of get_all_tables() into JOIN::optimize. put another "modify the query plan" call into get_all_tables().

Sergei Petrunia added a comment - 2014-06-17 22:24 - edited

get_all_tables() is poorly written. It runs make_cond_for_info_schema(). That is, if we have a query like

select

  col1,

  (select ... from I_S.columns

  where non_frm_cond AND frm_cond AND correlation_cond)

from

  big_table

then each subquery execution will call get_all_tables which will call make_cond_for_info_schema, and in the end we will allocate O(#rows(big_table)) of Item_cond_and objects. Saw it myself in debugger.

Sergei Petrunia added a comment - 2014-06-17 22:24 - edited get_all_tables() is poorly written. It runs make_cond_for_info_schema(). That is, if we have a query like select col1, (select ... from I_S.columns where non_frm_cond AND frm_cond AND correlation_cond) from big_table then each subquery execution will call get_all_tables which will call make_cond_for_info_schema, and in the end we will allocate O(#rows(big_table)) of Item_cond_and objects. Saw it myself in debugger.

Sergei Petrunia added a comment - 2014-06-17 22:28

As MySQL 5.6:

they did some work with moving ORDER BY out of JOIN::exec() and into JOIN::optimize() AFAIU
however, they didn't move I_S optimization. They have JOIN::explain() which is an EXPLAIN-counterpart of JOIN::exec().

Sergei Petrunia added a comment - 2014-06-17 22:28 As MySQL 5.6: they did some work with moving ORDER BY out of JOIN::exec() and into JOIN::optimize() AFAIU however, they didn't move I_S optimization. They have JOIN::explain() which is an EXPLAIN-counterpart of JOIN::exec().

Sergei Petrunia made changes - 2014-06-18 19:39

Priority

Major [ 3 ]

Critical [ 2 ]

Sergei Petrunia made changes - 2014-06-19 18:50

Priority

Critical [ 2 ]

Major [ 3 ]

Sergei Petrunia added a comment - 2014-06-20 22:59

Another issue I'm facing after I've removed the second save_explain_data() call
is this:

 EXPLAIN EXTENDED DELETE v1 FROM t2, v1 WHERE t2.x = v1.a;

 id     select_type     table   type    possible_keys   key     key_len ref     rows    filtered        Extra

 1      SIMPLE  t2      ALL     NULL    NULL    NULL    NULL    4       100.00  Using where

-1      SIMPLE  t1      eq_ref  PRIMARY PRIMARY 4       test.t2.x       1       100.00

+1      SIMPLE  v1      eq_ref  PRIMARY PRIMARY 4       test.t2.x       1       100.00

VIEW's name is displayed instead of table name. It happens only on EXPLAIN
DELETE (SELECTs are ok).

Sergei Petrunia added a comment - 2014-06-20 22:59 Another issue I'm facing after I've removed the second save_explain_data() call is this: EXPLAIN EXTENDED DELETE v1 FROM t2, v1 WHERE t2.x = v1.a; id select_type table type possible_keys key key_len ref rows filtered Extra 1 SIMPLE t2 ALL NULL NULL NULL NULL 4 100.00 Using where -1 SIMPLE t1 eq_ref PRIMARY PRIMARY 4 test.t2.x 1 100.00 +1 SIMPLE v1 eq_ref PRIMARY PRIMARY 4 test.t2.x 1 100.00 VIEW's name is displayed instead of table name. It happens only on EXPLAIN DELETE (SELECTs are ok).

Sergei Petrunia added a comment - 2014-06-20 23:04

Debugging how it worked in 10.0 with the second save_explain_data() call, I see that it worked in an unacceptable way:

* JOIN::prepare() is run for the parent select

* JOIN::optimize() is run for the parent select (the child is merged)

* query plan is saved.  table->pos_in_table_list->alias == "v1"

* JOIN::exec() is called for the parent select

  * it calls select_describe()

    * which calls mysql_explain_union() for children

      * which runs JOIN::prepare, optimize, etc. for the view (select_number=2)

        this has an effect of putting back table->pos_in_table_list

        to point to "t1" but apparently it's not an acceptable solution.

Sergei Petrunia added a comment - 2014-06-20 23:04 Debugging how it worked in 10.0 with the second save_explain_data() call, I see that it worked in an unacceptable way: * JOIN::prepare() is run for the parent select * JOIN::optimize() is run for the parent select (the child is merged) * query plan is saved. table->pos_in_table_list->alias == "v1" * JOIN::exec() is called for the parent select * it calls select_describe() * which calls mysql_explain_union() for children * which runs JOIN::prepare, optimize, etc. for the view (select_number=2) this has an effect of putting back table->pos_in_table_list to point to "t1" but apparently it's not an acceptable solution.

Sergei Petrunia added a comment - 2014-06-20 23:35

Checking why EXPLAIN DELETE is affected and EXPLAIN SELECT is not...

It turns out, there is a special kind of early merge, DT_MERGE_FOR_INSERT, which is also used for multi-table DELETEs. The view is merged here:

  #0  mysql_derived_merge_for_insert (

  #1  0x00000000006320bb in mysql_handle_derived (lex=0x7fffd1b66cb0, phases=16) at /home/psergey/dev-git/10.1-explain-analyze/sql/sql_derived.cc:118

  #2  0x00000000009b0c24 in mysql_multi_delete_prepare (

  #3  0x000000000065d267 in mysql_execute_command (

  #4  0x0000000000664fe7 in mysql_parse (

and when regular SELECT handling tries to merge it, it's already merged.

A distinguishing case: view's TABLE_LIST has t->merged_for_insert=TRUE.

Sergei Petrunia added a comment - 2014-06-20 23:35 Checking why EXPLAIN DELETE is affected and EXPLAIN SELECT is not... It turns out, there is a special kind of early merge, DT_MERGE_FOR_INSERT, which is also used for multi-table DELETEs. The view is merged here: #0 mysql_derived_merge_for_insert ( #1 0x00000000006320bb in mysql_handle_derived (lex=0x7fffd1b66cb0, phases=16) at /home/psergey/dev-git/10.1-explain-analyze/sql/sql_derived.cc:118 #2 0x00000000009b0c24 in mysql_multi_delete_prepare ( #3 0x000000000065d267 in mysql_execute_command ( #4 0x0000000000664fe7 in mysql_parse ( and when regular SELECT handling tries to merge it, it's already merged. A distinguishing case: view's TABLE_LIST has t->merged_for_insert=TRUE.

Sergei Petrunia added a comment - 2014-06-24 18:55

Fixed the problem with I_S by splitting get_all_tables() into the optimizer part and executor part.

Sergei Petrunia added a comment - 2014-06-24 18:55 Fixed the problem with I_S by splitting get_all_tables() into the optimizer part and executor part.

Sergei Petrunia added a comment - 2014-06-25 09:55

Fixed the problem with order_by.test (it was a trivial bug).
Discussed "EXPLAIN UPDATE shows VIEW names" problem with Sanja

Sergei Petrunia added a comment - 2014-06-25 09:55 Fixed the problem with order_by.test (it was a trivial bug). Discussed "EXPLAIN UPDATE shows VIEW names" problem with Sanja

Sergei Petrunia made changes - 2014-06-25 09:57

Link

This issue relates to ~~MDEV-6382~~ [ ~~MDEV-6382~~ ]

Sergei Petrunia added a comment - 2014-06-25 16:37

Merged with 10.1

Sergei Petrunia added a comment - 2014-06-25 16:37 Merged with 10.1

Sergei Petrunia added a comment - 2014-06-25 16:40

As for "EXPLAIN UPDATE shows VIEW names":

In EXPLAIN output, tables from views show their aliases inside the VIEWs (as expected).

In MySQL 5.6, they don't have table->pos_in_table_list->alias == v1. instead, they have table->pos_in_table_list->alias=t1. Grepping the source code for DT_MERGE_FOR_INSERT (or for DT_MERGE or related terms) finds nothing, so I assume they have re-worked derived table merge algorithm and so do not have this "special kind of merge" problem.

Sergei Petrunia added a comment - 2014-06-25 16:40 As for "EXPLAIN UPDATE shows VIEW names": In EXPLAIN output, tables from views show their aliases inside the VIEWs (as expected). In MySQL 5.6, they don't have table->pos_in_table_list->alias == v1. instead, they have table->pos_in_table_list->alias=t1. Grepping the source code for DT_MERGE_FOR_INSERT (or for DT_MERGE or related terms) finds nothing, so I assume they have re-worked derived table merge algorithm and so do not have this "special kind of merge" problem.

Sergei Petrunia made changes - 2014-06-25 16:52

Link

This issue relates to ~~MDEV-6388~~ [ ~~MDEV-6388~~ ]

Elena Stepanova made changes - 2014-06-26 14:18

Link

This issue relates to ~~MDEV-6393~~ [ ~~MDEV-6393~~ ]

Elena Stepanova made changes - 2014-06-26 14:23

Link

This issue relates to ~~MDEV-6394~~ [ ~~MDEV-6394~~ ]

Elena Stepanova made changes - 2014-06-26 14:50

Link

This issue relates to ~~MDEV-6395~~ [ ~~MDEV-6395~~ ]

Elena Stepanova made changes - 2014-06-26 15:08

Link

This issue relates to ~~MDEV-6396~~ [ ~~MDEV-6396~~ ]

Elena Stepanova made changes - 2014-06-26 15:35

Link

This issue relates to ~~MDEV-6397~~ [ ~~MDEV-6397~~ ]

Elena Stepanova made changes - 2014-06-26 16:00

Link

This issue relates to ~~MDEV-6398~~ [ ~~MDEV-6398~~ ]

Sergei Petrunia made changes - 2014-06-26 19:19

Link

This issue relates to ~~MDEV-6400~~ [ ~~MDEV-6400~~ ]

Sergei Petrunia added a comment - 2014-06-26 19:32

Functionality intended for 10.1.0 has been pushed.

Sergei Petrunia added a comment - 2014-06-26 19:32 Functionality intended for 10.1.0 has been pushed.

Sergei Petrunia made changes - 2014-06-26 19:32

Resolution		Fixed [ 1 ]
Status	In Progress [ 3 ]	Closed [ 6 ]

Sergei Petrunia made changes - 2014-06-30 23:01

Description

(Documentation for previous iteration of the feature is at http://kb.askmonty.org/en/explain-analyze/ )

h2. == SQL syntax ==

The new syntax:

{noformat}
ANALYZE $explainable_stmt
{noformat}

ANALYZE $stmt will run the $stmt, and produce the output that EXPLAIN $stmt would produce, annotated with info about the query execution.

h2. == Adjustments to EXPLAIN output ==

EXPLAIN FORMAT=JSON is easy to extend.

As for tabular EXPLAIN form, the following columns will be added:
- loops ( need this?)
- r_rows
- r_filtered

h2. == Implementation at SQL layer ==

The parser will set LEX::analyze_stmt flag for ANALYZE statements.
There is LEX::describe which stores flags about EXPLAIN EXTENDED|PARTITIONS
but it is used to check whether the query is an EXPLAIN or not, and ANALYZE
command is not an EXPLAIN, because it actually runs the query.

Note: ANALYZE UPDATE statement actually makes the updates. With SBR, we will
have to write the statement into the binlog. The slave must be able to execute
it (I suspect current slave will choke on a statement that produces output).

h2. == Counting ==
We will collect two kinds of counters:

1. Some are counted at SQL level, like filtered%, ICP_filtered, #rows, etc.

2. Some will be counted deeper inside the engine, like number of disk reads per table.

The problems with the latter are
* the counters are global or per-table. We need them to be per-table-instance
(to handle self-join-like queries correctly)
* They may be difficult to get from the SQL layer.

h2. == Getting the counter values ==
This is where the new SHOW EXPLAIN architecture plays against us.

The problem is: at the end of JOIN::optimize(), the plan is saved into an
Explain_select structure, and EXPLAIN output is produced from Explain_select.

Explain_select object has only "explain" information, it has no connection to
objects that participate in query execution (like JOIN_TABs, or handler*, etc).

An apparent solution is to have JOIN::cleanup() save execution data using a
call that is similar to save_explain_data()

(Documentation is at https://mariadb.com/kb/en/analyze-statement/)

h2. == SQL syntax ==

The new syntax:

{noformat}
ANALYZE $explainable_stmt
{noformat}

ANALYZE $stmt will run the $stmt, and produce the output that EXPLAIN $stmt would produce, annotated with info about the query execution.

h2. == Adjustments to EXPLAIN output ==

EXPLAIN FORMAT=JSON is easy to extend.

As for tabular EXPLAIN form, the following columns will be added:
- loops ( need this?)
- r_rows
- r_filtered

h2. == Implementation at SQL layer ==

The parser will set LEX::analyze_stmt flag for ANALYZE statements.
There is LEX::describe which stores flags about EXPLAIN EXTENDED|PARTITIONS
but it is used to check whether the query is an EXPLAIN or not, and ANALYZE
command is not an EXPLAIN, because it actually runs the query.

Note: ANALYZE UPDATE statement actually makes the updates. With SBR, we will
have to write the statement into the binlog. The slave must be able to execute
it (I suspect current slave will choke on a statement that produces output).

h2. == Counting ==
We will collect two kinds of counters:

1. Some are counted at SQL level, like filtered%, ICP_filtered, #rows, etc.

2. Some will be counted deeper inside the engine, like number of disk reads per table.

The problems with the latter are
* the counters are global or per-table. We need them to be per-table-instance
(to handle self-join-like queries correctly)
* They may be difficult to get from the SQL layer.

h2. == Getting the counter values ==
This is where the new SHOW EXPLAIN architecture plays against us.

The problem is: at the end of JOIN::optimize(), the plan is saved into an
Explain_select structure, and EXPLAIN output is produced from Explain_select.

Explain_select object has only "explain" information, it has no connection to
objects that participate in query execution (like JOIN_TABs, or handler*, etc).

An apparent solution is to have JOIN::cleanup() save execution data using a
call that is similar to save_explain_data()

Sergei Petrunia made changes - 2014-07-07 15:05

Link

This issue relates to ~~MDEV-6422~~ [ ~~MDEV-6422~~ ]

Sergei Petrunia made changes - 2014-10-17 22:29

Labels

optimizer

analyze-stmt optimizer

Sergei Petrunia made changes - 2014-10-17 22:33

Link

This issue relates to ~~MDEV-6388~~ [ ~~MDEV-6388~~ ]

Sergei Petrunia made changes - 2014-10-17 22:33

Link

This issue relates to ~~MDEV-6388~~ [ ~~MDEV-6388~~ ]

Elena Stepanova made changes - 2014-11-05 02:00

Link

This issue relates to ~~MDEV-7023~~ [ ~~MDEV-7023~~ ]

Elena Stepanova made changes - 2014-11-05 13:23

Link

This issue relates to ~~MDEV-7024~~ [ ~~MDEV-7024~~ ]

Elena Stepanova made changes - 2014-11-05 16:24

Link

This issue relates to ~~MDEV-7025~~ [ ~~MDEV-7025~~ ]

Elena Stepanova made changes - 2014-11-05 17:37

Link

This issue relates to ~~MDEV-7027~~ [ ~~MDEV-7027~~ ]

Elena Stepanova made changes - 2014-11-14 16:51

Link

This issue relates to MDEV-7115 [ MDEV-7115 ]

Elena Stepanova made changes - 2014-11-14 16:56

Link

This issue relates to MDEV-7117 [ MDEV-7117 ]

Rasmus Johansson (Inactive) made changes - 2015-05-18 17:51

Workflow

MariaDB v2 [ 43826 ]

MariaDB v3 [ 64424 ]

Geoff Montee (Inactive) made changes - 2019-10-09 18:12

Link

This issue causes MDEV-17079 [ MDEV-17079 ]

Sergei Golubchik made changes - 2021-12-06 21:22

Workflow

MariaDB v3 [ 64424 ]

MariaDB v4 [ 131952 ]

MariaDB Server

ANALYZE $stmt

Details

Description

== SQL syntax ==

== Adjustments to EXPLAIN output ==

== Implementation at SQL layer ==

== Counting ==

== Getting the counter values ==

Attachments

Issue Links

Activity

People

Dates

Git Integration