Details
Description
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not:
Disabling the optimization fixed the issue for us and the query results started to behave:
set global optimizer_switch='split_materialized=off'
|
I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is:
SELECT COUNT(*) |
FROM project_time_lines_dates ptld |
|
INNER JOIN date_revision dr |
ON dr.id = ptld.date_revision_id |
|
INNER JOIN ( |
SELECT |
ptld.project_id AS project_id, |
ptld.project_type_id AS project_type_id, |
ptld.leaf_component_id AS leaf_component_id, |
dr.date_id AS date_id, |
MAX(dr.order_idx) AS max_order |
FROM project_time_lines_dates ptld |
INNER JOIN date_revision dr |
ON dr.id = ptld.date_revision_id |
WHERE dr.date_id IN ( |
SELECT DISTINCT dr.date_id |
FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr |
INNER JOIN date_revision dr |
ON dr.id = tstrdr.start_date_revision_id |
WHERE tstrdr.offered_for_overdue |
)
|
|
GROUP BY |
ptld.project_id,
|
ptld.project_type_id,
|
ptld.leaf_component_id,
|
dr.date_id
|
) max_anticipated
|
ON max_anticipated.project_id = ptld.project_id |
AND max_anticipated.project_type_id = ptld.project_type_id |
AND max_anticipated.date_id = dr.date_id |
AND max_anticipated.max_order = dr.order_idx |
|
WHERE dr.is_anticipated = TRUE |
AND ptld.project_id = 5896 |
The anticipated result is 2, but when LATERAL DERIVED is used, the number of returned rows is 0.
EXPLAIN of the query with disabled LATERAL DERIVED looks like this:
+------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+
|
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
|
+------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+
|
| 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index |
|
| 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where |
|
| 1 | PRIMARY | <derived2> | ref | key1 | key1 | 9 | bugreport.dr.date_id,bugreport.dr.order_idx | 10 | Using where |
|
| 2 | DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index; Using temporary; Using filesort |
|
| 2 | DERIVED | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | |
|
| 2 | DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index |
|
| 2 | DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; FirstMatch(dr) |
|
+------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+
|
EXPLAIN of the query with enabled LATERAL DERIVED looks like this:
+------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+
|
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
|
+------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+
|
| 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index |
|
| 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where |
|
| 1 | PRIMARY | <derived2> | ref | key0 | key0 | 4 | bugreport.ptld.project_type_id | 2 | Using where |
|
| 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using temporary; Using filesort |
|
| 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index; Start temporary |
|
| 2 | LATERAL DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; End temporary |
|
| 2 | LATERAL DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index |
|
+------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+
|
7 rows in set (0.001 sec)
|
Attachments
Issue Links
- is duplicated by
-
MDEV-26749 Optimizer_switch split_materialized Returns Wrong Results
-
- Closed
-
-
MDEV-26965 Left join to derived table on multiple clauses with aggregation returns missing/incorrect results
-
- Closed
-
- relates to
-
MDEV-25714 Join using derived with aggregation returns incorrect results
-
- Closed
-
-
MDEV-26965 Left join to derived table on multiple clauses with aggregation returns missing/incorrect results
-
- Closed
-
-
MDEV-27510 Query returns wrong result when using split optimization
-
- Closed
-
Activity
Field | Original Value | New Value |
---|---|---|
Description |
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not: - https://jira.mariadb.org/browse/MDEV-21328 - https://jira.mariadb.org/browse/MDEV-26749 Disabling the optimization fixed the issue for us and the query results started to behave: {code} set global optimizer_switch='split_materialized=off' {code} I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is: {code:sql} SELECT COUNT(*) FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id INNER JOIN ( SELECT ptld.project_id AS project_id, ptld.project_type_id AS project_type_id, ptld.leaf_component_id AS leaf_component_id, dr.date_id AS date_id, MAX(dr.order_idx) AS max_order FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id WHERE dr.date_id IN ( SELECT DISTINCT dr.date_id FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr INNER JOIN date_revision dr ON dr.id = tstrdr.start_date_revision_id WHERE tstrdr.offered_for_overdue ) GROUP BY ptld.project_id, ptld.project_type_id, ptld.leaf_component_id, dr.date_id ) max_anticipated ON max_anticipated.project_id = ptld.project_id AND max_anticipated.project_type_id = ptld.project_type_id AND max_anticipated.date_id = dr.date_id AND max_anticipated.max_order = dr.order_idx WHERE dr.is_anticipated = TRUE AND ptld.project_id = 5896 {code} The anticipated result is *2*, but when LATERAL DERIVED is used, the number of returned rows is *0*. |
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not: - https://jira.mariadb.org/browse/MDEV-21328 - https://jira.mariadb.org/browse/MDEV-26749 Disabling the optimization fixed the issue for us and the query results started to behave: {code} set global optimizer_switch='split_materialized=off' {code} I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is: {code:sql} SELECT COUNT(*) FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id INNER JOIN ( SELECT ptld.project_id AS project_id, ptld.project_type_id AS project_type_id, ptld.leaf_component_id AS leaf_component_id, dr.date_id AS date_id, MAX(dr.order_idx) AS max_order FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id WHERE dr.date_id IN ( SELECT DISTINCT dr.date_id FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr INNER JOIN date_revision dr ON dr.id = tstrdr.start_date_revision_id WHERE tstrdr.offered_for_overdue ) GROUP BY ptld.project_id, ptld.project_type_id, ptld.leaf_component_id, dr.date_id ) max_anticipated ON max_anticipated.project_id = ptld.project_id AND max_anticipated.project_type_id = ptld.project_type_id AND max_anticipated.date_id = dr.date_id AND max_anticipated.max_order = dr.order_idx WHERE dr.is_anticipated = TRUE AND ptld.project_id = 5896 {code} The anticipated result is *2*, but when LATERAL DERIVED is used, the number of returned rows is *0*. {{EXPLAIN}} of the query with disabled LATERAL DERIVED looks like this: {code} +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key1 | key1 | 9 | bugreport.dr.date_id,bugreport.dr.order_idx | 10 | Using where | | 2 | DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index; Using temporary; Using filesort | | 2 | DERIVED | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | | | 2 | DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index | | 2 | DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; FirstMatch(dr) MariaDB [bugreport]> +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ {code} {{EXPLAIN}} of the query with enabled LATERAL DERIVED looks like this: {code} +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key0 | key0 | 4 | bugreport.ptld.project_type_id | 2 | Using where | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using temporary; Using filesort | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index; Start temporary | | 2 | LATERAL DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; End temporary | | 2 | LATERAL DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ {code} |
Description |
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not: - https://jira.mariadb.org/browse/MDEV-21328 - https://jira.mariadb.org/browse/MDEV-26749 Disabling the optimization fixed the issue for us and the query results started to behave: {code} set global optimizer_switch='split_materialized=off' {code} I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is: {code:sql} SELECT COUNT(*) FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id INNER JOIN ( SELECT ptld.project_id AS project_id, ptld.project_type_id AS project_type_id, ptld.leaf_component_id AS leaf_component_id, dr.date_id AS date_id, MAX(dr.order_idx) AS max_order FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id WHERE dr.date_id IN ( SELECT DISTINCT dr.date_id FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr INNER JOIN date_revision dr ON dr.id = tstrdr.start_date_revision_id WHERE tstrdr.offered_for_overdue ) GROUP BY ptld.project_id, ptld.project_type_id, ptld.leaf_component_id, dr.date_id ) max_anticipated ON max_anticipated.project_id = ptld.project_id AND max_anticipated.project_type_id = ptld.project_type_id AND max_anticipated.date_id = dr.date_id AND max_anticipated.max_order = dr.order_idx WHERE dr.is_anticipated = TRUE AND ptld.project_id = 5896 {code} The anticipated result is *2*, but when LATERAL DERIVED is used, the number of returned rows is *0*. {{EXPLAIN}} of the query with disabled LATERAL DERIVED looks like this: {code} +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key1 | key1 | 9 | bugreport.dr.date_id,bugreport.dr.order_idx | 10 | Using where | | 2 | DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index; Using temporary; Using filesort | | 2 | DERIVED | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | | | 2 | DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index | | 2 | DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; FirstMatch(dr) MariaDB [bugreport]> +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ {code} {{EXPLAIN}} of the query with enabled LATERAL DERIVED looks like this: {code} +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key0 | key0 | 4 | bugreport.ptld.project_type_id | 2 | Using where | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using temporary; Using filesort | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index; Start temporary | | 2 | LATERAL DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; End temporary | | 2 | LATERAL DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ {code} |
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not: - https://jira.mariadb.org/browse/MDEV-21328 - https://jira.mariadb.org/browse/MDEV-26749 Disabling the optimization fixed the issue for us and the query results started to behave: {code} set global optimizer_switch='split_materialized=off' {code} I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is: {code:sql} SELECT COUNT(*) FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id INNER JOIN ( SELECT ptld.project_id AS project_id, ptld.project_type_id AS project_type_id, ptld.leaf_component_id AS leaf_component_id, dr.date_id AS date_id, MAX(dr.order_idx) AS max_order FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id WHERE dr.date_id IN ( SELECT DISTINCT dr.date_id FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr INNER JOIN date_revision dr ON dr.id = tstrdr.start_date_revision_id WHERE tstrdr.offered_for_overdue ) GROUP BY ptld.project_id, ptld.project_type_id, ptld.leaf_component_id, dr.date_id ) max_anticipated ON max_anticipated.project_id = ptld.project_id AND max_anticipated.project_type_id = ptld.project_type_id AND max_anticipated.date_id = dr.date_id AND max_anticipated.max_order = dr.order_idx WHERE dr.is_anticipated = TRUE AND ptld.project_id = 5896 {code} The anticipated result is *2*, but when LATERAL DERIVED is used, the number of returned rows is *0*. {{EXPLAIN}} of the query with disabled LATERAL DERIVED looks like this: {code} +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key1 | key1 | 9 | bugreport.dr.date_id,bugreport.dr.order_idx | 10 | Using where | | 2 | DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index; Using temporary; Using filesort | | 2 | DERIVED | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | | | 2 | DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index | | 2 | DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; FirstMatch(dr) | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ {code} {{EXPLAIN}} of the query with enabled LATERAL DERIVED looks like this: {code} +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key0 | key0 | 4 | bugreport.ptld.project_type_id | 2 | Using where | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using temporary; Using filesort | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index; Start temporary | | 2 | LATERAL DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; End temporary | | 2 | LATERAL DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ 7 rows in set (0.001 sec) {code} |
Workflow | MariaDB v3 [ 128012 ] | MariaDB v4 [ 143392 ] |
Attachment | mariadb10.4.22.tar.gz [ 61514 ] |
Affects Version/s | 10.3 [ 22126 ] | |
Affects Version/s | 10.4 [ 22408 ] | |
Affects Version/s | 10.5 [ 23123 ] | |
Affects Version/s | 10.6 [ 24028 ] |
Fix Version/s | 10.3 [ 22126 ] | |
Fix Version/s | 10.4 [ 22408 ] | |
Fix Version/s | 10.5 [ 23123 ] | |
Fix Version/s | 10.6 [ 24028 ] |
Assignee | Igor Babaev [ igor ] |
Status | Open [ 1 ] | Confirmed [ 10101 ] |
Affects Version/s | 10.7 [ 24805 ] |
Fix Version/s | 10.7 [ 24805 ] |
Status | Confirmed [ 10101 ] | In Progress [ 3 ] |
Attachment | raw_ab.tar.gz [ 61563 ] |
Description |
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not: - https://jira.mariadb.org/browse/MDEV-21328 - https://jira.mariadb.org/browse/MDEV-26749 Disabling the optimization fixed the issue for us and the query results started to behave: {code} set global optimizer_switch='split_materialized=off' {code} I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is: {code:sql} SELECT COUNT(*) FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id INNER JOIN ( SELECT ptld.project_id AS project_id, ptld.project_type_id AS project_type_id, ptld.leaf_component_id AS leaf_component_id, dr.date_id AS date_id, MAX(dr.order_idx) AS max_order FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id WHERE dr.date_id IN ( SELECT DISTINCT dr.date_id FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr INNER JOIN date_revision dr ON dr.id = tstrdr.start_date_revision_id WHERE tstrdr.offered_for_overdue ) GROUP BY ptld.project_id, ptld.project_type_id, ptld.leaf_component_id, dr.date_id ) max_anticipated ON max_anticipated.project_id = ptld.project_id AND max_anticipated.project_type_id = ptld.project_type_id AND max_anticipated.date_id = dr.date_id AND max_anticipated.max_order = dr.order_idx WHERE dr.is_anticipated = TRUE AND ptld.project_id = 5896 {code} The anticipated result is *2*, but when LATERAL DERIVED is used, the number of returned rows is *0*. {{EXPLAIN}} of the query with disabled LATERAL DERIVED looks like this: {code} +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key1 | key1 | 9 | bugreport.dr.date_id,bugreport.dr.order_idx | 10 | Using where | | 2 | DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index; Using temporary; Using filesort | | 2 | DERIVED | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | | | 2 | DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index | | 2 | DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; FirstMatch(dr) | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ {code} {{EXPLAIN}} of the query with enabled LATERAL DERIVED looks like this: {code} +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key0 | key0 | 4 | bugreport.ptld.project_type_id | 2 | Using where | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using temporary; Using filesort | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index; Start temporary | | 2 | LATERAL DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; End temporary | | 2 | LATERAL DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ 7 rows in set (0.001 sec) {code} |
We've encountered inconsistent query results. By an trial/error we found out that the cause of the inconsistency is use of LATERAL DERIVED in the query plan. When it's used, the query doesn't return correct results.
We've found similar open issues already reported, but I can't say if it's the same cause or not: - - Disabling the optimization fixed the issue for us and the query results started to behave: {code} set global optimizer_switch='split_materialized=off' {code} I'm attaching minimal dump, that can be used to reproduce the issue. The query to reproduce the issue is: {code:sql} SELECT COUNT(*) FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id INNER JOIN ( SELECT ptld.project_id AS project_id, ptld.project_type_id AS project_type_id, ptld.leaf_component_id AS leaf_component_id, dr.date_id AS date_id, MAX(dr.order_idx) AS max_order FROM project_time_lines_dates ptld INNER JOIN date_revision dr ON dr.id = ptld.date_revision_id WHERE dr.date_id IN ( SELECT DISTINCT dr.date_id FROM time_sheet_activities_time_sheet_roles_date_revisions tstrdr INNER JOIN date_revision dr ON dr.id = tstrdr.start_date_revision_id WHERE tstrdr.offered_for_overdue ) GROUP BY ptld.project_id, ptld.project_type_id, ptld.leaf_component_id, dr.date_id ) max_anticipated ON max_anticipated.project_id = ptld.project_id AND max_anticipated.project_type_id = ptld.project_type_id AND max_anticipated.date_id = dr.date_id AND max_anticipated.max_order = dr.order_idx WHERE dr.is_anticipated = TRUE AND ptld.project_id = 5896 {code} The anticipated result is *2*, but when LATERAL DERIVED is used, the number of returned rows is *0*. {{EXPLAIN}} of the query with disabled LATERAL DERIVED looks like this: {code} +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key1 | key1 | 9 | bugreport.dr.date_id,bugreport.dr.order_idx | 10 | Using where | | 2 | DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index; Using temporary; Using filesort | | 2 | DERIVED | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | | | 2 | DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index | | 2 | DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; FirstMatch(dr) | +------+-------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------------------+------+----------------------------------------------+ {code} {{EXPLAIN}} of the query with enabled LATERAL DERIVED looks like this: {code} +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ | 1 | PRIMARY | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | | 1 | PRIMARY | dr | eq_ref | PRIMARY,date_id | PRIMARY | 4 | bugreport.ptld.date_revision_id | 1 | Using where | | 1 | PRIMARY | <derived2> | ref | key0 | key0 | 4 | bugreport.ptld.project_type_id | 2 | Using where | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using temporary; Using filesort | | 2 | LATERAL DERIVED | dr | ref | PRIMARY,date_id | date_id | 4 | bugreport.dr.date_id | 1 | Using index; Start temporary | | 2 | LATERAL DERIVED | tstrdr | ref | start_date_revision_id | start_date_revision_id | 5 | bugreport.dr.id | 4 | Using where; End temporary | | 2 | LATERAL DERIVED | ptld | ref | project_id_project_type_id_leaf_component_id_date_revision_id,project_id,project_id_2 | project_id_2 | 4 | const | 34 | Using index | +------+-----------------+------------+--------+---------------------------------------------------------------------------------------+------------------------+---------+---------------------------------+------+---------------------------------+ 7 rows in set (0.001 sec) {code} |
Link |
This issue relates to |
Link |
This issue relates to |
Summary | Query using LATERAL DERIVED not returning correct results | Wrong result from query when using split optimization |
Assignee | Igor Babaev [ igor ] | Oleksandr Byelkin [ sanja ] |
Status | In Progress [ 3 ] | In Review [ 10002 ] |
Link |
This issue relates to |
Assignee | Oleksandr Byelkin [ sanja ] | Igor Babaev [ igor ] |
Status | In Review [ 10002 ] | Stalled [ 10000 ] |
Component/s | Optimizer [ 10200 ] | |
Fix Version/s | 10.3.33 [ 26805 ] | |
Fix Version/s | 10.4.23 [ 26807 ] | |
Fix Version/s | 10.5.14 [ 26809 ] | |
Fix Version/s | 10.6.6 [ 26811 ] | |
Fix Version/s | 10.7.2 [ 26813 ] | |
Fix Version/s | 10.3 [ 22126 ] | |
Fix Version/s | 10.4 [ 22408 ] | |
Fix Version/s | 10.5 [ 23123 ] | |
Fix Version/s | 10.6 [ 24028 ] | |
Fix Version/s | 10.7 [ 24805 ] | |
Resolution | Fixed [ 1 ] | |
Status | Stalled [ 10000 ] | Closed [ 6 ] |
Link |
This issue is duplicated by |
Link |
This issue is duplicated by |
I have a similar issue. The query result in my case is incorrect only if I use the SELECT ... FROM (<subquery>) as a, (<subquery>) as b WHERE a.foo = b.foo and ... In that case the optimizer uses a LATERAL_DERIVED table and the result is incorrect. If I use the "(<subquery.) as a LEFT JOIN (<subquery>) as b ON ..." syntax, it works fine.
The problem happens only if my table has a specific size (yep, if I delete a single character from a random column the problem goes away). It also happens only if the table has a specific set of indexes with a very specific set of columns.
My query is quite simple:
SELECT a.date, a.c, b.foo, a.foo
FROM
(SELECT date,c,SUM(foo) as foo
FROM raw_b
WHERE date >= 20211231
GROUP BY date,c
) as b,
(SELECT date, c, SUM(foo) as foo
FROM raw_a
WHERE date >= 20211231
GROUP BY date,c
) as a where a.date = b.date AND a.c = b.c
Schema:
CREATE TABLE `raw_a` (
`date` varchar(100) COLLATE latin1_general_cs DEFAULT NULL,
`v` varchar(100) COLLATE latin1_general_cs DEFAULT NULL,
`c` char(5) COLLATE latin1_general_cs DEFAULT NULL,
`foo` bigint(20) DEFAULT NULL
) ENGINE=InnoDB DEFAULT CHARSET=latin1 COLLATE=latin1_general_cs ROW_FORMAT=DYNAMIC
CREATE TABLE `raw_b` (
`date` int(11) NOT NULL,
`v` varchar(10) COLLATE latin1_general_cs NOT NULL,
`t` varchar(512) COLLATE latin1_general_cs NOT NULL,
`c` varchar(50) COLLATE latin1_general_cs NOT NULL,
`o` varchar(10) COLLATE latin1_general_cs NOT NULL,
`sh` varchar(10) COLLATE latin1_general_cs NOT NULL,
`p` char(5) COLLATE latin1_general_cs NOT NULL,
`s` varchar(20) COLLATE latin1_general_cs NOT NULL,
`foo` bigint(20) DEFAULT NULL,
PRIMARY KEY (`date`,`v`,`t`,`c`,`o`,`sh`,`p`,`s`),
KEY `date` (`date`,`c`,`v`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1 COLLATE=latin1_general_cs ROW_FORMAT=DYNAMIC
I can provide a small dataset to reproduce the issue if needed.
Server version: 5.5.5-10.6.5-MariaDB MariaDB Server
Linux <host> 5.10.0-9-amd64 #1 SMP Debian 5.10.70-1 (2021-09-30) x86_64 GNU/Linux