#15 0x00007fb8fc2fbbbf in handle_select (thd=0x7fb8f2ab2070, lex=0x7fb8f0c7bb68, result=0x7fb8f0c8b1a0, setup_tables_done_option=0) at 10.1/sql/sql_select.cc:371
#16 0x00007fb8fc2bc071 in execute_sqlcom_select (thd=0x7fb8f2ab2070, all_tables=0x7fb8f0c82188) at 10.1/sql/sql_parse.cc:5805
#17 0x00007fb8fc2b24aa in mysql_execute_command (thd=0x7fb8f2ab2070) at 10.1/sql/sql_parse.cc:2937
#18 0x00007fb8fc61368c in sp_instr_stmt::exec_core (this=0x7fb8f0c82768, thd=0x7fb8f2ab2070, nextp=0x7fb8fbd19124) at 10.1/sql/sp_head.cc:3135
#19 0x00007fb8fc612dcb in sp_lex_keeper::reset_lex_and_exec_core (this=0x7fb8f0c827a8, thd=0x7fb8f2ab2070, nextp=0x7fb8fbd19124, open_tables=false, instr=0x7fb8f0c82768) at 10.1/sql/sp_head.cc:2901
#20 0x00007fb8fc613394 in sp_instr_stmt::execute (this=0x7fb8f0c82768, thd=0x7fb8f2ab2070, nextp=0x7fb8fbd19124) at 10.1/sql/sp_head.cc:3051
#21 0x00007fb8fc60efca in sp_head::execute (this=0x7fb8f0c7b088, thd=0x7fb8f2ab2070, merge_da_on_success=true) at 10.1/sql/sp_head.cc:1316
#22 0x00007fb8fc610e62 in sp_head::execute_procedure (this=0x7fb8f0c7b088, thd=0x7fb8f2ab2070, args=0x7fb8f2ab66b0) at 10.1/sql/sp_head.cc:2103
#23 0x00007fb8fc2b0c9d in do_execute_sp (thd=0x7fb8f2ab2070, sp=0x7fb8f0c7b088) at 10.1/sql/sql_parse.cc:2383
#24 0x00007fb8fc2b9d0e in mysql_execute_command (thd=0x7fb8f2ab2070) at 10.1/sql/sql_parse.cc:5180
#25 0x00007fb8fc2bf5de in mysql_parse (thd=0x7fb8f2ab2070, rawbuf=0x7fb8f0c22088 "CALL proc()", length=11, parser_state=0x7fb8fbd1a180) at 10.1/sql/sql_parse.cc:7181
#26 0x00007fb8fc2ae772 in dispatch_command (command=COM_QUERY, thd=0x7fb8f2ab2070, packet=0x7fb8f2ab8071 "CALL proc()", packet_length=11) at 10.1/sql/sql_parse.cc:1470
#27 0x00007fb8fc2ad49d in do_command (thd=0x7fb8f2ab2070) at 10.1/sql/sql_parse.cc:1093
#28 0x00007fb8fc3ee1f5 in do_handle_one_connection (thd_arg=0x7fb8f2ab2070) at 10.1/sql/sql_connect.cc:1350
#29 0x00007fb8fc3edf59 in handle_one_connection (arg=0x7fb8f2ab2070) at 10.1/sql/sql_connect.cc:1262
#30 0x00007fb8fcaa4f7c in pfs_spawn_thread (arg=0x7fb8f9023ff0) at 10.1/storage/perfschema/pfs.cc:1860
#31 0x00007fb8fa6bee9a in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0
#32 0x00007fb8f9debcbd in clone () from /lib/x86_64-linux-gnu/libc.so.6
join->join_tab[0] points to a heap table with alias="v1".
but
join->explain=NULL.
Sergei Petrunia
added a comment - Ok I was able to repeat.
It crashes when
"CALL proc()" executes
SELECT * FROM v2
which calls handle_derived() for v2
We reach JOIN::exec_inner() which has
join->optimized=true
join->have_query_plan == QEP_AVAILABLE
join->join_tab[0] points to a heap table with alias="v1".
but
join->explain=NULL.
There is no crash when the queries are run serially. derived subqueries have degenerate joins but the code handles that situation.
Sergei Petrunia
added a comment - There is no crash when the queries are run serially. derived subqueries have degenerate joins but the code handles that situation.
It's interesting that Explain data structure actually exists:
(gdb) p thd->lex->explain->get_node(2)
$17 = (Explain_select *) 0x7fffe001dde0
(gdb) p $17->message
$18 = 0x55555636d4dc "No tables used"
but it didn't get saved in join->explain.
Sergei Petrunia
added a comment - It's interesting that Explain data structure actually exists:
(gdb) p thd->lex->explain->get_node(2)
$17 = (Explain_select *) 0x7fffe001dde0
(gdb) p $17->message
$18 = 0x55555636d4dc "No tables used"
but it didn't get saved in join->explain.
Added Explain_select::my_join to check how is it possible that Explain_select exists but join->explain==NULL.
(gdb) p thd->lex->explain->get_node(2)
$3 = (Explain_node *) 0x7fffe0030c50
(gdb) p $3->my_join
$4 = (void *) 0x7fffe002fdf8
(gdb) p this
$5 = (JOIN * const) 0x7fffe00304d0
So, there is a different JOIN object!
(gdb) p (class JOIN*) 0x7fffe002fdf8 ##this is $4
$6 = (JOIN *) 0x7fffe002fdf8
(gdb) p $6->select_lex
$7 = (SELECT_LEX *) 0x7fffe00329a8
(gdb) p this->select_lex
$8 = (SELECT_LEX *) 0x7fffe002a8c0
it has a different select_lex.
(gdb) p $6->select_lex->select_number
$9 = 2
(gdb) p this->select_lex->select_number
$10 = 2
both select_lex objects have id=2!
Sergei Petrunia
added a comment - Added Explain_select::my_join to check how is it possible that Explain_select exists but join->explain==NULL.
(gdb) p thd->lex->explain->get_node(2)
$3 = (Explain_node *) 0x7fffe0030c50
(gdb) p $3->my_join
$4 = (void *) 0x7fffe002fdf8
(gdb) p this
$5 = (JOIN * const) 0x7fffe00304d0
So, there is a different JOIN object!
(gdb) p (class JOIN*) 0x7fffe002fdf8 ##this is $4
$6 = (JOIN *) 0x7fffe002fdf8
(gdb) p $6->select_lex
$7 = (SELECT_LEX *) 0x7fffe00329a8
(gdb) p this->select_lex
$8 = (SELECT_LEX *) 0x7fffe002a8c0
it has a different select_lex.
(gdb) p $6->select_lex->select_number
$9 = 2
(gdb) p this->select_lex->select_number
$10 = 2
both select_lex objects have id=2!
1) I just lost moment when select number in the statement became ID. Different statement and the same number are possible
2) where the select belong to have to be important.
3) how we reached select of VIEW?
Oleksandr Byelkin
added a comment - 1) I just lost moment when select number in the statement became ID. Different statement and the same number are possible
2) where the select belong to have to be important.
3) how we reached select of VIEW?
As result with talking with Sergey Petrunia we've found that mysql_make_view renumbers SELECTs...
Ways:
1) make parser number views from mlast high number
2) find how it avoid renumbering
Oleksandr Byelkin
added a comment - As result with talking with Sergey Petrunia we've found that mysql_make_view renumbers SELECTs...
Ways:
1) make parser number views from mlast high number
2) find how it avoid renumbering
OK. Renumbering work as it should during parsing in mysql_make_view().
So problem probably in caching statement by SP and re-parsing views...
Oleksandr Byelkin
added a comment - OK. Renumbering work as it should during parsing in mysql_make_view().
So problem probably in caching statement by SP and re-parsing views...
Problem is that in case VIEW is already processed it do not advance select_number counter in THD. Also the counter is not stored properly in the statement. So we get two selects #2 in case if ALTER is in time to fix the view.
Oleksandr Byelkin
added a comment - Problem is that in case VIEW is already processed it do not advance select_number counter in THD. Also the counter is not stored properly in the statement. So we get two selects #2 in case if ALTER is in time to fix the view.
Sequential execution works because it makes full re-prepare of the procedure including views re-parsing, when in parallel execution one view is used as it was stored and other is parsed (and counter set wrongly).
Oleksandr Byelkin
added a comment - Sequential execution works because it makes full re-prepare of the procedure including views re-parsing, when in parallel execution one view is used as it was stored and other is parsed (and counter set wrongly).
MDEV-8087: Server crashed in Time_and_counter_tracker::incr_loops
Problem:
Procedure which uses stack of views first executed without most deep view. It fails but one view cached.
Then simultaniusely create the view we lack and execute the procedure.
In the beginning of procedure execution views are not yet changes so procedure used as it was cached.
But by the time we are trying to use most deep view it is already created.
The problem with the view is that thd->select_number (first view was not parsed) so second view will get the same number.
The fix is in keeping the thd->select_number correct even if we use cached views.
In the proposed solution (to keep it simple) counter can be bigger then should but it should not create problem because numbers are still unique and situation is very rare.
—
Oleksandr Byelkin
added a comment - revision-id: a3c33026bc1f3a8efe3a165661786e7bc08bea34 (mariadb-10.1.6-108-ga3c3302)
parent(s): 20291639994beaa070b7228eafa7be31eb1d7ff8
committer: Oleksandr Byelkin
timestamp: 2015-09-22 21:59:18 +0200
message:
MDEV-8087 : Server crashed in Time_and_counter_tracker::incr_loops
Problem:
Procedure which uses stack of views first executed without most deep view. It fails but one view cached.
Then simultaniusely create the view we lack and execute the procedure.
In the beginning of procedure execution views are not yet changes so procedure used as it was cached.
But by the time we are trying to use most deep view it is already created.
The problem with the view is that thd->select_number (first view was not parsed) so second view will get the same number.
The fix is in keeping the thd->select_number correct even if we use cached views.
In the proposed solution (to keep it simple) counter can be bigger then should but it should not create problem because numbers are still unique and situation is very rare.
—
Also the question is should we allow the situation when view changes and the procedure which uses it is not invalidated.
Situation is possible if we had problem with the view before and only when creating/altering view made in the same time with the procedure call but still can possibly bring some other problems.
Oleksandr Byelkin
added a comment - Also the question is should we allow the situation when view changes and the procedure which uses it is not invalidated.
Situation is possible if we had problem with the view before and only when creating/altering view made in the same time with the procedure call but still can possibly bring some other problems.
MDEV-8087: Server crashed in Time_and_counter_tracker::incr_loops
Problem:
Procedure which uses stack of views first executed without most deep view.
It fails but one view cached (as well as whole procedure).
Then simultaniusely create the second view we lack and execute the procedure.
In the beginning of procedure execution the view is not yet created so
procedure used as it was cached (cache was not invalidated).
But by the time we are trying to use most deep view it is already created.
The problem with the view is that thd->select_number (first view was not parsed) so second view will get the same number.
The fix is in keeping the thd->select_number correct even if we use cached views.
In the proposed solution (to keep it simple) counter can be bigger then should but it should not create problem because numbers are still unique and situation is very rare.
—
Oleksandr Byelkin
added a comment - fix with test suite:
revision-id: ccb6b0dd1fb603673dbcfe7d1763bd3fe5d3bcaf (mariadb-10.1.6-108-gccb6b0d)
parent(s): 20291639994beaa070b7228eafa7be31eb1d7ff8
committer: Oleksandr Byelkin
timestamp: 2015-10-06 19:54:56 +0200
message:
MDEV-8087 : Server crashed in Time_and_counter_tracker::incr_loops
Problem:
Procedure which uses stack of views first executed without most deep view.
It fails but one view cached (as well as whole procedure).
Then simultaniusely create the second view we lack and execute the procedure.
In the beginning of procedure execution the view is not yet created so
procedure used as it was cached (cache was not invalidated).
But by the time we are trying to use most deep view it is already created.
The problem with the view is that thd->select_number (first view was not parsed) so second view will get the same number.
The fix is in keeping the thd->select_number correct even if we use cached views.
In the proposed solution (to keep it simple) counter can be bigger then should but it should not create problem because numbers are still unique and situation is very rare.
—
Please re-assign if needed.