170505 16:17:38 [ERROR] mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Server version: 10.1.18-MariaDB-1~jessie
key_buffer_size=8589934592
read_buffer_size=2097152
max_used_connections=1419
max_threads=5002
thread_count=1023
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 28979515 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x0x7f636b4ec008
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x7f6299eef1f8 thread_stack 0x48400
*** buffer overflow detected ***: /usr/sbin/mysqld terminated
{noformat}
170505 16:17:38 [ERROR] mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Server version: 10.1.18-MariaDB-1~jessie
key_buffer_size=8589934592
read_buffer_size=2097152
max_used_connections=1419
max_threads=5002
thread_count=1023
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 28979515 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x0x7f636b4ec008
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x7f6299eef1f8 thread_stack 0x48400
*** buffer overflow detected ***: /usr/sbin/mysqld terminated
{noformat}
Judging by the error log, you have high concurrency on the server, but still – do you have any guess at all which query was executed at the time of the crash? Apparently it involves a function and geometry.
Does it keep happening, or was it a one-time event?
Elena Stepanova
added a comment - Judging by the error log, you have high concurrency on the server, but still – do you have any guess at all which query was executed at the time of the crash? Apparently it involves a function and geometry.
Does it keep happening, or was it a one-time event?
yes we use geometry functions but they are so many running we can't tell which was the one that make maria crash.
today crash occured for the second time
message was: (see attached log) mariadb.err
dgit
added a comment - yes we use geometry functions but they are so many running we can't tell which was the one that make maria crash.
today crash occured for the second time
message was: (see attached log)
mariadb.err
Normally the server is supposed to print the problematic query in the error log, but with buffer overflow crash reports are not reliable. Still, please check, maybe the other occurrence was "luckier" this way.
In a different situation I would have suggested enabling the general log for a while (until the next crash). However, in your case, when there are over a thousand of open connections at a time, and we don't even get the crashing thread ID in the error log, the general log will be useless. My hope was that your server only runs a few types of queries most of the time (which happens often enough in busy environments), and you'd be able to recognize it by general attributes – function, geometry, collection, polygon, etc. I understand that it's impossible if you have many different queries.
Elena Stepanova
added a comment - Normally the server is supposed to print the problematic query in the error log, but with buffer overflow crash reports are not reliable. Still, please check, maybe the other occurrence was "luckier" this way.
In a different situation I would have suggested enabling the general log for a while (until the next crash). However, in your case, when there are over a thousand of open connections at a time, and we don't even get the crashing thread ID in the error log, the general log will be useless. My hope was that your server only runs a few types of queries most of the time (which happens often enough in busy environments), and you'd be able to recognize it by general attributes – function, geometry, collection, polygon, etc. I understand that it's impossible if you have many different queries.
No, We did not have any luck. We suspect it might be the GeomFromText function on hi concurrent insert, in table with geometry field passing as argument POINT(lat ,lng). But not sure.
dgit
added a comment - No, We did not have any luck. We suspect it might be the GeomFromText function on hi concurrent insert, in table with geometry field passing as argument POINT(lat ,lng). But not sure.
Since we don't know enough about the initial problem, let's at least fix the test case above in scope of this bug report, maybe by luck it will also fix the reported failure.
Though the initial problem is not related to this. I'd really like to be able to reproduce it somehow.
Is it possible to see the general log or install the audit plugin?
Alexey Botchkov
added a comment - Issue with the ST_RELATE found by Elena is fixed now
http://lists.askmonty.org/pipermail/commits/2017-October/011536.html
Though the initial problem is not related to this. I'd really like to be able to reproduce it somehow.
Is it possible to see the general log or install the audit plugin?
dgit, i'm interested to take a look at the patch.
Can you please send it?
Or just explain a bit how it validates the point?
Alexey Botchkov
added a comment - dgit, i'm interested to take a look at the patch.
Can you please send it?
Or just explain a bit how it validates the point?
From the attached log:
*** buffer overflow detected ***: /usr/sbin/mysqld terminated
======= Backtrace: =========
/lib/x86_64-linux-gnu/libc.so.6(+0x731ff)[0x7f65cbbef1ff]
/lib/x86_64-linux-gnu/libc.so.6(__fortify_fail+0x37)[0x7f65cbc724c7]
/lib/x86_64-linux-gnu/libc.so.6(+0xf46e0)[0x7f65cbc706e0]
/lib/x86_64-linux-gnu/libc.so.6(+0xf6437)[0x7f65cbc72437]
/usr/sbin/mysqld(my_addr_resolve+0x48)[0x7f65ce96acd8]
/usr/sbin/mysqld(my_print_stacktrace+0x1d2)[0x7f65ce957462]
/usr/sbin/mysqld(handle_fatal_signal+0x2d5)[0x7f65ce493785]
/lib/x86_64-linux-gnu/libpthread.so.0(+0xf8d0)[0x7f65cdabd8d0]
/usr/sbin/mysqld(_ZN10Gcalc_heap14new_point_infoEddj+0x1b)[0x7f65ce5d347b]
/usr/sbin/mysqld(_ZN23Gcalc_shape_transporter13int_add_pointEjdd+0x1f)[0x7f65ce5d372f]
/usr/sbin/mysqld(_ZNK11Gis_polygon12store_shapesEP23Gcalc_shape_transporter+0x240)[0x7f65ce59d0e0]
/usr/sbin/mysqld(_ZNK23Gis_geometry_collection12store_shapesEP23Gcalc_shape_transporter+0xaa)[0x7f65ce59f14a]
/usr/sbin/mysqld(_ZN29Item_func_spatial_precise_rel7val_intEv+0x5ec)[0x7f65ce4ffc8c]
/usr/sbin/mysqld(_ZN4Item13save_in_fieldEP5Fieldb+0x15c)[0x7f65ce4a2c7c]
/usr/sbin/mysqld(_Z12sp_eval_exprP3THDP5FieldPP4Item+0x72)[0x7f65ce593292]
/usr/sbin/mysqld(_ZN16sp_instr_freturn9exec_coreEP3THDPj+0x49)[0x7f65ce591009]
/usr/sbin/mysqld(_ZN13sp_lex_keeper23reset_lex_and_exec_coreEP3THDPjbP8sp_instr+0x2fa)[0x7f65ce59840a]
/usr/sbin/mysqld(_ZN7sp_head7executeEP3THDb+0x76e)[0x7f65ce5945ee]
/usr/sbin/mysqld(_ZN7sp_head16execute_functionEP3THDPP4ItemjP5Field+0x54c)[0x7f65ce5951cc]
/usr/sbin/mysqld(_ZN12Item_func_sp12execute_implEP3THD+0x109)[0x7f65ce4f7959]
/usr/sbin/mysqld(_ZN12Item_func_sp7executeEv+0x2c)[0x7f65ce4f7a3c]
/usr/sbin/mysqld(_ZN12Item_func_sp7val_intEv+0x11)[0x7f65ce4fbba1]
/usr/sbin/mysqld(_ZN22Item_func_set_user_var5checkEb+0xbb)[0x7f65ce4f590b]
/usr/sbin/mysqld(_ZN12set_var_user5checkEP3THD+0x2c)[0x7f65ce28824c]
/usr/sbin/mysqld(_Z17sql_set_variablesP3THDP4ListI12set_var_baseEb+0x39)[0x7f65ce28a149]
/usr/sbin/mysqld(_Z21mysql_execute_commandP3THD+0x65d4)[0x7f65ce30db14]
/usr/sbin/mysqld(_Z11mysql_parseP3THDPcjP12Parser_state+0x285)[0x7f65ce3108b5]
/usr/sbin/mysqld(_Z16dispatch_command19enum_server_commandP3THDPcj+0x1c89)[0x7f65ce3133e9]
/usr/sbin/mysqld(_Z10do_commandP3THD+0x16e)[0x7f65ce313b2e]
/usr/sbin/mysqld(_Z24do_handle_one_connectionP3THD+0x182)[0x7f65ce3de022]
/usr/sbin/mysqld(handle_one_connection+0x40)[0x7f65ce3de1e0]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x80a4)[0x7f65cdab60a4]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x6d)[0x7f65cbc6204d]
======= Memory map: ========
7f612f3fd000-7f612f3fe000 ---p 00000000 00:00 0
...