[MDEV-32386] Galera node crash with signal 11 Created: 2023-10-09  Updated: 2023-11-27  Resolved: 2023-11-27

Status: Closed
Project: MariaDB Server
Component/s: Galera
Affects Version/s: 10.6.15
Fix Version/s: N/A

Type: Bug Priority: Major
Reporter: Pramod Mahto Assignee: Pramod Mahto
Resolution: Incomplete Votes: 0
Labels: None


 Description   

Galera cluster node keep on crashing with signal 11 with INSERT and DELETE queries.

 
231002 20:11:47 [ERROR] mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
 
To report this bug, see https://mariadb.com/kb/en/reporting-bugs
 
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed, 
something is definitely wrong and this may fail.
 
Server version: 10.6.15-MariaDB-log source revision: 0d16eb35bc981023ce2f4912e8ecde68ca381f4e
key_buffer_size=134217728
read_buffer_size=131072
max_used_connections=1
max_threads=1002
thread_count=10
It is possible that mysqld could use up to 
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 2337668 K  bytes of memory
Hope that's ok; if not, decrease some variables in the equation.
 
Thread pointer: 0x7f3e1c000c58
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x7f40a0154d40 thread_stack 0x49000
mysys/stacktrace.c:216(my_print_stacktrace)[0x560f80847c3e]
sql/signal_handler.cc:241(handle_fatal_signal)[0x560f80212627]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x13140)[0x7f4129ef4140]
sql/sql_class.cc:4875(thd_get_thread_id)[0x560f7ff9a341]
bits/vector.tcc:94(lock_wait_wsrep(trx_t*))[0x560f7fead455]
lock/lock0lock.cc:1854(lock_wait(que_thr_t*))[0x560f8057bdb2]
2023-10-02 20:11:47 0 [Warning] WSREP: Failed to report last committed 8980859d-7d3b-11ed-9f38-13a53b6b149d:402956260, -110 (Connection timed out)
row/row0mysql.cc:688(row_mysql_handle_errors(dberr_t*, trx_t*, que_thr_t*, trx_savept_t*))[0x560f805fa1ff]
row/row0mysql.cc:1330(row_insert_for_mysql(unsigned char const*, row_prebuilt_t*, ins_mode_t))[0x560f805fc8c6]
handler/ha_innodb.cc:7922(ha_innobase::write_row(unsigned char const*))[0x560f8053a36e]
sql/handler.cc:7639(handler::ha_write_row(unsigned char const*))[0x560f8022170f]
sql/log_event_server.cc:7374(Rows_log_event::write_row(rpl_group_info*, bool))[0x560f80335a65]
sql/log_event_server.cc:7628(Write_rows_log_event::do_exec_row(rpl_group_info*))[0x560f80335f5e]
sql/log_event_server.cc:5772(Rows_log_event::do_apply_event(rpl_group_info*))[0x560f8032b9ad]
sql/log_event.h:1500(Log_event::apply_event(rpl_group_info*))[0x560f80505c3d]
sql/wsrep_high_priority_service.cc:129(apply_events(THD*, Relay_log_info*, wsrep::const_buffer const&, wsrep::mutable_buffer&))[0x560f804ec1a0]
sql/wsrep_high_priority_service.cc:597(Wsrep_applier_service::apply_write_set(wsrep::ws_meta const&, wsrep::const_buffer const&, wsrep::mutable_buffer&))[0x560f804ec293]
src/server_state.cpp:333(apply_write_set(wsrep::server_state&, wsrep::high_priority_service&, wsrep::ws_handle const&, wsrep::ws_meta const&, wsrep::const_buffer const&))[0x560f8099f51f]
src/server_state.cpp:1131(wsrep::server_state::on_apply(wsrep::high_priority_service&, wsrep::ws_handle const&, wsrep::ws_meta const&, wsrep::const_buffer const&))[0x560f809a0245]
src/wsrep_provider_v26.cpp:507((anonymous namespace)::apply_cb(void*, wsrep_ws_handle const*, unsigned int, wsrep_buf const*, wsrep_trx_meta const*, bool*))[0x560f809afca8]
src/trx_handle.cpp:396(galera::TrxHandleSlave::apply(void*, wsrep_cb_status (*)(void*, wsrep_ws_handle const*, unsigned int, wsrep_buf const*, wsrep_trx_meta const*, bool*), wsrep_trx_meta const&, bool&))[0x7f4128ff36d1]
src/replicator_smm.cpp:512(galera::ReplicatorSMM::apply_trx(void*, galera::TrxHandleSlave&))[0x7f412900191f]
src/replicator_smm.cpp:2151(galera::ReplicatorSMM::process_trx(void*, boost::shared_ptr<galera::TrxHandleSlave> const&))[0x7f4129005761]
src/trx_handle.hpp:623(galera::TrxHandleSlave::exit_loop() const)[0x7f4129031faf]
src/gcs_action_source.cpp:29(Release::~Release())[0x7f412903298b]
src/replicator_smm.cpp:398(galera::ReplicatorSMM::async_recv(void*))[0x7f4129006780]
src/wsrep_provider.cpp:266(galera_recv)[0x7f4128fe1bea]
src/wsrep_provider_v26.cpp:859(wsrep::wsrep_provider_v26::run_applier(wsrep::high_priority_service*))[0x560f809b03de]
sql/wsrep_thd.cc:59(wsrep_replication_process(THD*, void*))[0x560f805079e2]
sql/wsrep_mysqld.cc:3633(start_wsrep_THD(void*))[0x560f804f745d]
perfschema/pfs.cc:2204(pfs_spawn_thread)[0x560f80486c5c]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x7ea7)[0x7f4129ee8ea7]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)[0x7f4129aefa2f]

Got fixed only after forced SST


Generated at Thu Feb 08 10:30:57 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.