2018-04-03 17:36:11 140137779095808 [Note] WSREP: Read nil XID from storage engines, skipping position init
2018-04-03 17:36:11 140137779095808 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib64/galera/libgalera_smm.so'
2018-04-03 17:36:11 140137779095808 [Note] WSREP: wsrep_load(): Galera 25.3.23(r3789) by Codership Oy loaded successfully.
2018-04-03 17:36:11 140137779095808 [Note] WSREP: CRC-32C: using hardware acceleration.
2018-04-03 17:36:11 140137779095808 [Warning] WSREP: Could not open state file for reading: '/mnt/data//grastate.dat'
2018-04-03 17:36:11 140137779095808 [Note] WSREP: Found saved state: 00000000-0000-0000-0000-000000000000:-1, safe_to_bootstrap: 1
2018-04-03 17:36:11 140137779095808 [Note] WSREP: Passing config to GCS: base_dir = /mnt/data/; base_host = 10.134.19.4; base_port = 4567; cert.log_conflicts = no; debug = no; evs.auto_evict = 0; evs.delay_margin = PT1S; evs.delayed_keep_period = PT30S; evs.inactive_check_period = PT0.5S; evs.inactive_timeout = PT15S; evs.join_retrans_period = PT1S; evs.max_install_timeouts = 3; evs.send_window = 4; evs.stats_report_period = PT1M; evs.suspect_timeout = PT5S; evs.user_send_window = 2; evs.view_forget_timeout = PT24H; gcache.dir = /mnt/data/; gcache.keep_pages_size = 0; gcache.mem_size = 0; gcache.name = /mnt/data//galera.cache; gcache.page_size = 128M; gcache.recover = no; gcache.size = 128M; gcomm.thread_prio = ; gcs.fc_debug = 0; gcs.fc_factor = 1.0; gcs.fc_limit = 16; gcs.fc_master_slave = no; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = no; gmcast.segment = 0; gmcast.version = 0; pc.announce_timeout = PT3S; pc.checksum = false; pc.ignore_quorum = f
2018-04-03 17:36:11 140137779095808 [Note] WSREP: GCache history reset: 00000000-0000-0000-0000-000000000000:0 -> 00000000-0000-0000-0000-000000000000:-1
2018-04-03 17:36:11 140137779095808 [Note] WSREP: Assign initial position for certification: -1, protocol version: -1
2018-04-03 17:36:11 140137779095808 [Note] WSREP: wsrep_sst_grab()
2018-04-03 17:36:11 140137779095808 [Note] WSREP: Start replication
2018-04-03 17:36:11 140137779095808 [Note] WSREP: Setting initial position to 00000000-0000-0000-0000-000000000000:-1
2018-04-03 17:36:11 140137779095808 [Note] WSREP: protonet asio version 0
2018-04-03 17:36:11 140137779095808 [Note] WSREP: Using CRC-32C for message checksums.
2018-04-03 17:36:11 140137779095808 [Note] WSREP: backend: asio
2018-04-03 17:36:11 140137779095808 [Note] WSREP: gcomm thread scheduling priority set to other:0
2018-04-03 17:36:11 140137779095808 [Warning] WSREP: access file(/mnt/data//gvwstate.dat) failed(No such file or directory)
2018-04-03 17:36:11 140137779095808 [Note] WSREP: restore pc from disk failed
2018-04-03 17:36:11 140137779095808 [Note] WSREP: GMCast version 0
2018-04-03 17:36:11 140137779095808 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') listening at tcp://0.0.0.0:4567
2018-04-03 17:36:11 140137779095808 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') multicast: , ttl: 1
2018-04-03 17:36:11 140137779095808 [Note] WSREP: EVS version 0
2018-04-03 17:36:11 140137779095808 [Note] WSREP: gcomm: connecting to group 'tgs_cluster_p', peer '10.134.18.4:,10.134.18.5:,10.134.19.4:'
2018-04-03 17:36:11 140137779095808 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') connection established to bc874889 tcp://10.134.19.4:4567
2018-04-03 17:36:11 140137779095808 [Warning] WSREP: (bc874889, 'tcp://0.0.0.0:4567') address 'tcp://10.134.19.4:4567' points to own listening address, blacklisting
2018-04-03 17:36:11 140137779095808 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') connection established to 23c3936e tcp://10.134.18.4:4567
2018-04-03 17:36:11 140137779095808 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') turning message relay requesting on, nonlive peers:
2018-04-03 17:36:11 140137779095808 [Note] WSREP: declaring 23c3936e at tcp://10.134.18.4:4567 stable
2018-04-03 17:36:12 140137779095808 [Note] WSREP: Node 23c3936e state prim
2018-04-03 17:36:12 140137779095808 [Note] WSREP: view(view_id(PRIM,23c3936e,114) memb { 23c3936e,0 bc874889,0 } joined { } left { } partitioned { })
2018-04-03 17:36:12 140137779095808 [Note] WSREP: save pc into disk
2018-04-03 17:36:12 140137779095808 [Note] WSREP: discarding pending addr without UUID: tcp://10.134.18.5:4567
2018-04-03 17:36:12 140137779095808 [Note] WSREP: gcomm: connected
2018-04-03 17:36:12 140137779095808 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
2018-04-03 17:36:12 140137779095808 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
2018-04-03 17:36:12 140137779095808 [Note] WSREP: Opened channel 'tgs_cluster_p'
2018-04-03 17:36:12 140137476384512 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 1, memb_num = 2
2018-04-03 17:36:12 140137476384512 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
2018-04-03 17:36:12 140137779095808 [Note] WSREP: Waiting for SST to complete.
2018-04-03 17:36:12 140137476384512 [Note] WSREP: STATE EXCHANGE: sent state msg: bca6148c-3754-11e8-931e-a3b98de92ac2
2018-04-03 17:36:12 140137476384512 [Note] WSREP: STATE EXCHANGE: got state msg: bca6148c-3754-11e8-931e-a3b98de92ac2 from 0 (azabnl-id03)
2018-04-03 17:36:12 140137476384512 [Note] WSREP: STATE EXCHANGE: got state msg: bca6148c-3754-11e8-931e-a3b98de92ac2 from 1 (azabir-id01)
2018-04-03 17:36:12 140137476384512 [Note] WSREP: Quorum results: version = 4, component = PRIMARY, conf_id = 113, members = 1/2 (joined/total), act_id = 204294210, last_appl. = -1, protocols = 0/7/3 (gcs/repl/appl), group UUID = 53540047-107d-11e6-8b2a-9a31eea4d5df
2018-04-03 17:36:12 140137476384512 [Note] WSREP: Flow-control interval: [23, 23]
2018-04-03 17:36:12 140137476384512 [Note] WSREP: Trying to continue unpaused monitor
2018-04-03 17:36:12 140137476384512 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 204294210)
2018-04-03 17:36:12 140137778776832 [Note] WSREP: State transfer required: Group state: 53540047-107d-11e6-8b2a-9a31eea4d5df:204294210 Local state: 00000000-0000-0000-0000-000000000000:-1
2018-04-03 17:36:12 140137778776832 [Note] WSREP: New cluster view: global state: 53540047-107d-11e6-8b2a-9a31eea4d5df:204294210, view# 114: Primary, number of nodes: 2, my index: 1, protocol version 3
2018-04-03 17:36:12 140137778776832 [Warning] WSREP: Gap in state sequence. Need state transfer.
2018-04-03 17:36:12 140137447028480 [Note] WSREP: Running: 'wsrep_sst_rsync --role 'joiner' --address '10.134.19.4' --datadir '/mnt/data/' --parent '3671' '' '
2018-04-03 17:36:12 140137778776832 [Note] WSREP: Prepared SST request: rsync|10.134.19.4:4444/rsync_sst
2018-04-03 17:36:12 140137778776832 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
2018-04-03 17:36:12 140137778776832 [Note] WSREP: REPL Protocols: 7 (3, 2)
2018-04-03 17:36:12 140137778776832 [Note] WSREP: Assign initial position for certification: 204294210, protocol version: 3
2018-04-03 17:36:12 140137534711552 [Note] WSREP: Service thread queue flushed.
2018-04-03 17:36:12 140137778776832 [Warning] WSREP: Failed to prepare for incremental state transfer: Local state UUID (00000000-0000-0000-0000-000000000000) does not match group state UUID (53540047-107d-11e6-8b2a-9a31eea4d5df): 1 (Operation not permitted) at galera/src/replicator_str.cpp:prepare_for_IST():482. IST will be unavailable.
2018-04-03 17:36:12 140137476384512 [Note] WSREP: Member 1.0 (azabir-id01) requested state transfer from '*any*'. Selected 0.0 (azabnl-id03)(SYNCED) as donor.
2018-04-03 17:36:12 140137476384512 [Note] WSREP: Shifting PRIMARY -> JOINER (TO: 204294212)
2018-04-03 17:36:12 140137778776832 [Note] WSREP: Requesting state transfer: success, donor: 0
2018-04-03 17:36:12 140137778776832 [Note] WSREP: GCache history reset: 00000000-0000-0000-0000-000000000000:0 -> 53540047-107d-11e6-8b2a-9a31eea4d5df:204294210
2018-04-03 17:36:14 140137484777216 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') connection to peer bc874889 with addr tcp://10.134.19.4:4567 timed out, no messages seen in PT3S
2018-04-03 17:36:14 140137484777216 [Note] WSREP: (bc874889, 'tcp://0.0.0.0:4567') turning message relay requesting off
Terminated
WSREP_SST: [INFO] Joiner cleanup. rsync PID: 3715 (20180403 17:37:41.510)
WSREP_SST: [INFO] Joiner cleanup done. (20180403 17:37:42.016)
2018-04-03 17:37:42 140137447028480 [ERROR] WSREP: Process completed with error: wsrep_sst_rsync --role 'joiner' --address '10.134.19.4' --datadir '/mnt/data/' --parent '3671' '' : 3 (No such process)
2018-04-03 17:37:42 140137447028480 [ERROR] WSREP: Failed to read uuid:seqno and wsrep_gtid_domain_id from joiner script.
2018-04-03 17:37:42 140137779095808 [ERROR] WSREP: SST failed: 3 (No such process)
2018-04-03 17:37:42 140137779095808 [ERROR] Aborting
2018-04-03 17:37:42 140137476384512 [Warning] WSREP: 0.0 (azabnl-id03): State transfer to 1.0 (azabir-id01) failed: -255 (Unknown error 255)
2018-04-03 17:37:42 140137476384512 [ERROR] WSREP: gcs/src/gcs_group.cpp:gcs_group_handle_join_msg():737: Will never receive state. Need to abort.
180404 05:55:01 mysqld_safe Starting mysqld daemon with databases from /mnt/data
180404 05:55:01 mysqld_safe WSREP: Running position recovery with --disable-log-error --pid-file='/mnt/data/AZABIR-ID01.azure.cloud.corp.local-recover.pid'
180404 05:55:01 mysqld_safe WSREP: Failed to recover position: '2018-04-04 5:55:01 140152281286912 [Note] //sbin/mysqld (mysqld 10.1.31-MariaDB) starting as process 24758 ...
2018-04-04 5:55:01 140152281286912 [ERROR] Can't find messagefile '/share/mysql/errmsg.sys'
2018-04-04 5:55:01 140152281286912 [ERROR] Aborting'
180404 05:59:55 mysqld_safe Starting mysqld daemon with databases from /mnt/data
180404 05:59:55 mysqld_safe WSREP: Running position recovery with --disable-log-error --pid-file='/mnt/data/AZABIR-ID01.azure.cloud.corp.local-recover.pid'
180404 05:59:55 mysqld_safe WSREP: Failed to recover position: '2018-04-04 5:59:55 139933851752704 [Note] //sbin/mysqld (mysqld 10.1.31-MariaDB) starting as process 25119 ...
2018-04-04 5:59:55 139933851752704 [ERROR] Can't find messagefile '/share/mysql/errmsg.sys'
2018-04-04 5:59:55 139933851752704 [ERROR] Aborting'
180404 06:01:37 mysqld_safe Starting mysqld daemon with databases from /mnt/data
180404 06:01:37 mysqld_safe WSREP: Running position recovery with --disable-log-error --pid-file='/mnt/data/AZABIR-ID01.azure.cloud.corp.local-recover.pid'
180404 06:01:38 mysqld_safe WSREP: Failed to recover position: '2018-04-04 6:01:38 140400691869952 [Note] //sbin/mysqld (mysqld 10.1.31-MariaDB) starting as process 25422 ...
2018-04-04 6:01:38 140400691869952 [Warning] An old style --language or -lc-message-dir value with language specific part detected: /share/mysql/
2018-04-04 6:01:38 140400691869952 [Warning] Use --lc-messages-dir without language specific part instead.
2018-04-04 6:01:38 140400691869952 [Note] Loaded 'file_key_management.so' with offset 0x7fb1861fb000
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Using mutexes to ref count buffer pool pages
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: The InnoDB memory heap is disabled
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Mutexes and rw_locks use GCC atomic builtins
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: GCC builtin __atomic_thread_fence() is used for memory barrier
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Compressed tables use zlib 1.2.7
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Using Linux native AIO
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Using SSE crc32 instructions
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Initializing buffer pool, size = 3.0G
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Completed initialization of buffer pool
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Highest supported file format is Barracuda.
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Starting crash recovery from checkpoint LSN=663773430322
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Restoring possible half-written data pages from the doublewrite buffer...
2018-04-04 6:01:38 140400691869952 [Note] InnoDB: Starting final batch to recover 272 pages from redo log
2018-04-04 6:01:38 140400691869952 [ERROR] InnoDB: Trying to access page number 254 in space 30854 space name DB_GRPUK_P/cache_form, which is outside the tablespace bounds. Byte offset 0, len 16384 i/o type 10.
2018-04-04 06:01:38 7fb1955d7900 InnoDB: Assertion failure in thread 140400691869952 in file ha_innodb.cc line 22015
InnoDB: We intentionally generate a memory trap.
InnoDB: Submit a detailed bug report to https://jira.mariadb.org/
InnoDB: If you get repeated assertion failures or crashes, even
InnoDB: immediately after the mysqld startup, there may be
InnoDB: corruption in the InnoDB tablespace. Please refer to
InnoDB: http://dev.mysql.com/doc/refman/5.6/en/forcing-innodb-recovery.html
InnoDB: about forcing recovery.
180404 6:01:38 [ERROR] mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
To report this bug, see https://mariadb.com/kb/en/reporting-bugs
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Server version: 10.1.31-MariaDB
key_buffer_size=134217728
read_buffer_size=131072
max_used_connections=0
max_threads=2002
thread_count=0
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 4529069 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x0 thread_stack 0x48400
mysys/stacktrace.c:268(my_print_stacktrace)[0x55df5319a1ce]
sql/signal_handler.cc:168(handle_fatal_signal)[0x55df52cbdfb5]
sigaction.c:0(__restore_rt)[0x7fb1951ed5e0]
:0(__GI_raise)[0x7fb1934c61f7]
:0(__GI_abort)[0x7fb1934c78e8]
handler/ha_innodb.cc:22015(ib_logf(ib_log_level_t, char const*, ...))[0x55df52f0b13f]
fil/fil0fil.cc:5982(fil_io(unsigned long, bool, unsigned long, unsigned long, unsigned long, unsigned long, unsigned long, void*, void*, unsigned long*, trx_t*, bool))[0x55df530bb323]
buf/buf0rea.cc:263(buf_read_page_low(dberr_t*, bool, unsigned long, unsigned long, unsigned long, unsigned long, long, unsigned long, trx_t*, bool))[0x55df5307b0a0]
buf/buf0rea.cc:1119(buf_read_recv_pages(unsigned long, unsigned long, unsigned long, unsigned long const*, unsigned long))[0x55df5307eb66]
log/log0recv.cc:1857(recv_apply_hashed_log_recs(bool))[0x55df52f5d44f]
srv/srv0start.cc:2663(innobase_start_or_create_for_mysql())[0x55df52ff076e]
handler/ha_innodb.cc:4479(innobase_init(void*))[0x55df52f0f67d]
sql/handler.cc:521(ha_initialize_handlerton(st_plugin_int*))[0x55df52cc0264]
sql/sql_plugin.cc:1409(plugin_initialize(st_mem_root*, st_plugin_int*, int*, char**, bool))[0x55df52b47e70]
sql/sql_plugin.cc:1686(plugin_init(int*, char**, int))[0x55df52b4975a]
sql/mysqld.cc:5133(init_server_components())[0x55df52a9eb88]
sql/mysqld.cc:5722(mysqld_main(int, char**))[0x55df52aa2630]
/lib64/libc.so.6(__libc_start_main+0xf5)[0x7fb1934b2c05]
//sbin/mysqld(+0x39910d)[0x55df52a9610d]
The manual page at http://dev.mysql.com/doc/mysql/en/crashing.html contains
information that should help you find out what is causing the crash.'