[MDEV-13293] Donor may show "Assertion `rcode != 0 || state_id.seqno >= 0' failed" when SST failed
Created: 2017-07-11  Updated: 2023-10-10

Status: Stalled
Project: MariaDB Server
Component/s: Galera SST
Affects Version/s: 10.2.6
Fix Version/s: 10.2

Type: Bug
Priority: Major
Reporter: Andrii Nikitin (Inactive)
Assignee: Julius Goryavsky
Resolution: Unresolved
Votes: 0
Labels: None


 Description   

The following assertion failure may occur on the donor when SST encounters an error:

++ rsync --owner --group --perms --links --specials --ignore-times --inplace --dirs --delete --quiet --whole-file -f '- /lost+found' -f '- /.fseventsd' -f '- /.Trashes' -f '+ /wsrep_sst_binlog.tar' -f '+ /ib_lru_dump' -f '+ /ibdata*' -f '+ /*/' -f '- /*' /mariadb-environs/m1-10.2.6/dt// rsync://172.17.0.2:4444/rsync_sst
rsync: open "ibdata1" (in rsync_sst) failed: Permission denied (13)
rsync error: some files/attrs were not transferred (see previous errors) (code 23) at main.c(1052) [sender=3.0.9]
+ RC=23
+ '[' 23 -ne 0 ']'
+ wsrep_log_error 'rsync returned code 23:'
+ wsrep_log '[ERROR] rsync returned code 23:'
++ date '+%Y%m%d %H:%M:%S.%N'
++ cut -b -21
+ local readonly 'tst=20170711 11:09:14.582'
+ echo 'WSREP_SST: [ERROR] rsync returned code 23: (20170711 11:09:14.582)'
WSREP_SST: [ERROR] rsync returned code 23: (20170711 11:09:14.582)
+ case $RC in
+ RC=255
+ exit 255
2017-07-11 11:09:14 139629336434432 [ERROR] WSREP: Failed to read from: wsrep_sst_rsync --role 'donor' --address '172.17.0.2:4444/rsync_sst' --socket '/mariadb-environs/m1-10.2.6/dt/my.sock' --datadir '/mariadb-environs/m1-10.2.6/dt/'  --defaults-file '/mariadb-environs/m1-10.2.6/my.cnf'    '' --gtid '5bb618b2-6629-11e7-93d8-63e301c50b5d:0' --gtid-domain-id '0'
2017-07-11 11:09:14 139629336434432 [Note] WSREP: resuming provider at 5
2017-07-11 11:09:14 139629336434432 [Note] WSREP: Provider resumed.
mysqld: galera/src/replicator_smm.cpp:1194: virtual wsrep_status_t galera::ReplicatorSMM::sst_sent(const wsrep_gtid_t&, int): Assertion `rcode != 0 || state_id.seqno >= 0' failed.
170711 11:09:14 [ERROR] mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
 
To report this bug, see https://mariadb.com/kb/en/reporting-bugs
 
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
 
Server version: 10.2.6-MariaDB
key_buffer_size=134217728
read_buffer_size=131072
max_used_connections=1
max_threads=153
thread_count=10
It is possible that mysqld could use up to                                                                                                                        
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 467201 K  bytes of memory                                                                   
Hope that's ok; if not, decrease some variables in the equation.                                                                                                  
                                                                                                                                                                  
Thread pointer: 0x7efdd00009a8                                                                                                                                    
Attempting backtrace. You can use the following information to find out                                                                                           
where mysqld died. If you see no messages after this, something went                                                                                              
terribly wrong...                                                                                                                                                 
stack_bottom = 0x7efdfcff8e88 thread_stack 0x49000                                                                                                                
/mariadb-environs/_depot/m-tar/10.2.6/bin/mysqld(my_print_stacktrace+0x2e)[0xddc65e]                                                                              
/mariadb-environs/_depot/m-tar/10.2.6/bin/mysqld(handle_fatal_signal+0x444)[0x7d3374]                                                                             
/lib64/libpthread.so.0(+0xf370)[0x7efe65a49370]                                                                                                                   
/lib64/libc.so.6(gsignal+0x37)[0x7efe6485e1d7]                                                                                                                    
/lib64/libc.so.6(abort+0x148)[0x7efe6485f8c8]                                                                                                                     
/lib64/libc.so.6(+0x2e146)[0x7efe64857146]                                                                                                                        
/lib64/libc.so.6(+0x2e1f2)[0x7efe648571f2]                                                                                                                        
/mariadb-environs/m1-10.2.6/../_depot/m-tar/10.2.6/lib/libgalera_smm.so(_ZN6galera13ReplicatorSMM8sst_sentERK10wsrep_gtidi+0xb9)[0x7efe628572f1]                  
/mariadb-environs/m1-10.2.6/../_depot/m-tar/10.2.6/lib/libgalera_smm.so(galera_sst_sent+0xe2)[0x7efe628720b0]                                                     
/mariadb-environs/_depot/m-tar/10.2.6/bin/mysqld[0x755592]                                                                                                        
/lib64/libpthread.so.0(+0x7dc5)[0x7efe65a41dc5]                                                                                                                   
/lib64/libc.so.6(clone+0x6d)[0x7efe6492076d]                                                                                                                      
                                                                                                                                                                  
Trying to get some variables.                                                                                                                                     
Some pointers may be invalid and cause the dump to abort.                                                                                                         
Query (0x7efdfcff8540): SET GLOBAL innodb_disallow_writes=0                                                                                                       
Connection ID (thread ID): 0                                                                                                                                      
Status: NOT_KILLED                                                                                                                                                
                                                                                                                                                                  
Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=on,exists_to_in=on,orderby_uses_equalities=on,condition_pushdown_for_derived=on
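
The assertion quoted in the crash output, `rcode != 0 || state_id.seqno >= 0`, can only fail when both halves are false, that is, when `sst_sent` receives `rcode == 0` together with a negative seqno. The short Python sketch below just enumerates that condition to show which input combination trips it. It is an illustration only: the function name `sst_sent_precondition`, the sample values, and the use of `-1` as a stand-in for an undefined seqno are assumptions, not taken from the Galera or MariaDB sources.

```python
def sst_sent_precondition(rcode: int, seqno: int) -> bool:
    """Mirror of the asserted condition: rcode != 0 || state_id.seqno >= 0."""
    return rcode != 0 or seqno >= 0


# Hypothetical (rcode, seqno) pairs; -1 stands in for an undefined seqno.
cases = [
    (0, 5),      # success reported with a valid seqno
    (-125, -1),  # failure reported, seqno undefined
    (-125, 5),   # failure reported, seqno still set
    (0, -1),     # success reported but seqno undefined
]

# Collect the pairs for which the asserted condition is false.
violations = [case for case in cases if not sst_sent_precondition(*case)]
print(violations)  # -> [(0, -1)]
```

Under these assumptions, the only violating pair is "success with no valid seqno", which suggests checking what `rcode` and seqno the donor passes to `sst_sent` after the SST script exits with 255.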

