[MDEV-9243] Asynchronous replication between two galera clusters. Moment the slave starts- the slave machine crashes Created: 2015-12-07 Updated: 2015-12-10 Resolved: 2015-12-10 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera |
| Affects Version/s: | 10.1.9 |
| Fix Version/s: | 10.1.9 |
| Type: | Bug | Priority: | Major |
| Reporter: | trupti mali | Assignee: | Michael Widenius |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Debian Jessie. |
||
| Description |
|
Hi, I upgraded to 10.1.9.(Well I didnt really do upgrading process on existing setup yet. But I was doing a poc of this setup before taking upgrade decision). 2015-12-07 6:45:53 139917150004992 [Note] Slave SQL thread initialized, starting replication in log 'mariadb-bin.000010' at position 2812, relay log './mysqld-relay-bin.000001' position: 4 To report this bug, see http://kb.askmonty.org/en/reporting-bugs We will try our best to scrape up some info that will hopefully help Server version: 10.1.9-MariaDB-1~jessie-log Thread pointer: 0x0x7f40ab71e008 Trying to get some variables. Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=on,exists_to_in=on |
| Comments |
| Comment by trupti mali [ 2015-12-07 ] |
|
Hi , |
| Comment by Michael Widenius [ 2015-12-10 ] |
|
Thanks for all the information, it helped me locate the bug quickly. |
| Comment by Michael Widenius [ 2015-12-10 ] |
|
Fixed by initialize rpl_write_set properly in rpl_slave_state::record_gtid(). Bug was introduced by me in 10.1.9 as part of fixing a bug where we write more columns than needed to the binary log in row based replication. I didn't notice this as earlier as I didn't have the galera wsrep library installed on my machine and thus the galera test was not run as part of my testing the original fix. Have now installed the library on my machine so there is less likely that things like this should happen again. |