[MDEV-29571] WSREP donor stuck in "donor/desynced" state Created: 2022-09-19 Updated: 2023-05-08 Resolved: 2023-05-08 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | wsrep |
| Affects Version/s: | 10.5.9 |
| Fix Version/s: | 10.5.14 |
| Type: | Bug | Priority: | Critical |
| Reporter: | cc lin | Assignee: | Alexey |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
OS: Debian GNU/Linux 10 (buster) |
||
| Attachments: |
|
| Description |
|
After donation, the donor node keeps its wsrep local state as 2 (donor/desynced) and never comes back to state 4 (synced) Related logs [Warning] WSREP: Could not find key from index |
| Comments |
| Comment by cc lin [ 2022-09-20 ] |
|
wsrep-related server variable and status MariaDB [(none)]> show variables like '%wsrep%'\G Variable_name: wsrep_osu_method Variable_name: wsrep_sr_store Variable_name: wsrep_auto_increment_control Variable_name: wsrep_causal_reads Variable_name: wsrep_certification_rules Variable_name: wsrep_certify_nonpk Variable_name: wsrep_cluster_address Variable_name: wsrep_cluster_name Variable_name: wsrep_convert_lock_to_trx Variable_name: wsrep_data_home_dir Variable_name: wsrep_dbug_option Variable_name: wsrep_debug Variable_name: wsrep_desync Variable_name: wsrep_dirty_reads Variable_name: wsrep_drupal_282555_workaround Variable_name: wsrep_forced_binlog_format Variable_name: wsrep_gtid_domain_id Variable_name: wsrep_gtid_mode Variable_name: wsrep_gtid_seq_no Variable_name: wsrep_ignore_apply_errors Variable_name: wsrep_load_data_splitting Variable_name: wsrep_log_conflicts Variable_name: wsrep_max_ws_rows Variable_name: wsrep_max_ws_size Variable_name: wsrep_mysql_replication_bundle Variable_name: wsrep_node_address Variable_name: wsrep_node_incoming_address Variable_name: wsrep_node_name Variable_name: wsrep_notify_cmd Variable_name: wsrep_on Variable_name: wsrep_patch_version Variable_name: wsrep_provider Variable_name: wsrep_provider_options Variable_name: wsrep_recover Variable_name: wsrep_reject_queries Variable_name: wsrep_replicate_myisam Variable_name: wsrep_restart_slave Variable_name: wsrep_retry_autocommit Variable_name: wsrep_slave_fk_checks Variable_name: wsrep_slave_uk_checks Variable_name: wsrep_slave_threads Variable_name: wsrep_sst_auth Variable_name: wsrep_sst_donor Variable_name: wsrep_sst_donor_rejects_queries Variable_name: wsrep_sst_method Variable_name: wsrep_sst_receive_address Variable_name: wsrep_start_position Variable_name: wsrep_strict_ddl Variable_name: wsrep_sync_wait Variable_name: wsrep_trx_fragment_size Variable_name: wsrep_trx_fragment_unit MariaDB [(none)]> show status like '%wsrep%'\G Variable_name: wsrep_local_state_uuid Variable_name: wsrep_protocol_version Variable_name: wsrep_last_committed Variable_name: wsrep_replicated Variable_name: wsrep_replicated_bytes Variable_name: wsrep_repl_keys Variable_name: wsrep_repl_keys_bytes Variable_name: wsrep_repl_data_bytes Variable_name: wsrep_repl_other_bytes Variable_name: wsrep_received Variable_name: wsrep_received_bytes Variable_name: wsrep_local_commits Variable_name: wsrep_local_cert_failures Variable_name: wsrep_local_replays Variable_name: wsrep_local_send_queue Variable_name: wsrep_local_send_queue_max Variable_name: wsrep_local_send_queue_min Variable_name: wsrep_local_send_queue_avg Variable_name: wsrep_local_recv_queue Variable_name: wsrep_local_recv_queue_max Variable_name: wsrep_local_recv_queue_min Variable_name: wsrep_local_recv_queue_avg Variable_name: wsrep_local_cached_downto Variable_name: wsrep_flow_control_paused_ns Variable_name: wsrep_flow_control_paused Variable_name: wsrep_flow_control_sent Variable_name: wsrep_flow_control_recv Variable_name: wsrep_flow_control_active Variable_name: wsrep_flow_control_requested Variable_name: wsrep_cert_deps_distance Variable_name: wsrep_apply_oooe Variable_name: wsrep_apply_oool Variable_name: wsrep_apply_window Variable_name: wsrep_commit_oooe Variable_name: wsrep_commit_oool Variable_name: wsrep_commit_window Variable_name: wsrep_local_state Variable_name: wsrep_local_state_comment Variable_name: wsrep_cert_index_size Variable_name: wsrep_causal_reads Variable_name: wsrep_cert_interval Variable_name: wsrep_open_transactions Variable_name: wsrep_open_connections Variable_name: wsrep_incoming_addresses Variable_name: wsrep_cluster_weight Variable_name: wsrep_desync_count Variable_name: wsrep_evs_delayed Variable_name: wsrep_evs_evict_list Variable_name: wsrep_evs_repl_latency Variable_name: wsrep_evs_state Variable_name: wsrep_gcomm_uuid Variable_name: wsrep_gmcast_segment Variable_name: wsrep_applier_thread_count Variable_name: wsrep_cluster_capabilities Variable_name: wsrep_cluster_conf_id Variable_name: wsrep_cluster_size Variable_name: wsrep_cluster_state_uuid Variable_name: wsrep_cluster_status Variable_name: wsrep_connected Variable_name: wsrep_local_bf_aborts Variable_name: wsrep_local_index Variable_name: wsrep_provider_capabilities Variable_name: wsrep_provider_name Variable_name: wsrep_provider_vendor Variable_name: wsrep_provider_version Variable_name: wsrep_ready Variable_name: wsrep_rollbacker_thread_count Variable_name: wsrep_thread_count |
| Comment by Jan Lindström (Inactive) [ 2023-01-02 ] |
|
cclin Can you provide full error log? |
| Comment by Juan [ 2023-01-04 ] |
|
Hi jplindst - here's details running on MariaDB 10.5.12 on Rocky. I've attached logs for two nodes, MDB-10-5-12-G-104, the joiner, and MDB-10-5-12-G-101, the donor. At Wed Jan 4 02:42:13 UTC 2023 I restart MDB-10-5-12-G-104 after deleting the datadir, forcing an SST. MDB-10-5-12-G-101 desyncs to provide the SST and stays desynced until I restart MDB-10-5-12-G-104 once again at Wed Jan 4 02:42:57 UTC 2023 Attached are MDB-10-5-12-G-101-202301040240-state.log, a log watching the node state on the donor during this time-period, along with the complete logs for the donor, MDB-10-5-12-G-101-202301040245-mariadb-error-logs.tgz, and for the joiner, MDB-10-5-12-G-104-202301040245-mariadb-error-logs.tgz |
| Comment by Stephan Vos [ 2023-01-05 ] |
|
I can confirm the same happened with us recently on MariaDB 10.5.17 on Ubuntu 18.04 |
| Comment by Alexey [ 2023-05-08 ] |
|
Appears to be a duplicate of https://jira.mariadb.org/browse/MDEV-27459 |