[MDEV-16091] Seconds_Behind_Master spikes to millions of seconds - Jira

XML

Word

Printable

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Critical
Resolution: Fixed
Affects Version/s: 10.1.28, 10.0.38, 10.4.19
Fix Version/s: 10.2.42, 10.3.33, 10.4.23, 10.5.14, 10.6.6, 10.7.2
Component/s: Replication
Labels:
- Seconds_Behind_Master
- seconds-behind-master
Environment:
CentOS Linux release 7.4.1708 (Core)
Linux global-db 3.10.0-693.2.2.el7.x86_64 #1 SMP Tue Sep 12 22:26:13 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

Description

Hey everyone

Rarely, we have a weird issue happening. We have a master <-> master setup. The writes are going to only one of them. Every now and then, the replication lag on the master where we send writes jumps to millions of seconds (ex: 2879736). Then, after 1s, is back to 0. After all, there's no lag, as the writes and the binlog from the other master should be ignored, as this is the server where we write to.
Both servers have the same timezone and ntp running.
Replication is set with GTIDs, and parallel threads.

Sample config:
slave_net_timeout=2
gtid-strict-mode=1
gtid-domain-id=1
server_id = 101
sync_binlog = 1
binlog_format = ROW
log-slave-updates
replicate-same-server-id = 0
log_bin=/var/log/mysql/binlog/mysql-bin.log
log_bin_index=/var/log/mysql/binlog/mysql-bin.log.index
relay_log=/var/log/mysql/binlog/mysql-relay-bin
relay_log_index=/var/log/mysql/binlog/mysql-relay-bin.index
expire_logs_days=5
max_binlog_size=1G
slave_parallel_threads=10

We have a bunch of monitoring scripts looking at that value and triggering actions and alerts and we want to know what can cause this and if there's anything we can do to avoid this in the future.

Thank you

Attachments

Issue Links

is duplicated by

MDEV-25999 Unrealistic "Seconds behind master" value

Closed

relates to

MDEV-31895 Report a Replica's Time Difference with its Primary

Closed

MDEV-33327 rpl_seconds_behind_master_spike Sensitive to IO Thread Stop Position

Closed

Activity

People

Assignee:: Brandon Nesterenko

Reporter:: raul (Inactive)

Votes:: 2 Vote for this issue

Watchers:: 10 Start watching this issue

Dates

Created:: 2018-05-04 16:41

Updated:: 2025-07-07 20:03

Resolved:: 2022-01-05 14:47

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.