[MXS-5206] Readwritesplit does not drop connections to severely lagging servers - Jira

XML

Word

Printable

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Minor
Resolution: Fixed
Affects Version/s: 23.08.4
Fix Version/s: 23.08.7, 24.02.3
Component/s: readwritesplit
Labels:
- monitoring
- readwritesplit
Environment:
Ubuntu 22.04

Description

When a server is lagging behind and readwritesplit is configured with max_replication_lag, it will stop routing read queries to it but it will leave the connections open and it will continue to route session commands to them. This is done in the hopes that the replication lag will eventually subside and that the connections can be used again.

If the server is lagging behind by a lot (e.g. by several hours or even days), keeping the connection open is somewhat wasteful and, if nothing else, slightly misleading as it implies that it might be used for routing. A better alternative to this would be to discard connections to servers that are lagging behind by some amount.

Original description:

I have configured the following parameters in my Read-Write-Service:

max_replication_lag = 30000ms

causal_reads = global

causal_reads_timeout = 10000ms

I notice if a replica server has been down for quite a long time, that on startup, the MariaDB-Monitor service must have a delay in reporting the actual replication lag, which results in the Read-Write-Service routing connections to the lagging replica..

The replica in question was ~140,000s behind, and I clearly don't want that to be added as a suitable server to connect to.

Is there:
1) A way to delay the initial routing until a valid replication lag has been identified?
2) A way to kill connections that are part of a lagging server automatically, once they've already snuck through?

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

read-write-stats.png
22 kB
2024-08-29 21:32
MaxScale Replication Lag connections.png
91 kB
2024-08-19 23:39

Activity

People

Assignee:: markus makela

Reporter:: Richard Lee

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 2024-08-19 23:39

Updated:: 2025-01-02 08:05

Resolved:: 2024-09-02 06:20

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.