[MXS-4359] Give a way to use slave_selection_criteria = LEAST_BEHIND_MASTER with Galera - Jira

Details

Type: New Feature
Status: Closed (View Workflow)
Priority: Major
Resolution: Won't Do
Affects Version/s: None
Fix Version/s: N/A
Component/s: galeramon, readwritesplit
Labels:
None

Description

Looks like readwritesplit router assumes that Galera nodes are all always in sync. In reality they may have write sets pending to be applied and wsrep_local_recv_queue may be used as a measure of current "lagging" for Galera node.

With this we may be able to apply slave_selection_criteria = LEAST_BEHIND_MASTER and route to the Galera node that is more close to "master" or less loaded.

Attachments

Activity

Ascending order - Click to sort in descending order

markus makela added a comment - 2022-10-22 08:18

Would wsrep-causal-reads be an alternative to this?

On average, how large is the lag between the reading of the writeset from the cluster and the application of it? The main problem with using replication lag in the routing logic is that it's only updated by the monitor and thus is a relatively coarse measurement of lag. In the case of traditional async replication the lag might be minutes in the worst case which is used by readwritesplit to rule out severely lagging servers. For lag that's less than that you're probably better off enabling causal_reads in MaxScale to eliminate replication lag from the user's point of view.

In the case of Galera, if the "replication" lag is significantly less than the value of monitor_interva, it might make more sense to force wsrep-causal-reads to be used instead of using wsrep_local_recv_queue as a measurement of lag as it avoids the same problem that causal_reads=fast suffers from: if the servers are lagging too much, almost all of the traffic gets routed to a single node. The same limitations also apply to causal_reads=fast_global where it is only useful for very low write throughput and for workloads that are largely read-only.

markus makela added a comment - 2022-10-22 08:18 Would wsrep-causal-reads be an alternative to this? On average, how large is the lag between the reading of the writeset from the cluster and the application of it? The main problem with using replication lag in the routing logic is that it's only updated by the monitor and thus is a relatively coarse measurement of lag. In the case of traditional async replication the lag might be minutes in the worst case which is used by readwritesplit to rule out severely lagging servers. For lag that's less than that you're probably better off enabling causal_reads in MaxScale to eliminate replication lag from the user's point of view. In the case of Galera, if the "replication" lag is significantly less than the value of monitor_interva , it might make more sense to force wsrep-causal-reads to be used instead of using wsrep_local_recv_queue as a measurement of lag as it avoids the same problem that causal_reads=fast suffers from: if the servers are lagging too much, almost all of the traffic gets routed to a single node. The same limitations also apply to causal_reads=fast_global where it is only useful for very low write throughput and for workloads that are largely read-only.

markus makela added a comment - 2022-10-24 12:22

This looks like a new feature and not a generic task. I converted the type into New Task.

markus makela added a comment - 2022-10-24 12:22 This looks like a new feature and not a generic task. I converted the type into New Task.

People

Assignee:: Todd Stoffel (Inactive)

Reporter:: Valerii Kravchuk

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 2022-10-21 15:20

Updated:: 2024-10-03 15:53

Resolved:: 2023-04-04 05:17

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.

MariaDB MaxScale