[MXS-4472] Secondary monitor cannot get locks after Primary monitor power failed Created: 2023-01-09 Updated: 2023-02-14 Resolved: 2023-02-14 |
|
| Status: | Closed |
| Project: | MariaDB MaxScale |
| Component/s: | Monitor |
| Affects Version/s: | 22.08.3 |
| Fix Version/s: | 6.4.5 |
| Type: | New Feature | Priority: | Major |
| Reporter: | Maetee | Assignee: | Unassigned |
| Resolution: | Not a Bug | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Two MaxScale instances with cooperative monitoring. Three MariaDB backends. |
||
| Description |
|
After either a power failure on the primary MaxScale node or its disconnection from the network, the locks on the backend servers still exist. The secondary MaxScale node cannot acquire the locks and cannot perform cluster operations such as automatic failover or manual switchover.
This does not happen when the primary MaxScale is shut down gracefully. I think setting a small wait_timeout in the monitor session should help remove the stale backend locks. |
| Comments |
| Comment by markus makela [ 2023-01-12 ] | ||||||||||||
|
If there are stale locks, it means the client connections to them are still open. A lower wait_timeout seems like a pretty good idea and it could be set to something like (monitor_interval + timeouts) * 2. | ||||||||||||
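As a rough illustration of the formula above, here is a minimal Python sketch of the arithmetic. The function name is hypothetical, and the comment does not say which timeouts are meant; the example assumes they are the monitor's backend connect, read and write timeouts. The numbers are illustrative values, not taken from any real configuration.

```python
def suggested_wait_timeout(monitor_interval_s, timeouts_s):
    """Return (monitor_interval + timeouts) * 2, in seconds.

    monitor_interval_s: monitor interval in seconds.
    timeouts_s: iterable of timeout values in seconds (assumed here to be
                the monitor's backend connect/read/write timeouts).
    """
    return (monitor_interval_s + sum(timeouts_s)) * 2


# Hypothetical example values: a 2 s monitor interval and three 3 s timeouts.
example = suggested_wait_timeout(2, [3, 3, 3])
print(example)  # (2 + 9) * 2 = 22
```

With these example values an idle monitor connection, and the locks it holds, would be dropped by the server after 22 seconds.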
| Comment by Maetee [ 2023-02-02 ] | ||||||||||||
|
I tried setting wait_timeout to a hard-coded 30 seconds.
| ||||||||||||
| Comment by Esa Korhonen [ 2023-02-02 ] | ||||||||||||
|
Check the "Releasing locks" section in the monitor documentation. | ||||||||||||
| Comment by Esa Korhonen [ 2023-02-14 ] | ||||||||||||
|
Handled with TCP connection timeout settings. |
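The comment above does not name the settings. Assuming it refers to TCP keepalive on the server side of the monitor connection, the sketch below estimates how long a lock held by a vanished MaxScale could survive before the server notices the dead peer. The function and parameter names are hypothetical, and the formula is the usual keepalive approximation (idle time plus all unanswered probes), not something stated in this issue.

```python
def dead_peer_detection_seconds(keepalive_time_s, keepalive_interval_s, keepalive_probes):
    """Approximate time for TCP keepalive to declare an idle peer dead.

    The connection must first be idle for keepalive_time_s; after that,
    keepalive_probes probes are sent keepalive_interval_s apart, and the
    connection is dropped once all of them go unanswered.
    """
    return keepalive_time_s + keepalive_interval_s * keepalive_probes


# Hypothetical example values: 10 s idle time, probes every 2 s, 3 probes.
stale_lock_window = dead_peer_detection_seconds(10, 2, 3)
print(stale_lock_window)  # 10 + 2 * 3 = 16
```

Under these assumptions the secondary MaxScale would be able to acquire the locks roughly 16 seconds after the primary disappears, since the server closes the dead connection and releases its locks at that point.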