Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
10.11.13, 10.11.14, 10.11.15, 11.4.6, 11.4.7, 11.4.8, 11.4.9
-
None
Description
We are observing random occurrences of Error 1950 after performing SST via Mariabackup. We do not see this issue on MariaDB 10.5 or 10.6, but the newer versions we have tried (10.11 and 11.4) all appear to be affected.
After SST, the cluster node starts, but it remains stuck in the following state:
Variable_name: wsrep_ready
Value: OFF
Variable_name: wsrep_cluster_status
Value: Disconnected
Variable_name: wsrep_connected
Value: OFF
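The values above are Galera status variables. A statement along these lines returns them on the affected node (a minimal sketch; any equivalent SHOW STATUS query works):

```sql
-- List the three Galera status variables that show the stuck state.
SHOW GLOBAL STATUS
WHERE Variable_name IN ('wsrep_ready', 'wsrep_cluster_status', 'wsrep_connected');
```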
The issue occurs randomly, but it seems to be more frequent on clusters with higher traffic.
Our Galera and GTID configuration follows the official documentation, and nothing was changed during the upgrade from 10.6.
Switching wsrep_sst_method to rsync avoids the issue, so it seems to be related specifically to Mariabackup-based SST.
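To rule out a configuration mismatch between nodes, the SST method in effect on a node can be checked with a statement like the following (a sketch; it only reads the variable and changes nothing):

```sql
-- Show which SST method this node is configured to use.
SHOW GLOBAL VARIABLES LIKE 'wsrep_sst_method';
```

On the affected nodes this should report mariabackup, and rsync once the workaround is applied.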
It may be a problem on the Galera/MariaDB side, but we have not been able to identify the root cause, and we have not been able to reproduce the issue on demand.
Any help would be appreciated.
Thanks a lot