  MariaDB Server / MDEV-7111

Unable to detect network timeout in 10.x when using SSL (regression from 5.5)

Details

    Description

      Summary:
      When simulating a loss of network connectivity between replicating servers, MariaDB 10.x does not detect the loss if the replication channel uses SSL, although 5.5 does. This happens regardless of the value of the "slave_net_timeout" variable.

      Reproduced using the 10.0.14 GLIBC214 build and 10.0.0 builds, using both GTID (10.0.14) and binlog-based replication (10.0.14/10.0.0). Works as expected on 5.5.40 with binlog-based replication. I am testing with the binary .tar.gz MariaDB builds downloaded from the MariaDB servers (archive.mariadb.org).

      Steps to reproduce (functional case):

      • Set up MariaDB with two 5.5.40 servers in master-slave configuration and ensure replication is working and SSL-encrypted.
      • Start generating traffic on the master. Watch the slave status to see the traffic is being replicated successfully.
      • Simulate a network failure, e.g. "iptables -I INPUT -s <master_ip> -j DROP" on the slave. This drops all network packets from the master host.
      • Wait for slave_net_timeout seconds to pass. The slave will restart as documented, and the slave status will now state that it is attempting to reconnect to the master.

      Steps to reproduce (broken case):

      • Set up MariaDB with two 10.0.14 servers in master-slave configuration and ensure replication is working and SSL-encrypted.
      • Start generating traffic on the master. Watch the slave status to see the traffic is being replicated successfully.
      • Simulate a network failure, e.g. "iptables -I INPUT -s <master_ip> -j DROP" on the slave. This drops all network packets from the master host.
      • Wait for slave_net_timeout seconds to pass. The slave status will continue to state "waiting for master to send event", even though the log position counters are not advancing. The slave will remain in this state until it is stopped and restarted; it will not reconnect on its own, contrary to the documentation. This is a change in behavior from MariaDB 5.5 and appears to be incorrect.


        Activity

          paulkreiner Paul Kreiner added a comment -

          I also tested this with 10.0.14 as the master, and 5.5.40 as the slave. This works as expected. 10.0.14 as the slave does not work, which seems to indicate that the slave code path is what changed. I also tested with 10.0.0 as the slave, and it also does not work.

          Here is the my.cnf on the slave:
          [client]
          port = 3306
          socket = /var/run/mysqld/mysqld.sock
          ssl-ca=/etc/mysql/ssl/ca-cert.pem
          [mysqld_safe]
          socket = /var/run/mysqld/mysqld.sock
          nice = 0
          [mysqld]
          user = mysql
          pid-file = /var/run/mysqld/mysqld.pid
          socket = /var/run/mysqld/mysqld.sock
          port = 3306
          basedir = /usr/local/mysql
          datadir = /usr/local/mysql/data
          tmpdir = /tmp
          skip-external-locking
          key_buffer_size = 8M
          myisam_sort_buffer_size = 2M
          aria_pagecache_buffer_size = 64M
          aria_sort_buffer_size = 32M
          max_allowed_packet = 16M ## Default: 1M
          max_connections = 3500 ## Default: 100
          max_connect_errors = 100000 ## Default: 10 Range: 1-4294967295
          table_cache = 200 ## Default: 32
          thread_stack = 256K
          thread_cache_size = 8
          query_cache_limit = 4M ## Default: 1M
          query_cache_size = 128M ## Default: 16M
          log_error = /var/log/mysql/error.log
          log_warnings = 0
          slow_query_log = 1
          slow_query_log_file = /var/log/mysql/mysql-slow.log
          long_query_time = 10
          server-id = 1 ## Should usually match gtid-domain-id.
          log_bin = mysql-logs/bin-log
          log-bin-index = mysql-logs/bin-log.index
          master-info-file = mysql-logs/master.info
          log-slave-updates
          expire_logs_days = 14
          max_binlog_size = 100M
          auto_increment_increment = 2
          auto_increment_offset = 1
          slave-net-timeout = 6
          slave_compressed_protocol = 1
          relay-log = mysql-logs/relay-log
          relay-log-index = mysql-logs/relay-log.index
          relay-log-info-file = mysql-logs/relay-log.info
          replicate-ignore-db = mysql
          default-storage-engine = InnoDB
          innodb_file_format = barracuda
          innodb_file_per_table
          innodb_log_group_home_dir = mysql-logs/log/
          innodb_data_file_path = mysql-logs/data/ibdata1:100M:autoextend
          innodb_flush_method = O_DIRECT
          large_pages
          innodb_buffer_pool_size = 7700M
          innodb_log_file_size = 125M
          innodb_log_files_in_group = 2
          innodb_log_buffer_size = 8M
          innodb_lock_wait_timeout = 50
          innodb_flush_log_at_trx_commit = 2
          innodb_thread_concurrency = 0
          innodb_io_capacity = 3700
          innodb_write_io_threads = 8
          innodb_read_io_threads = 8
          innodb_purge_threads = 1
          innodb_stats_method = nulls_ignored
          innodb_stats_sample_pages = 128
          ssl-ca=/etc/mysql/ssl/ca-cert.pem
          ssl-cert=/etc/mysql/ssl/server-cert.pem
          ssl-key=/etc/mysql/ssl/server-key.pem
          [mysqldump]
          quick
          quote-names
          max_allowed_packet = 16M
          [mysql]
          no-auto-rehash
          [isamchk]
          key_buffer = 32M
          sort_buffer_size = 32M
          read_buffer = 4M
          write_buffer = 4M

          Here is the replication setup command. Note that we are using SSL ("require ssl" is set for the replicator user on the master). I'm not sure whether that affects this behavior.
          CHANGE MASTER TO
            master_host=<master>,
            master_port=3306,
            master_ssl=1,
            master_ssl_ca='/etc/mysql/ssl/ca-cert.pem',
            master_ssl_cert='/etc/mysql/ssl/server-cert.pem',
            master_ssl_key='/etc/mysql/ssl/server-key.pem',
            master_user='replicator',
            master_password=<hidden>;

          paulkreiner Paul Kreiner added a comment -

          This appears to only affect the connection if SSL is used. If I set up non-encrypted replication with 10.0.14 as slave, then the net timeout is detected correctly and the reconnection happens as expected.


          elenst Elena Stepanova added a comment -

          Thanks for the report.

          It turns out we inherited this regression from MySQL 5.6, specifically 5.6.3, and this revision in particular:

          revno: 3134
          revision-id: davi.arnaut@oracle.com-20110531135209-8kxz4np8c4gav6s2
          parent: jimmy.yang@oracle.com-20110531093059-3x1f93rnspltp3h6
          committer: Davi Arnaut <davi.arnaut@oracle.com>
          branch nick: 11762221-trunk
          timestamp: Tue 2011-05-31 10:52:09 -0300
          message:
            Bug#11762221 - 54790: Use of non-blocking mode for sockets limits performance
            Bug#11758972 - 51244: wait_timeout fails on OpenSolaris
            
            The problem was that a optimization for the case when the server
            uses alarms for timeouts could cause a slowdown when socket
            timeouts are used instead. In case alarms are used for timeouts,
            a non-blocking read is attempted first in order to avoid the
            cost of setting up a alarm and if this non-blocking read fails,
            the socket mode is changed to blocking and a alarm is armed.
            
            If socket timeout is used, there is no point in attempting a
            non-blocking read first as the timeout will be automatically
            armed by the OS. Yet the server would attempt a non-blocking
            read first and later switch the socket to blocking mode. This
            could inadvertently impact performance as switching the blocking
            mode of a socket requires at least two calls into the kernel
            on Linux, apart from problems inherited by the scalability
            of fcntl(2).
            
            The solution is to remove alarm based timeouts from the
            protocol layer and push timeout handling down to the virtual
            I/O layer. This approach allows the handling of socket timeouts
            on a platform-specific basis. The blocking mode of the socket
            is no longer exported and VIO read and write operations either
            complete or fail with a error or timeout.
            
            On Linux, the MSG_DONTWAIT flag is used to enable non-blocking
            send and receive operations. If the operation would block,
            poll() is used to wait for readiness or until a timeout occurs.
            This strategy avoids the need to set the socket timeout and
            blocking mode twice per query.
            
            On Windows, as before, the timeout is set on a per-socket
            fashion. In all remaining operating systems, the socket is set
            to non-blocking mode and poll() is used to wait for readiness
            or until a timeout occurs.
            
            In order to cleanup the code after the removal of alarm based
            timeouts, the low level packet reading loop is unrolled into
            two specific sequences: reading the packet header and the
            payload. This makes error handling easier down the road.
            
            In conclusion, benchmarks have shown that these changes do not
            introduce any performance hits and actually slightly improves
            the server throughput for higher numbers of threads.
            
            - Incompatible changes:
            
            A timeout is now always applied to a individual receive or
            send I/O operation. In contrast, a alarm based timeout was
            applied to an entire send or receive packet operation. That
            is, before this patch the timeout was really a time limit
            for sending or reading one packet.
            
            Building and running MySQL on POSIX systems now requires
            support for poll() and O_NONBLOCK. These should be available
            in any modern POSIX system. In other words, except for Windows,
            legacy (non-POSIX) systems which only support O_NDELAY and
            select() are no longer supported.
            
            On Windows, the default value for MYSQL_OPT_CONNECT_TIMEOUT
            is no longer 20 seconds. The default value now is no timeout
            (infinite), the same as in all other platforms.
            
            Packets bigger than the maximum allowed packet size are no
            longer skipped. Before this patch, if a application sent a
            packet bigger than the maximum allowed packet size, or if
            the server failed to allocate a buffer sufficiently large
            to hold the packet, the server would keep reading the packet
            until its end. Now the session is simply disconnected if the
            server cannot handle such large packets.
            
            The client socket buffer is no longer cleared (drained)
            before sending commands to the server. Before this patch,
            any data left in the socket buffer would be drained (removed)
            before a command was sent to the server, in order to work
            around bugs where the server would violate the protocol and
            send more data. The only check left is a debug-only assertion
            to ensure that the socket buffer is empty.
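          The Linux strategy the commit describes (a non-blocking recv() with MSG_DONTWAIT, falling back to poll() until data arrives or the timeout elapses) can be sketched as follows. This is a simplified illustration under stated assumptions, not MariaDB's actual vio code; recv_with_timeout() is a hypothetical helper name.

```c
#include <errno.h>
#include <poll.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Returns bytes read, 0 on orderly shutdown, or -1 with errno set
   (ETIMEDOUT if the timeout elapsed before any data arrived). */
ssize_t recv_with_timeout(int fd, void *buf, size_t len, int timeout_ms)
{
    for (;;) {
        ssize_t n = recv(fd, buf, len, MSG_DONTWAIT); /* never blocks */
        if (n >= 0)
            return n;                     /* got data, or peer closed */
        if (errno != EAGAIN && errno != EWOULDBLOCK)
            return -1;                    /* genuine I/O error */

        /* Nothing readable yet: wait for readiness or timeout. */
        struct pollfd pfd = { .fd = fd, .events = POLLIN };
        int r = poll(&pfd, 1, timeout_ms);
        if (r == 0) {                     /* nothing readable in time */
            errno = ETIMEDOUT;
            return -1;
        }
        if (r < 0 && errno != EINTR)
            return -1;
        /* readable, or interrupted by a signal: retry the recv() */
    }
}
```

          Note that under this scheme the timeout applies to each individual receive operation rather than to a whole packet, which is exactly the incompatible change the commit calls out.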

          When a problem comes from upstream, we normally report it to them so that they are aware of it. If they decide to fix it, we merge the fix to avoid unnecessary divergence; if they decide not to fix it, we might do it on our own.
          Are you willing to report it at bugs.mysql.com? Otherwise I can do it on your behalf.

          paulkreiner Paul Kreiner added a comment -

          Thanks, Elena. I've created a bug report upstream. If you wish to chime in, you may do so here: http://bugs.mysql.com/74908


          elenst Elena Stepanova added a comment -

          Thank you. I've subscribed to the report.


          bnestere Brandon Nesterenko added a comment -

          Tried to reproduce in 10.5.26, but could not. The slave_net_timeout variable is respected with SSL.


          People

            Assignee: Unassigned
            Reporter: Paul Kreiner

