[MDEV-18621] wsrep_sst_mariabackup socat dead connection Created: 2019-02-18 Updated: 2023-10-10 |
|
| Status: | Stalled |
| Project: | MariaDB Server |
| Component/s: | Galera SST |
| Affects Version/s: | 10.2.22 |
| Fix Version/s: | 10.2 |
| Type: | Bug | Priority: | Major |
| Reporter: | Martin Vit | Assignee: | Julius Goryavsky |
| Resolution: | Unresolved | Votes: | 1 |
| Labels: | None | ||
| Environment: |
debian 9 |
||
| Description |
|
When doing SST on joiner the socat receiver once donor completes SST transfer the joiner hangs in TCP connection ESTABLISHED while the socat on donor already ends. The solution is to wait 7200 seconds which is default timeout for dead tcp connections or manually kill socat on joiner which will continue with SST or my workaround currently is configuration on joiner in my.cnf - [sst] which will close dead tcp connection. I suggest to put this keepalive directly into the /usr/bin/wsrep_sst_mariabackup I also suggest to revise or find out why the socat on donor does not sends FIN or sends EOF over the network to joiner. Here are some logs: on Donor: Feb 18 04:04:50 s1 -innobackupex-backup: [00] 2019-02-18 04:04:50 completed OK! on Joiner: Feb 18 02:20:31 s3 -wsrep-sst-joiner: Waiting for SST streaming to complete! without the extra socket option (,keepalive,keepidle=10,keepintvl=10,keepcnt=2) the timeout will happen after 2 hours and not that fast |
| Comments |
| Comment by Jan Lindström (Inactive) [ 2019-06-14 ] |
|
There is workaround for this problem so this issue is not a critical. |