[MXS-3653] Lost connection to master server while waiting for a result. Connection has been idle for 0.0 seconds Created: 2021-07-01 Updated: 2021-07-23 Resolved: 2021-07-23 |
|
| Status: | Closed |
| Project: | MariaDB MaxScale |
| Component/s: | Core |
| Affects Version/s: | 2.5.13 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | Maiavria | Assignee: | markus makela |
| Resolution: | Won't Fix | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Debian 10.10, 10.5.11-MariaDB |
||
| Sprint: | MXS-SPRINT-136 |
| Description |
|
Hello, I'm experiencing theses warnings in MX's log every seconds. I tried disabling persistent pool without success. I've also added backend timeouts without success. No success with setting a higher wait_timeout on mariadb either. My configuration is a read-write split router and 3x 10.5.11-MariaDB galera cluster nodes. With info_log enabled I get the following :
My MX configuration :
Do you have any clues why this happening with latest version ? |
| Comments |
| Comment by markus makela [ 2021-07-11 ] |
|
The error is logged because the client closes the connection with KILL CONNECTION_ID(). This causes an unexpected interruption of the connection which MaxScale proceeds to log. Currently this seems to be expected behavior but we could suppress the error for that particular error type. |
| Comment by markus makela [ 2021-07-14 ] |
|
It turns out this isn't something that's easy to fix: if you kill your own connection, I believe you aren't guaranteed to receive the error for it before the network hangup arrives. For this reason it's possible that MaxScale thinks it lost the connection to the master and decides to log the message. In order for readwritesplit to know that it shouldn't log this, it would need to detect that the KILL targets the connection itself. Since this isn't a very practical way of closing a connection, I don't think it's worth it to fix this just for this one query that, at least to my knowledge, isn't useful. |
| Comment by markus makela [ 2021-07-18 ] |
|
jom why is the KILL CONNECTION_ID() executed and who does it? Can you remove it or further explain why the connection needs to KILL the same connection that does it? |
| Comment by markus makela [ 2021-07-23 ] |
|
Since this seems to be something that's caused by the application itself, I'll close this as Won't Fix until we get more feedback. |