[MXS-2860] MaxScale 2.3.16 logs "Lost connection to master server while waiting for a result. Connection has been idle for 0.0 seconds. Error caused by: #HY000: Lost connection to backend server: network error. Last close reason: <none>"
| Created: | 2020-01-29 | Updated: | 2020-07-07 | Resolved: | 2020-07-07 |
|
| Status: | Closed |
| Project: | MariaDB MaxScale |
| Component/s: | N/A |
| Affects Version/s: | 2.3.16 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | Claudio Nanni | Assignee: | markus makela |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | need_feedback |
| Sprint: | MXS-SPRINT-99, MXS-SPRINT-101 |
| Description |
|
It looks like https://jira.mariadb.org/browse/MXS-2408 is still present in 2.3.16:
2020-01-23 15:42:15 error : (872) Lost connection to the master server 'server', closing session. Lost connection to master server while waiting for a result. Connection has been idle for 0.0 seconds. Error caused by: #HY000: Lost connection to backend server: network error. Last close reason: <none> |
| Comments |
| Comment by markus makela [ 2020-02-06 ] |
|
Since this was on MaxScale 2.3.16, which has the improved error logging, we know this event wasn't generated by MaxScale: there is no "(Generated event)" after the network error part. This means that it is not a regression of
The connection idle time calculation could be wrong if a network input event (EPOLLIN) occurs at the same time as a network error (EPOLLERR). Currently the idle time is unconditionally updated whenever network input events arrive. By updating the idle time only when data is successfully read, we should be able to preserve the real idle time of the connection. |
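The idle time change described in the comment above can be illustrated with a small sketch. This is not MaxScale code: the `Connection` class, both handler functions and the timestamps are hypothetical, and the epoll flag values are written out by hand (matching the Linux constants) so the example runs anywhere. It only shows why an unconditional update reports "idle for 0.0 seconds" when EPOLLIN and EPOLLERR arrive together, and how updating on a successful read keeps the real value.

```python
from dataclasses import dataclass

# Linux epoll flag values, written out by hand so the sketch runs anywhere.
EPOLLIN = 0x001
EPOLLERR = 0x008


@dataclass
class Connection:
    """Hypothetical stand-in for a backend connection."""
    last_read: float  # time of the last successful read, in seconds

    def idle_seconds(self, now: float) -> float:
        return now - self.last_read


def on_events_unconditional(conn, events, bytes_read, now):
    """Behaviour described as current: any input event resets the idle time."""
    if events & EPOLLIN:
        conn.last_read = now


def on_events_read_only(conn, events, bytes_read, now):
    """Proposed behaviour: reset the idle time only if data was actually read."""
    if events & EPOLLIN and bytes_read > 0:
        conn.last_read = now


# Last successful read at t=0. At t=30, EPOLLIN and EPOLLERR arrive together
# and the read returns no data because the connection has failed.
events = EPOLLIN | EPOLLERR
old = Connection(last_read=0.0)
new = Connection(last_read=0.0)
on_events_unconditional(old, events, bytes_read=0, now=30.0)
on_events_read_only(new, events, bytes_read=0, now=30.0)

print(old.idle_seconds(30.0))  # 0.0, the misleading value from the log
print(new.idle_seconds(30.0))  # 30.0, the real idle time
```

Gating the update on the number of bytes read, rather than on the event flag, is the design choice the comment proposes; how MaxScale actually implements it is not shown in this issue.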
| Comment by markus makela [ 2020-03-06 ] |
|
I managed to find a few cases where the network socket error is cleared before the error handler function gets to read it. With this fix, we should be able to see the actual error message that caused the connection to be closed. |
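One way such an error can be lost is sketched below, under the assumption that the error is fetched with `getsockopt(SO_ERROR)`, which resets the pending error once it has been read. The comment does not say which code paths were involved, so `FakeSocket` and `ErrorTracker` are hypothetical names used purely for illustration; the idea is to remember the first non-zero error so that a later read by the error handler still sees it.

```python
import errno
import os
import socket


class FakeSocket:
    """Hypothetical test double: reports a pending error once, like SO_ERROR."""

    def __init__(self, pending):
        self._pending = pending

    def getsockopt(self, level, optname):
        assert (level, optname) == (socket.SOL_SOCKET, socket.SO_ERROR)
        err, self._pending = self._pending, 0  # reading clears the error
        return err


class ErrorTracker:
    """Remembers the first non-zero socket error seen on a connection."""

    def __init__(self, sock):
        self.sock = sock
        self.saved_errno = 0

    def poll_error(self):
        err = self.sock.getsockopt(socket.SOL_SOCKET, socket.SO_ERROR)
        if err and not self.saved_errno:
            self.saved_errno = err  # keep it for the error handler
        return self.saved_errno

    def describe(self):
        if self.saved_errno:
            return os.strerror(self.saved_errno)
        return "no error recorded"


tracker = ErrorTracker(FakeSocket(errno.ECONNRESET))
first = tracker.poll_error()   # some earlier code path reads the error
second = tracker.poll_error()  # the error handler reads it afterwards
print(first == second == errno.ECONNRESET)  # True
print(tracker.describe())
```

Without the saved copy, the second read would return 0 and the log could only report a generic network error.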
| Comment by markus makela [ 2020-07-07 ] |
|
Closing as Cannot Reproduce, as this hasn't happened with the latest releases. |