[MXS-2371] Maxscale crashed after losing connection to master database Created: 2019-03-07 Updated: 2019-10-18 Resolved: 2019-10-18 |
|
| Status: | Closed |
| Project: | MariaDB MaxScale |
| Component/s: | failover |
| Affects Version/s: | 2.3.4 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | Jeffrey Parker | Assignee: | markus makela |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Centos 7, Percona Mysql backend, PHP front end |
||
| Attachments: |
|
| Description |
|
MaxScale lost connection to the master server and promptly crashed. The relevant log lines are below.
|
| Comments |
| Comment by Jeffrey Parker [ 2019-03-07 ] |
|
Unfortunately this only happened under load that I cannot seem to reproduce without our production environment so I do not have simple reproduction steps and the core dump does not exist because that failed due to a signing issue. |
| Comment by markus makela [ 2019-03-07 ] |
|
Please add the base MaxScale configuration plus any generated configurations from /var/lib/maxscale/maxscale.cnf.d/ to the issue. Please also remove any sensitive data from the configuration files like usernames, passwords and IP addresses. First look at the stacktrace would suggest either a null buffer or a pointer to a freed buffer. |
| Comment by Jeffrey Parker [ 2019-03-07 ] |
|
Configs attached. |
| Comment by markus makela [ 2019-03-12 ] |
|
Seems that the readwritesplit service has a lot of filters. Can you try to remove the filters and see if it happens again? If it doesn't, try adding back the filters one at a time to find out which causes the problem. |
| Comment by Jeffrey Parker [ 2019-03-12 ] |
|
I would really love to help you out with this, but we really can't do that since we have only been able to get this to happen with our full production load going through maxscale and when it crashes we have like an hour of complete downtime while we fix it. This is not something we can do and any testing we have done has either not put enough load on it, has not run long enough, or has not run the correct queries to cause this issue. We also had to add the filters to try to make sure the software worked when using maxscale, while there may be some redundancy we kind of need the filters in place except for the top and qla filters. I will point out that the maxscale system was nowhere near having a high load, very low CPU usage, no high memory usage, no disk io to speak of. |
| Comment by markus makela [ 2019-03-13 ] |
|
OK, that's understandable. We'll continue our efforts to try and reproduce this on our environment. |
| Comment by markus makela [ 2019-03-28 ] |
|
I've been testing with the combined configuration and haven't been able to reproduce the crash. I'll see if I can put this configuration under some heavy testing and see if that helps. |
| Comment by markus makela [ 2019-10-18 ] |
|
We never reproduced this error which is why I'll close this as Cannot Reproduce. If this still happens with the latest release, please let us know and we'll reopen this issue. |