[MXS-2446] Fatal on Maxscale server on reimaging clustrix setup being monitored. Created: 2019-04-23 Updated: 2019-07-04 Resolved: 2019-07-03 |
|
| Status: | Closed |
| Project: | MariaDB MaxScale |
| Component/s: | xpandmon |
| Affects Version/s: | None |
| Fix Version/s: | 2.4.1 |
| Type: | Bug | Priority: | Major |
| Reporter: | Rahul Joshi (Inactive) | Assignee: | Johan Wikman |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
MaxScale server karma172: |
||
| Sprint: | MXS-SPRINT-81, MXS-SPRINT-85 |
| Description |
|
I’m seeing a Fatal on MaxScale:
I had a working setup with MaxScale and Clustrix and then I reimaged the OS (from RHEL 7 to CentOS 7) on all clustrix nodes to take it to 9.1.4 from Fred release, reinstalled Clustrix and created the clustrix group again. maxscale user is not yet created in Clustrix. Config file:
|
| Comments |
| Comment by markus makela [ 2019-04-23 ] | ||||||||||||||||
|
Can you provide the full stacktrace (all lines logged at alert level)? | ||||||||||||||||
| Comment by Rahul Joshi (Inactive) [ 2019-04-23 ] | ||||||||||||||||
|
This is what I could find out:
| ||||||||||||||||
| Comment by Johan Wikman [ 2019-04-30 ] | ||||||||||||||||
|
rahul.joshi@mariadb.com Was MaxScale built in Debug or Release mode? As the crash happens at server.cc:705 it means that the pointer to the object at that point must have pointer at NULL or garbage. The object calling that method at that point is assumed to be created in a context where the pointer to that server object is valid. If that is not the case, then that would have caused an assertion to have been triggered. But that assertion is active only in debug mode. | ||||||||||||||||
| Comment by Rahul Joshi (Inactive) [ 2019-04-30 ] | ||||||||||||||||
|
Hi johan.wikman , I built MaxScale from source (develop branch) and did not specify any debug flags. I used: Thanks and Regards, | ||||||||||||||||
| Comment by Johan Wikman [ 2019-05-03 ] | ||||||||||||||||
|
Ok, then that means that the assertion would not have been triggered even if the condition was fulfilled. | ||||||||||||||||
| Comment by Johan Wikman [ 2019-05-06 ] | ||||||||||||||||
|
This could not be repeated, but the only | ||||||||||||||||
| Comment by Rahul Joshi (Inactive) [ 2019-06-12 ] | ||||||||||||||||
|
Okay, it happened again on the build that has the fix : This time, I was reimaging the CentOS 7 Glutton setup with 9 nodes. I don't see and option to reopen it. Can someone please reopen this bug for me? | ||||||||||||||||
| Comment by Johan Wikman [ 2019-06-13 ] | ||||||||||||||||
|
rahul.joshi@mariadb.com Would you have a core file? | ||||||||||||||||
| Comment by Johan Wikman [ 2019-06-13 ] | ||||||||||||||||
|
rahul.joshi@mariadb.com Could you explain in detail, step by step, what you are doing, because I'm not 100% sure I have fully understood. From the starting setup until the crash. | ||||||||||||||||
| Comment by Johan Wikman [ 2019-06-13 ] | ||||||||||||||||
|
rahul.joshi@mariadb.com And would you have the MaxScale log? | ||||||||||||||||
| Comment by Rahul Joshi (Inactive) [ 2019-06-18 ] | ||||||||||||||||
|
Hi johan.wikman , This does not seem to be reproducible at will. MaxScale log under: Thanks and Regards, | ||||||||||||||||
| Comment by Rahul Joshi (Inactive) [ 2019-07-02 ] | ||||||||||||||||
|
Finally it happened again after multiple setups were reimaged over the time. Similar configuration as last time except no SSL was used. [root@karma172 log]# date; grep -i "2019-07-02.*alert" maxscale.log | ||||||||||||||||
| Comment by Johan Wikman [ 2019-07-03 ] | ||||||||||||||||
|
rahul.joshi@mariadb.com Thanks! I'm working on this now and I think I know what is happening, although I'm not 100% sure yet. |