[MDEV-31818] Server crashes in choose_best_splitting Created: 2023-08-01 Updated: 2024-01-30 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | Server |
| Affects Version/s: | 10.11.2, 10.11.4 |
| Fix Version/s: | 10.11 |
| Type: | Bug | Priority: | Major |
| Reporter: | Christian Braeuner | Assignee: | Sergei Petrunia |
| Resolution: | Unresolved | Votes: | 2 |
| Labels: | crash | ||
| Environment: |
ProLiant DL360 Gen10, 48 cores, 128GB memory, Centos 8 |
| Description |
|
We have a primary with 2 replicas and have seen a crash where both replicas crash almost simultaneously on the same query. At other times during the day the same query has caused no problems, and unfortunately we have not managed to reproduce the crash. |
| Comments |
| Comment by Christian Braeuner [ 2023-08-03 ] |
|
To clarify our setup: one replica runs MariaDB 10.11.2, the other 10.11.4. The replicas are used exclusively for reading. The master is not used for queries and only imports data. |
| Comment by Christian Braeuner [ 2023-11-08 ] |
|
We are still experiencing the crashes at random intervals, on average 3-4 times a month. |
| Comment by Richard DEMONGEOT [ 2023-11-09 ] |
|
Hello cbefin, could you read the issue https://jira.mariadb.org/browse/MDEV-32064? I think it is very similar. If so, a patch has been written but not yet released. It should be in the next release. Regards, |
| Comment by Christian Braeuner [ 2023-11-13 ] |
|
Hi Richard, |
| Comment by Christian Braeuner [ 2023-11-20 ] |
|
Last week we changed our configuration to also use the primary node for requests, in an attempt to eliminate replication as one of the factors. Today we had a simultaneous crash of the primary and one replica, which tells us that replication is not causing the instability. The crash again happened during a bulk import of an unrelated table in a separate schema. |
| Comment by Alice Sherepa [ 2023-12-15 ] |
|
Is it possible for you to upgrade to a recent MariaDB version? This might be the same issue as MDEV-31440, and with the test case provided there the crash no longer happens. |
| Comment by Christian Braeuner [ 2024-01-30 ] |
|
Hi, in the meantime we have changed the query so that it no longer uses a subselect with DISTINCT. Since then we have had no crashes. As this crash was only ever observed in our production environment, we do not want to put the dangerous query back in order to test later versions of MariaDB. |