[MDEV-31532] Galera cluster blocks concurrent commits and leads to server hang Created: 2023-06-23 Updated: 2023-11-06 Resolved: 2023-11-06 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera |
| Affects Version/s: | 10.6 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Critical |
| Reporter: | Ramesh Sivaraman | Assignee: | Denis Protivensky |
| Resolution: | Incomplete | Votes: | 1 |
| Labels: | None | ||
| Attachments: |
|
| Description |
|
Testcase
|
| Comments |
| Comment by Seppo Jaakola [ 2023-06-26 ] | |||||||||||||||||||||||
|
the stack trace shows that there is one connection executing some DDL statement (actual SQL not shown), and this connection is probable reason for the hang. ramesh do you know what this DDL is about? mysqlslap is not supposed to cause this | |||||||||||||||||||||||
| Comment by Ramesh Sivaraman [ 2023-06-26 ] | |||||||||||||||||||||||
|
seppo Did not run any DDL statements apart from test table creation to reproduce the issue. I am not sure what DDL is being executed in the background, if any. | |||||||||||||||||||||||
| Comment by Seppo Jaakola [ 2023-06-26 ] | |||||||||||||||||||||||
|
Thread 3, in the stack trace has following call stack:
Too bad that SQL statement is not visible | |||||||||||||||||||||||
| Comment by Ramesh Sivaraman [ 2023-06-26 ] | |||||||||||||||||||||||
|
seppo Attached full bt full_bt_new.log | |||||||||||||||||||||||
| Comment by Seppo Jaakola [ 2023-06-27 ] | |||||||||||||||||||||||
|
ok, no DDL in the latest stack traces, and looks that there is genuine binlog group commit deadlock. There is a potentially related old issue: | |||||||||||||||||||||||
| Comment by Ramesh Sivaraman [ 2023-06-27 ] | |||||||||||||||||||||||
|
Server version is 10.6.14-MariaDB-log. The issue is happening only when we set repl.commit_order=1. Did not see this issue with the default repl.commit_order value. | |||||||||||||||||||||||
| Comment by Denis Protivensky [ 2023-11-06 ] | |||||||||||||||||||||||
|
The issue won't be fixed as users should be discouraged from changing `repl.commit_order` variable. This should be documented explicitly. |