[MDEV-29861] Galera "notify" test cases hang Created: 2022-10-24 Updated: 2023-12-12 Resolved: 2023-09-12 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera, Tests |
| Affects Version/s: | 10.4, 10.5, 10.6, 10.7, 10.8, 10.9, 10.10, 10.11 |
| Fix Version/s: | 10.4.32, 10.5.23, 10.6.16, 10.10.7, 10.11.6, 11.0.4, 11.1.3 |
| Type: | Bug | Priority: | Major |
| Reporter: | Jan Lindström (Inactive) | Assignee: | Julius Goryavsky |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None |
| Issue Links: |
|
| Description |
|
| Comments |
| Comment by Sergei Golubchik [ 2022-12-24 ] |
|
I find it easiest to reproduce locally with rr:
this hits it every time for me, and without --rr the test mostly passes. With the following change:
one can easily see why it hangs.
Here:
And this is from the timed out test:
As you can see, the server status changed to "joined" before Wsrep_server_state::instance().initialized() returned, so the notify script is started with the new status. On "joined" the script tries to connect to the server to update a table, but the server is not initialized yet and is not listening for connections. The result is a circular wait: the server waits for the script to finish, the script waits for the mariadb client to connect, and the client cannot connect because the server is not listening. |
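The hang described above is a circular wait among three parties. As an illustrative sketch only (the node labels and the `find_wait_cycle` helper are hypothetical and are not taken from the MariaDB source), the dependency chain can be written as a wait-for graph and checked for a cycle:

```python
from typing import Dict, List, Optional


def find_wait_cycle(waits_for: Dict[str, str]) -> Optional[List[str]]:
    """Return the first cycle found in a wait-for graph, or None.

    Each key waits for exactly one other party (its value).
    """
    for start in waits_for:
        path: List[str] = []
        node = start
        # Follow the chain until it ends or revisits a node on this path.
        while node in waits_for and node not in path:
            path.append(node)
            node = waits_for[node]
        if node in path:
            return path[path.index(node):]
    return None


# Hypothetical labels for the three parties in the report.
hang = {
    "server startup": "notify script",   # server waits for the script to exit
    "notify script": "mariadb client",   # script waits for the client to connect
    "mariadb client": "server startup",  # client needs the server to listen
}

# If the server were already listening, the client would not be blocked.
no_hang = {k: v for k, v in hang.items() if k != "mariadb client"}

print(find_wait_cycle(hang))
print(find_wait_cycle(no_hang))
```

The first call reports all three parties as one cycle, and the second returns `None`. This matches the reasoning in the comment: the cycle exists only while the client's connection depends on a server that has not yet finished initializing.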
| Comment by Jan Lindström [ 2023-08-16 ] |
|
julien.fritsch I do not think this is a critical bug: at the moment it affects only tests, and it requires the wsrep_notify_cmd configuration variable to be set, which is undefined by default. Regardless, I will work on this issue and propose a fix. |
| Comment by Jan Lindström [ 2023-08-25 ] |
| Comment by Julius Goryavsky [ 2023-09-12 ] |
|
Fixed, https://github.com/MariaDB/server/commit/632a503ce7726547d8d858ae89fc3833ff95fe83 |