[MDEV-32896] Unstable XA + binglog tests, with possible MDEV-32830 caused issues Created: 2023-11-28 Updated: 2024-02-03 |
|
| Status: | Stalled |
| Project: | MariaDB Server |
| Component/s: | Binary Protocol, Tests, XA |
| Affects Version/s: | N/A |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Critical |
| Reporter: | Roel Van de Paar | Assignee: | Roel Van de Paar |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | affects-tests | ||
| Issue Links: |
|
||||||||||||
| Description |
|
The following tests: binlog_xa_recover, binlog_xa_prepared_disconnect, and binlog_empty_xa_prepared have proven to be unstable. The issue is that they fail differently, and additionally fail in various different ways, on base 10.6 and the MDEV-32830 patch trees. As such, MTR stress testing of XA + binlog on the MDEV-32830 patch is not possible, and it is possible that MDEV-32830 is causing different/additional issues. Given this, these tests will need to be fixed and stabilized before signoff on the MDEV-32830 patch can happen. The failures occur even when run in single-thread instances (verified), but various issues can be made to shown quickly (< ~1 minute) using:
|
| Comments |
| Comment by Roel Van de Paar [ 2023-11-28 ] |
|
Regrettably I am also seeing, more sporadically, issues with binlog_xa_checkpoint, binlog_xa_handling and xa_binlog, though the latter thus far on base only. binlog_xa_checkpoint and binlog_xa_handling will thus need to be checked also. For binlog_xa_handling issues have been seen only on the patch tree thus far. Additionally, what may be of interest, xa_binlog is considerably faster (17 seconds versus 70 seconds for 1085 tests) on the patch tree than on base. For this testcase, there seems to be a clear parallelism at work in the patch tree unlike base. |
| Comment by Andrei Elkin [ 2023-11-28 ] |
|
roel, I can't confirm by running them locally the way you did. On both bb-10.6-MDEV-31949 and the vanilla 10.6. Let me ask you to paste 10.6 and bb-10.6-MDEV-31949 traces in two separate comments so that I'd try to match, or explain any difference? |
| Comment by Andrei Elkin [ 2023-11-28 ] |
|
> I am also seeing, more sporadically, issues with binlog_xa_checkpoint Please always paste them - even for your own records - as apparently mtr invocation references may not suffice for one with different env. |
| Comment by Roel Van de Paar [ 2023-11-29 ] |
|
> I can't confirm by running them locally the way you did. On both bb-10.6-MDEV-31949 and the vanilla 10.6. > In which branch? > The test has been altered in 31949 in 9de57a483e7. Previously it must've been non-deterministic. > Please always paste them - even for your own records - as apparently mtr invocation references may not suffice for one with different env. |
| Comment by Roel Van de Paar [ 2024-01-19 ] |
|
This is waiting for MDEV-32830 ftm, so I have reversed the blocker direction. Retesting required once MDEV-32830 and MDEV-31949 are ready for testing. |