[MDEV-15409] make sure every sst script is tested in buildbot Created: 2018-02-24 Updated: 2018-06-11 Resolved: 2018-03-23 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera SST |
| Fix Version/s: | 10.1.32, 10.2.14 |
| Type: | Task | Priority: | Blocker |
| Reporter: | Sergei Golubchik | Assignee: | Sergei Golubchik |
| Resolution: | Fixed | Votes: | 2 |
| Labels: | None |
| Issue Links: |
|
| Sprint: | 10.2.14 |
| Description |
|
make sure every sst script is tested in buildbot |
| Comments |
| Comment by Elena Stepanova [ 2018-03-12 ] |
|
I've set up non-MTR tests in buildbot, for now on deb packages for the 10.1, 10.2 and 10.3 main branches. The tests run as part of the installation/upgrade group and use the VM image produced by the install test (which installs MariaDB server and whatever dependencies it pulls in).
The test is run 4 times, once for each of mariabackup, xtrabackup-v2, mysqldump and rsync. The runs are shown in buildbot as galera-mariabackup, galera-rsync, etc. Error logs from all nodes and the syslog are stored. Notes:
Tests seem to behave more or less as expected at first glance, but some intermittent failures will probably happen and will need to be fixed. Examples: |
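The per-method runs described in the comment above can be pictured as a small test matrix. The following is a minimal sketch only, not the actual buildbot configuration: the SST methods, branches and the `galera-<method>` naming come from the comment, while the helper names `galera_test_name` and `test_matrix` are hypothetical.

```python
# SST methods and branches as listed in the comment above.
SST_METHODS = ["mariabackup", "xtrabackup-v2", "mysqldump", "rsync"]
BRANCHES = ["10.1", "10.2", "10.3"]


def galera_test_name(sst_method):
    """Return the name under which a run appears in buildbot, e.g. galera-rsync."""
    return "galera-" + sst_method


def test_matrix(branches=BRANCHES, methods=SST_METHODS):
    """Hypothetical helper: one (branch, test name) pair per test run."""
    return [(branch, galera_test_name(method))
            for branch in branches
            for method in methods]


if __name__ == "__main__":
    for branch, name in test_matrix():
        print(branch, name)
```

Each branch gets one run per SST method, so a failure can be attributed to a single method rather than to the Galera setup as a whole.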
| Comment by Daniel Black [ 2018-03-12 ] |
|
elenst, thanks for setting up all these tests.
xtrabackup-v2 and mariabackup SST failures with mysqld_multi: as xtrabackup and mariabackup can take --datadir as an argument, and the datadir is passed to the SST scripts, there shouldn't be a reason for these scripts to rely on configuration file settings. wlad commented here https://github.com/MariaDB/server/pull/554#issuecomment-359403975 that settings come directly from the server. In theory it should work with --defaults-group-suffix= too.
On the 10.3 xtrabackup-v2 failure: it's hitting the error: however, there are a lot more formats in MariaDB.
I was working on some Docker-related tests. Can I get a link to the buildbot test source please? |
| Comment by Elena Stepanova [ 2018-03-12 ] |
The problem is not with the xtrabackup and mariabackup tools, it's with the SST methods xtrabackup-v2 and mariabackup. At some point (at the last move-back step in particular) they don't pass datadir over to innobackupex, so it attempts to find it in the default config files, and things get messed up.
There isn't much of a source, the text above basically describes it all, but anyway, everything is in buildbot's maria-master.cfg, def getDebGaleraStep (it will be in https://github.com/MariaDB/mariadb.org-tools/blob/master/buildbot/maria-master.cfg after it auto-commits next night). Please don't modify it directly even if it looks ugly to you.
Please also note that the addition of these tests doesn't remove the need for MTR tests. While of course ultimately it's MariaDB's responsibility to test the final code, somehow the rule that all contributions must come with their own tests has been largely forgotten; there have been many patches to SST scripts without any tests whatsoever. It would be beneficial for quality if they were tested more thoroughly before submission and followed the guidelines upon submission, even if that caused some decrease in the number of patches. |
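The datadir problem described in the comment above comes down to one step building its command line without the data directory, leaving the tool to look it up in default config files. A minimal sketch of the alternative, passing it explicitly, is shown below. It is an illustration only: the real SST scripts are shell scripts, `with_explicit_datadir` is a hypothetical helper, and the example command line is an assumption rather than what the scripts actually run. Only the `--datadir` option itself is taken from the discussion.

```python
def with_explicit_datadir(base_cmd, datadir):
    """Return a copy of base_cmd that passes the data directory explicitly.

    Hypothetical helper: it appends --datadir=<path> unless the command
    already carries a --datadir option, so the tool never has to fall
    back to default config files to locate the data directory.
    """
    if not datadir:
        raise ValueError("datadir must be known before building the command")
    if any(arg.startswith("--datadir") for arg in base_cmd):
        return list(base_cmd)
    return list(base_cmd) + ["--datadir=" + datadir]


if __name__ == "__main__":
    # Assumed example command; the path is a placeholder.
    print(with_explicit_datadir(["mariabackup", "--move-back"], "/var/lib/mysql"))
```

Building the argument list in one place makes it harder for a single step, such as the final move-back, to drop the option.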
| Comment by Daniel Black [ 2018-03-13 ] |
|
Thanks for the details, elenst. I've no intention of modifying the buildbot config, just gaining ideas for testing. I apologize for my broken Galera SST changes. Quick remedies, especially after releases, aren't good enough. They weren't tested properly and I'm really sorry; I lost patience with the state of the SST MTR tests and hoped component-based testing was sufficient. It wasn't. I'll try to fix some of the MTR tests for Galera and I promise to do better next time. So sorry. |
| Comment by Sergei Golubchik [ 2018-03-15 ] |
|
| Comment by Sachin Setiya (Inactive) [ 2018-03-19 ] |
|
It is failing because each call of "show status" in wait_until_connected_again.inc returns error 1205. |
| Comment by Aurélien LEQUOY [ 2018-03-20 ] |
|
For me, the problem is on the client. |
| Comment by Aurélien LEQUOY [ 2018-03-20 ] |
|
The problem is not in MariaDB Server and not in Galera, but in the stupid client: libmariadbclient18 10.2.13+maria~stretch. You need to add a full test that tests a Galera cluster WITH SST, and not push a new version without trying SST. I have lost so much time on this; I remember that on 10.0.x I spent one month finding a tricky problem in xtrabackup (that time) and solving 95% of the bugs open at Percona. |