[MDEV-12906] binlog_encryption.encrypted_master failed in buildbot - Failed to sync Created: 2017-05-24 Updated: 2021-06-04 Resolved: 2021-06-04 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Replication, Tests |
| Affects Version/s: | 10.1, 10.2, 10.3, 10.4, 10.5 |
| Fix Version/s: | 10.1.48, 10.2.38, 10.3.29, 10.4.19, 10.5.10, 10.6.1 |
| Type: | Bug | Priority: | Major |
| Reporter: | Elena Stepanova | Assignee: | Brandon Nesterenko |
| Resolution: | Not a Bug | Votes: | 0 |
| Labels: | None | ||
| Description |
|
|
| Comments |
| Comment by Marko Mäkelä [ 2020-12-09 ] | |||||||||||||||||||||||
|
I observed recent occurrences of this in a 10.5-based branch: https://buildbot.mariadb.org/#/builders/46/builds/1887/steps/8/logs/stdio
It occurs with both stmt and mix, as well as with both ctr and cbc combinations. The builder name is debug-ubuntu-1809, and it is running on Docker. | |||||||||||||||||||||||
| Comment by Marko Mäkelä [ 2020-12-09 ] | |||||||||||||||||||||||
|
Buildbot cross-reference is also showing SIGSEGV in this test for various versions. Here is an example from a main branch:
| |||||||||||||||||||||||
| Comment by Andrei Elkin [ 2020-12-09 ] | |||||||||||||||||||||||
|
There seems to be multiple unrelated failures on the test. E.g X-ref witnesses (10.5 2020-11-03 19:53:29)
| |||||||||||||||||||||||
| Comment by Brandon Nesterenko [ 2021-06-01 ] | |||||||||||||||||||||||
|
It looks like the buildbot failures from this test (along with other tests, e.g., encrypted_slave and encrypted_master_switch_to_unencrypted) are caused by the same issue of running out of disk space. Overall, this test needs about 180M of disk space to complete (tested on my laptop). The output resulting from the failure depends on when the VM runs out of disk space and potentially which thread catches the SIGABRT. The list following this paragraph describes the points that the disk can run out of space along with its corresponding failure output. Note, that some of the buildbot failures directly indicate that there is no more room left on the device, and these failures are not listed. To ensure the buildbot VMs have enough space to finish the test, is there a buildbot resource allocator/scheduler which can be adjusted? ==== Master Errors 1) 2013: Lost connection to MySQL server during query. 2) Timeout after 900s. More specifically, a GDB backtrace shows that the server can hang on the call to (in 10.5) wait_for_free_space (filename=0x7fcd980266f8 "<BUILD_DIR>/mysql-test/var/tmp/mysqld.1/#sql-temptable-16717-6-1a.MAD", errors=7) at <SRC_ROOT>/mysys/errors.C:126 ==== Slave Errors 1) 2) 3) (If the SQL thread handles the SIGABRT) 4) ‘2013: Lost connection to MySQL server during query’ with ABRT (signal 6) in a backtrace (not SQL thread). I was not (yet) able to reproduce the SIGSEGV (signal 11) from within rpl_end.inc. | |||||||||||||||||||||||
| Comment by Brandon Nesterenko [ 2021-06-04 ] | |||||||||||||||||||||||
|
The failures were due to a lack of space on the ramdisk of buildbot machines. TODO-2958 was created to increase the overall memory allocated to each machine, and thereby the ramdisk as well, as they are partitioned based on a percentage of the overall memory designated to the machine. |