[MDBF-414] bb-rhel8-docker create routine to clean podman system Created: 2022-05-13 Updated: 2022-09-19 Resolved: 2022-09-19 |
|
| Status: | Closed |
| Project: | MariaDB Foundation Development |
| Component/s: | Buildbot |
| Affects Version/s: | None |
| Fix Version/s: | N/A |
| Type: | Task | Priority: | Minor |
| Reporter: | Faustin Lammler | Assignee: | Faustin Lammler |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | 0d | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Description |
|
Find an automated way of cleaning podman system on this machine before /data is full. |
| Comments |
| Comment by Faustin Lammler [ 2022-05-13 ] | ||||||||||||
|
/data was almost full, danblack maybe you have an idea... | ||||||||||||
| Comment by Daniel Black [ 2022-05-13 ] | ||||||||||||
|
An error condition on last docker library test.
Frequent $arch-ubuntu-$lts-deb-autobake failures due to incorrect dependencies not installed onto builders (fix) aren't a cause as pmem (10.7+) doesn't use ubuntu-18.04. As the delta of builds across arches has reduced maybe the week gap can be reduced. But some more measurement is needed (ppc64le focal 2 days). Builds terminated by the hard SIGKILL of buildbot shutdown aren't cleaned up. | ||||||||||||
| Comment by Daniel Black [ 2022-05-16 ] | ||||||||||||
|
Receiving a trigger from a $arch-ubuntu-$lts-deb-autobake builder with haltOnFailure=False set, with the failure trigger, could enable the running containers and the containers from the other architectures to be cleared. | ||||||||||||
| Comment by Daniel Black [ 2022-05-16 ] | ||||||||||||
|
Noting implementation. Note build on L87 builds without a tag, runs a test (L118) (most of the bb time), before adding it to a manifest on L188. A cron pruning of untagged images is dangerous in removing good images that have their test being run. What is needed in script is:
| ||||||||||||
| Comment by Faustin Lammler [ 2022-05-16 ] | ||||||||||||
|
Ok, the jira task was updated and I did not refresh the page... | ||||||||||||
| Comment by Faustin Lammler [ 2022-05-16 ] | ||||||||||||
|
> What is needed in script is: Sounds good, how do you want to proceed? | ||||||||||||
| Comment by Daniel Black [ 2022-05-16 ] | ||||||||||||
|
Disable the cron pruning. Its breaking thinks like:
Then let's see the kinds of leftovers we get. Like the test script trap "killoff" EXIT is needed in the bb script to be called on the failure condition to clean the environment depending on the image build, image text, manifest push stage. If you want to try writing a PR for this please do so. | ||||||||||||
| Comment by Faustin Lammler [ 2022-05-16 ] | ||||||||||||
|
> Disable the cron pruning I did not activate any pruning routine so far. > If you want to try writing a PR for this please do so. Ok will try that in the next days. | ||||||||||||
| Comment by Faustin Lammler [ 2022-05-18 ] | ||||||||||||
|
First attempt in https://github.com/MariaDB/mariadb.org-tools/pull/156, need testing and polishing. | ||||||||||||
| Comment by Faustin Lammler [ 2022-06-15 ] | ||||||||||||
|
The routine is probably missing something or needs improvements as /data is almost full:
If you danblack want to take a look before I do some manual cleaning, it would be good. | ||||||||||||
| Comment by Daniel Black [ 2022-06-15 ] | ||||||||||||
|
The answer is obvious enough from bb status For each branch there should be 3 (< 10.5) or 4 (>= 10.5) builds on the amd64-rhel8-docker. There are so many failures its filling up quicker than the purging can handle. cleanups only happen after a successful push, so 2 days ago. 7 day backlog of amd64-2004-deb-autobake isn't helping. 10.3, and soon to be all others failing because of https://github.com/MariaDB/buildbot/blob/main/scripts/docker-library-build-and-test.sh#L90 and https://github.com/MariaDB/mariadb-docker/blob/next/update.sh#L31 cause https://github.com/MariaDB/server/pull/2141 / Also https://github.com/MariaDB/mariadb.org-tools/pull/81 for cleaning when there is a failure in the dependency. | ||||||||||||
| Comment by Faustin Lammler [ 2022-08-09 ] | ||||||||||||
|
Disk was increased by 50GB. | ||||||||||||
| Comment by Faustin Lammler [ 2022-09-19 ] | ||||||||||||
|
Problem fixed with :
The machine was also moved to a stronger (and more reliable builder). |