Details
-
Task
-
Status: Closed (View Workflow)
-
Major
-
Resolution: Fixed
-
None
Description
Add monitoring to Grafana/Zabbix that shows failing build tasks due to exceptions/builder not ready errors.
Probably the right field is already in Prometheus, so this would be the easy approach.
Another possibility is something along these lines, as Daniel Black suggested:
* for each masterid in https://buildbot.mariadb.org/api/v2/masters?active=true
|
* get the workerid (and name from https://buildbot.mariadb.org/api/v2/masters/{masterid}/workers?paused=false)
|
* then poll each worker https://buildbot.mariadb.org/api/v2/builds?limit=1&order=-started_at&workerid={workerid}&complete=1
|
* if the .builds[0].results ==4 then .builds[0].state_string is the error (combined with the worker name).
|