infra - Set up monitoring for key infrastructure servers
Registered by
Steve Varnau
Jenkins does a fair job of monitoring disk space on the slave test machines, and takes them offline when less than 1 GB is available.
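For reference, the kind of disk-space check described above can be sketched in a few lines. This is only an illustration, not Jenkins' actual implementation: the function names are hypothetical, and treating "1 GB" as 2**30 bytes is an assumption.

```python
import shutil

# Threshold from the description above: less than 1 GB free.
# Assumption: "1 GB" is interpreted as 2**30 bytes.
MIN_FREE_BYTES = 1 * 1024**3


def has_enough_space(free_bytes, min_free_bytes=MIN_FREE_BYTES):
    """Return True when the free space meets the threshold."""
    return free_bytes >= min_free_bytes


def check_path(path="."):
    """Check the filesystem that holds `path` (hypothetical helper)."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free
    return has_enough_space(usage.free)
```

A cron job or Puppet-managed script could call `check_path` for each mount point and send a notification when it returns False.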
We need additional monitoring to alert us when there are problems with other nodes:
- static
- wiki
- review
- jenkins01
- puppet
- dashboard
- mail
Perhaps this can start with simple Puppet checks and notifications.
Perhaps a Nagios/Cacti server is called for, though that is a bigger undertaking.
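As a starting point short of a full Nagios/Cacti deployment, a simple reachability check over the nodes listed above might look like the sketch below. The node names come from the list, but using them as resolvable hostnames, the port numbers, and all function names are assumptions made for illustration.

```python
import socket

# Node names from the list above. Using them as hostnames and the
# port chosen for each one are assumptions for this sketch.
NODES = {
    "static": 80,
    "wiki": 80,
    "review": 443,
    "jenkins01": 8080,
    "puppet": 8140,
    "dashboard": 80,
    "mail": 25,
}


def tcp_probe(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # covers refused connections, timeouts, DNS failures
        return False


def find_down_nodes(nodes, probe=tcp_probe):
    """Return the sorted names of nodes whose probe fails."""
    return sorted(name for name, port in nodes.items() if not probe(name, port))


def format_alert(down):
    """Build a one-line alert message, or an empty string if all is well."""
    if not down:
        return ""
    return "ALERT: unreachable nodes: " + ", ".join(down)
```

The probe is passed in as a parameter so the check can be exercised without network access. A TCP connect only shows that a port is open, so it would not catch a full disk or a hung service; that gap is where a dedicated monitoring server would add value.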
Blueprint information
- Status: Not started
- Approver: None
- Priority: Low
- Drafter: Steve Varnau
- Direction: Needs approval
- Assignee: None
- Definition: Approved
- Series goal: None
- Implementation: Unknown
- Milestone target: None
- Started by
- Completed by