Worker alarms per node

Registered by Swann Croiset

Right now, the LMA toolchain monitors the availability of the Nova/Cinder/Neutron backend services but when one is down, the alarm doesn't tell which node is guilty. We need to fix this.

Blueprint information

Status:
Complete
Approver:
None
Priority:
High
Drafter:
Swann Croiset
Direction:
Approved
Assignee:
Swann Croiset
Definition:
Review
Series goal:
Accepted for 0.9
Implementation:
Implemented
Milestone target:
milestone icon 0.9.0
Started by
Swann Croiset
Completed by
Simon Pasquier

Related branches

Sprints

Whiteboard

Gerrit topic: https://review.openstack.org/#q,topic:bp/worker-alarms-per-node,n,z

Addressed by: https://review.openstack.org/261307
    Retrieve worker state metrics per hostname

Addressed by: https://review.openstack.org/272169
    Fix the metric of the number of Neutron agent UP

Addressed by: https://review.openstack.org/273090
    [WIP] Add hostname details for faulty workers

Addressed by: https://review.openstack.org/273632
    Add Nova service state details by hostname

Addressed by: https://review.openstack.org/273944
    Add Cinder service state details by hostname

Addressed by: https://review.openstack.org/273966
    Add Neutron agent state details by hostname

(?)

Work Items

This blueprint contains Public information 
Everyone can see this information.

Subscribers

No subscribers.