Implement container monitoring solution for bay

Registered by hongbin

Need a container monitoring solution that monitors containers running in the bay and general usage of the bay COE. For example, we could explore adding support for cAdvisor/heapster for this purpose

Blueprint information

Status:
Complete
Approver:
hongbin
Priority:
Undefined
Drafter:
hongbin
Direction:
Approved
Assignee:
Cristovao Cordeiro
Definition:
New
Series goal:
None
Implementation:
Implemented
Milestone target:
None
Started by
Spyros Trigazis
Completed by
Spyros Trigazis

Related branches

Sprints

Whiteboard

--
strigazi @ Sun Mar 25 16:42:01 UTC 2018

For kubernetes we have the kubernetes-dashboard with heapster and a prometheus, cadvisor, node-exporter, grafana stack.
--
Gerrit topic: https://review.openstack.org/#q,topic:bp/container-monitoring,n,z

Addressed by: https://review.openstack.org/407581
    container monitoring in Swarm with Docker v1.10

Gerrit topic: https://review.openstack.org/#q,topic:bp/https,n,z

Gerrit topic: https://review.openstack.org/#q,topic:bug/1646489,n,z

Usage notes:
    In Swarm with Docker v1.10:
     - monitoring is disabled by default. Enable it by setting the label "monitoring_enabled" to True
     - Grafana comes with a default Docker dashboard and default "admin" password. Use "grafana_admin_passwd" label to overwrite it
     - cadvisor and node-exporter will run on all nodes
     - a Prometheus, Fedora and Grafana container stack are then deployed through the Docker API, all in the same node
     - in v1.10 there are no services, killing that node will destroy the stack. When provisioning a new node the stack should be re-created
     - the final Grafana dashboard will be at http://$NODE_running_stack:3000

Addressed by: https://review.openstack.org/426291
    [WIP][k8s] Monitoring with Prometheus and Grafana

Addressed by: https://review.openstack.org/548731
    [WIP-ignore] Add prometheus monitoring stack to swarm

(?)

Work Items

Work items:
- container monitoring in Swarm with Docker v1.10 : INPROGRESS
- add cadvisor/prometheus/grafana based monitoring to Kubernetes : INPROGRESS

This blueprint contains Public information 
Everyone can see this information.