masakari

large scale hosts failure

Registered by suzhengwei on 2020-06-01

According to the workload, some instances may have a low value, while other instances have a high value to the operator or users.

When large scale hosts failed in the same time, the reouses is decreasing. What's worse, there is not enough resources to recovery all the instances.

We should make sure the instances recovery in order, while instances with high value should recovery at first. We also need to bring some mechanism, in case that the cloud platform chaotically recovery and lost stability.

Read the full specification

Blueprint information

Status:: Not started

Approver:: None

Priority:: Medium

Drafter:: suzhengwei

Direction:: Needs approval

Assignee:: suzhengwei

Definition:: Review

Series goal:: Accepted for yoga

Implementation:: Unknown

Milestone target:: None

Related branches

Related bugs

Sprints

Whiteboard

Spec
https://review.opendev.org/732477

(?)

Work Items

Dependency tree

* Blueprints in grey have been implemented.

This blueprint contains Public information

Everyone can see this information.

Subscribers

No subscribers.