large scale hosts failure

Registered by suzhengwei on 2020-06-01

According to the workload, some instances may have a low value, while other instances have a high value to the operator or users.

When large scale hosts failed in the same time, the reouses is decreasing. What's worse, there is not enough resources to recovery all the instances.

We should make sure the instances recovery in order, while instances with high value should recovery at first. We also need to bring some mechanism, in case that the cloud platform chaotically recovery and lost stability.

Blueprint information

Status:
Not started
Approver:
None
Priority:
Undefined
Drafter:
suzhengwei
Direction:
Needs approval
Assignee:
suzhengwei
Definition:
New
Series goal:
None
Implementation:
Unknown
Milestone target:
None

Related branches

Sprints

Whiteboard

Addressed by: https://review.opendev.org/732477
    promotion for large scale hosts failure

(?)

Work Items

This blueprint contains Public information 
Everyone can see this information.

Subscribers

No subscribers.