Compute manager restart during live-migration

Registered by Loganathan Parthipan on 2014-02-07

Currently, when a compute manager restarts during a live migration, the instance's task_state stays stuck at 'migrating', and there is effectively no way to move forward, roll back, or reset the task_state.

This blueprint proposes using the primitives provided by the blueprint https://blueprints.launchpad.net/nova/+spec/live-migrate-instance-actions and implementing the recovery within the framework provided by https://blueprints.launchpad.net/nova/+spec/recover-stuck-state.

In compute manager init, if an instance's task_state is 'migrating', we check whether the live migration is still in progress. If so, we cancel the hypervisor job, roll back, and set task_state to None. If the instance is already in post-live-migration, we should either try to complete it or, for now, just set the vm_state to ERROR.
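
The recovery decision described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not Nova code: the names Instance, MigrationPhase, FakeDriver, cancel_migration_job, rollback_live_migration and recover_migrating_instance are all hypothetical, and how the phase of the migration is actually detected is left open, since it depends on the primitives from the blueprints referenced above.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class MigrationPhase(Enum):
    """Hypothetical phases a stuck migration can be found in at init."""
    LIVE_MIGRATING = "live-migrating"
    POST_LIVE_MIGRATION = "post-live-migration"


@dataclass
class Instance:
    """Hypothetical, minimal stand-in for an instance record."""
    uuid: str
    vm_state: str
    task_state: Optional[str]


class FakeDriver:
    """Hypothetical stand-in for a hypervisor driver; it only records calls."""

    def __init__(self):
        self.calls = []

    def cancel_migration_job(self, instance):
        self.calls.append(("cancel", instance.uuid))

    def rollback_live_migration(self, instance):
        self.calls.append(("rollback", instance.uuid))


def recover_migrating_instance(instance, phase, driver):
    """Apply the recovery rule proposed in this blueprint to one instance."""
    if instance.task_state != "migrating":
        # Not a stuck live migration; nothing to recover.
        return "untouched"

    if phase is MigrationPhase.LIVE_MIGRATING:
        # Still mid-migration: cancel the hypervisor job, roll back,
        # and clear the task_state.
        driver.cancel_migration_job(instance)
        driver.rollback_live_migration(instance)
        instance.task_state = None
        return "rolled_back"

    # Post-live-migration: completing it is the better long-term answer;
    # for now just flag the instance. "ERROR" is a placeholder string here.
    # The blueprint does not say what happens to task_state in this case,
    # so it is left unchanged.
    instance.vm_state = "ERROR"
    return "error"
```

In this sketch the compute manager's init would call recover_migrating_instance once per instance found on the host, passing in whatever phase it determined for that instance.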

Blueprint information

Status:
Not started
Approver:
None
Priority:
Undefined
Drafter:
Loganathan Parthipan
Direction:
Needs approval
Assignee:
Pawel Koniszewski
Definition:
Drafting
Series goal:
None
Implementation:
Unknown
Milestone target:
None

Related branches

Sprints

Whiteboard

Removed from next, as next is now reserved for near misses from the last milestone --johnthetubaguy

If you are still working on this, please re-submit via nova-specs. If not, please mark as obsolete, and add a quick comment to describe why. --johnthetubaguy (20th April 2014)


Work Items

Dependency tree


This blueprint contains Public information 
Everyone can see this information.