HA improvements of pacemaker and corosync

Registered by Vladimir Kuklin on 2014-05-28

Current problems with corosync/pacemaker
* Creating and committing shadow CIBs adds constant pain with dependencies and unneeded complexity. It also rewrites cluster attributes, and even other changes, if the ordering is wrong, and this is really hard to debug.
* Puppet’s pacemaker service provider restarts resources even if they are running on other nodes, which sometimes impacts the cluster.
* An attempt to stop or restart the corosync service brings down a lot of resources, will probably fail, and may bring down the entire deployment.
* Controllers other than the first download the already configured CIB and immediately start all cloned resources before they are configured, so they have to be cleaned up later.
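
To illustrate the second problem, here is a minimal sketch of the check a service provider could make before restarting a resource: only restart if the resource is actually running on the local node. It is not the Fuel implementation. It assumes the text output of `crm_resource --resource NAME --locate` has lines of the form `resource NAME is running on: NODE`; the helper names and the resource name `p_example` are hypothetical.

```python
import re
from typing import Set

# Assumed output line format of `crm_resource --resource NAME --locate`:
#   resource p_example is running on: node-1
# Anything after the node name (for example a role suffix) is ignored.
_RUNNING_ON = re.compile(r"^resource\s+(\S+)\s+is running on:\s+(\S+)")


def nodes_running(locate_output: str, resource: str) -> Set[str]:
    """Return the set of nodes on which `resource` is reported as running."""
    nodes = set()
    for line in locate_output.splitlines():
        match = _RUNNING_ON.match(line.strip())
        if match and match.group(1) == resource:
            nodes.add(match.group(2))
    return nodes


def should_restart_locally(locate_output: str, resource: str, local_node: str) -> bool:
    """Restart only when the resource is running on this node.

    If it runs elsewhere (or nowhere), a local restart request is skipped
    so that instances on other nodes are left alone.
    """
    return local_node in nodes_running(locate_output, resource)
```

In this sketch, a restart requested on `node-1` is skipped when the output only reports `node-2`, which avoids disturbing a resource that another controller owns.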

https://docs.google.com/document/d/1emjSKmfBnv_2XW5XY3wx4Wik-l-SXhRm9NXb6HEmpHY/edit?usp=sharing

Manage/unmanage in corosync init scripts
https://bugs.launchpad.net/fuel/+bug/1340172

Blueprint information

Status: Complete
Approver: Sergey Vasilenko
Priority: Essential
Drafter: Dmitry Ilyin
Direction: Approved
Assignee: Dmitry Ilyin
Definition: Approved
Series goal: Accepted for 5.1.x
Implementation: Implemented
Milestone target: 5.1
Started by: Vladimir Kuklin on 2014-07-02
Completed by: Vladimir Kuklin on 2014-07-28

Whiteboard

First, we should discuss proposed improvements and choose which of them we would like to see in our product.

Gerrit topic: https://review.openstack.org/#q,topic:bp/ha-pacemaker-improvements,n,z

Addressed by: https://review.openstack.org/101242
    remove unneeded CIB definition for some OCF resources (part #1)

Gerrit topic: https://review.openstack.org/#q,topic:95764,n,z

Addressed by: https://review.openstack.org/102773
    remove unneeded CIB definition for some OCF resources (part #2)

Addressed by: https://review.openstack.org/103642
    Corosync service improvements

Addressed by: https://review.openstack.org/106110
    Make simple resources to be unbanned everywhere

Gerrit topic: https://review.openstack.org/#q,topic:bug/1340260,n,z

Work Items

Work items:
Discuss which changes we are going to implement: INPROGRESS
