NaaS core design proposal

Registered by Uri Elzur

NaaS core design proposal, including interaction with Compute Services and the basic feature set

Blueprint information

Status: Complete
Approver: None
Priority: Undefined
Drafter: Uri Elzur
Direction: Needs approval
Assignee: None
Definition: Obsolete
Series goal: None
Implementation: Unknown
Milestone target: None
Completed by: Vish Ishaya

Whiteboard

This was moved into its own project. Network projects can be found at https://launchpad.net/netstack --vish
This blueprint is being redrafted. Uri Elzur will be happy to hear from you if you wish to contribute. Nothing here is committed.
________________________________________
• Launchpad Entry: network-service
• Created: 20 April 2011
• Contributors: Ewan Mellor, Salvatore Orlando, Youcef Laribi, and Uri Elzur
Summary
The core functionality of OpenStack Network-as-a-Service (NaaS) is to provide a customer-facing service for creating and managing networks, intended as a "collection of virtual ports with shared connectivity".
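As a rough illustration only, the following Python sketch models that abstraction: a network is little more than a container of ports, and a port records what is plugged into it. All names and fields are assumptions, not committed API.

    # Minimal sketch of the two core NaaS resources; purely illustrative.
    import uuid

    class Network(object):
        """A virtual network: a collection of ports with shared connectivity."""
        def __init__(self, tenant_id, label, shared=False):
            self.id = str(uuid.uuid4())
            self.tenant_id = tenant_id    # owning customer; None for CSP-owned
            self.label = label
            self.shared = shared          # shared (public) vs. private network
            self.ports = {}               # port_id -> Port

        def create_port(self):
            port = Port(self.id)
            self.ports[port.id] = port
            return port

    class Port(object):
        """A logical attachment point on a network."""
        def __init__(self, network_id):
            self.id = str(uuid.uuid4())
            self.network_id = network_id
            self.attachment = None        # e.g. a VIF identifier once plugged

    net = Network(tenant_id="t-1", label="private-net")
    p = net.create_port()
    p.attachment = "vif-1"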
Rationale
A network created with the core NaaS API can be regarded as a virtual network switch, with related network devices attached to it, which potentially spans all the compute nodes in the cloud. NaaS APIs should be decoupled from the actual implementation of the core service, which should be provided by a plugin implementing the core NaaS API. This implies that NaaS does not mandate any specific model for created networks (e.g.: VLANs, IP tunnels). However, the NaaS API may present some requirements that a plugin should support.
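To make the decoupling concrete, here is a hypothetical sketch of the contract a core plugin might implement. The class and method names are illustrative; the blueprint does not define this interface yet.

    # Hypothetical core plugin contract; names are illustrative only.
    import abc

    class CoreNaasPlugin(abc.ABC):
        """How a network is realised (VLANs, tunnels, OpenFlow, ...) is
        entirely up to the plugin; the API layer stays model-agnostic."""

        @abc.abstractmethod
        def create_network(self, tenant_id, label):
            """Return an opaque identifier for the new network."""

        @abc.abstractmethod
        def delete_network(self, network_id):
            """Tear down the network and its ports."""

        @abc.abstractmethod
        def create_port(self, network_id):
            """Return an opaque identifier for a new port on the network."""

        @abc.abstractmethod
        def plug_interface(self, network_id, port_id, vif_id):
            """Attach a VIF to a port, on behalf of the compute service."""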
The core NaaS service can also be regarded as a container for higher-level services, for instance DHCP and NAT. Higher-level services will come with their own API and implementation; they are discussed in detail in the NaaS-Higher-Layer blueprint.
Goals
Goal 1: Allow customers and CSPs to create networks. Networks can either be private, i.e.: available only to a specific customer, or shared. Networks shared only among a specific group of customers can also be considered.
Goal 2: Allow customers and CSPs to manage virtual ports for their networks, and attach instances or other network appliances (physical or virtual) available in the cloud to them.
Goal 3: Allow customers and CSPs to extend their networks from the cloud to a remote site, by attaching a bridging device (L2? L3?) within the cloud to their networks; the bridging device would then bridge to the appropriate remote site.
Goal 4: Allow customers and CSPs to securely configure network policies for networks, ports, and devices attached to them. These policies can include, for instance, port security policies, access control lists, high availability or QoS policies (which are typically available on physical network switches). A basic set of VIF and virtual switch configuration options will be supported. Since a 'minimum set' of policies supported by each possible plugin can hardly be identified, network policies should be assumed to be plugin-specific and always be configured through API extension mechanisms (a hedged sketch follows the goals list).
Goal 5: Allow CSPs to register and configure the plugins providing the actual implementation of the core service, as well as register and configure plugins for higher-level services such as DHCP, Firewall or Load Balancing. CSPs should be able to select and plug in third-party technologies as appropriate. This may be for extended features, improved performance, or reduced complexity or cost.
Goal 6: Allow CSPs to provide a secure auto-configuring network upon workload migration and select, register and configure plugins accordingly.
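Because Goal 4 leaves policies plugin-specific, a client would reach them through an extension namespace rather than the core API. The sketch below shows what such a call might look like; the URL layout, headers, and payload shape are pure assumptions.

    # Hypothetical helper: PUT a plugin-specific policy document onto a
    # port via an extension URL. Endpoint layout and payload are assumed.
    import json
    import urllib.request

    def set_port_policy(endpoint, token, network_id, port_id, policy):
        url = "%s/extensions/policies/networks/%s/ports/%s" % (
            endpoint, network_id, port_id)
        req = urllib.request.Request(
            url,
            data=json.dumps(policy).encode("utf-8"),
            headers={"Content-Type": "application/json",
                     "X-Auth-Token": token},
            method="PUT")
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)

    # Example (shape is plugin-defined, e.g. a port-security ACL):
    # set_port_policy("http://naas:9696/v1.0", "TOKEN", "net-1", "port-1",
    #                 {"port-security": {"allowed-macs": ["aa:bb:cc:dd:ee:ff"]}})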
Use cases
Use case 1: Create private network and attach instance to it
Related topology: Each tenant has an isolated network. Tenants are free to do whatever they want on their network, because it is completely isolated from those of other tenants. This is the NASA Nebula model as of today, implemented using one VLAN per tenant. Note that although this same model could equally be implemented with different technologies (e.g. GRE tunnels instead of VLANs for isolation), this would not change the nature of the model itself.
1. Customer uses the Core NaaS API to create a Network;
2. On success, the Core NaaS API returns a unique identifier for the newly created network;
3. Customer uses Core NaaS API for configuring a logical port on the network;
4. Customer invokes Cloud Controller API to run an instance, specifying network and virtual port for it;
5. Cloud Controller API dispatches request to compute service;
6. Compute service checks instance requirements (if provided, e.g. compute resources, security, QoS) and attempts to place the workload (see the Scheduler blueprint). Upon success, the Compute Service creates the VM and its VIFs. For each VIF, it asks NaaS Core to check instance requirements (if provided, e.g. connectivity, BW, security, QoS); upon success, NaaS Core plugs the VIF into the port and network specified by the Customer in step 4.
7. Customer uses the Core NaaS API to configure the logical port on the network (a client-side sketch of this flow follows the list).
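Since the real NaaS API is not yet designed, the following toy sketch traces the call order of steps 1-7 against an in-memory stand-in; every name in it is hypothetical.

    # In-memory stand-in tracing steps 1-7; all names hypothetical.
    class ToyNaas(object):
        def __init__(self):
            self.networks = {}
            self.ports = {}

        def create_network(self, label):            # steps 1-2
            net_id = "net-%d" % (len(self.networks) + 1)
            self.networks[net_id] = {"label": label, "ports": []}
            return net_id

        def create_port(self, net_id):              # step 3 (and step 7 config)
            port_id = "port-%d" % (len(self.ports) + 1)
            self.ports[port_id] = {"network": net_id, "attachment": None}
            self.networks[net_id]["ports"].append(port_id)
            return port_id

        def plug(self, port_id, vif_id):            # step 6, for compute
            self.ports[port_id]["attachment"] = vif_id

    naas = ToyNaas()
    net = naas.create_network("tenant-private")
    port = naas.create_port(net)
    # Steps 4-5: the run-instance request to the Cloud Controller carries
    # (net, port); the compute service then asks NaaS to plug each VIF:
    naas.plug(port, vif_id="vif-1")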
Use case 2: Attach instance to default public network
Related topology: Similar to the 'Flat' mode currently supported by nova-network. Instances from different customers are all deployed on the same virtual network. In this case, the Core NaaS service can provide a port isolation policy in order to ensure VM security.
1. Customer uses the Core NaaS API to retrieve public networks;
2. On success, the Core NaaS API returns a list of unique network identifiers; the Customer selects a network from this list;
3. Customer uses Core NaaS API for configuring a logical port on the network;
4. Customer invokes Cloud Controller API to run an instance, specifying network and virtual port for it;
5. Cloud Controller API dispatches request to compute service;
6. Compute service checks instance requirements (if provided, e.g. compute resources, security, QoS) and attempts to place the workload (see the Scheduler blueprint). Upon success, the Compute Service creates the VM and its VIFs. For each VIF, it asks NaaS Core to check instance requirements (if provided, e.g. connectivity, BW, security, QoS); upon success, NaaS Core plugs the VIF into the port and network specified by the Customer in step 4.
7. Customer uses the Core NaaS API to configure the logical port on the network;
The main difference between this use case and the previous one is that the Customer uses a pre-configured network instead of creating it. Another point that needs to be discussed is whether customers should be allowed to manage ports for public networks. Alternatively, the compute service can implicitly create a port when it contacts the Core NaaS API.
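To illustrate the implicit-port alternative just mentioned, a minimal compute-side helper might look like this. It works against any object exposing create_port()/plug(), such as the ToyNaas stand-in from the use case 1 sketch, and is not committed design.

    # Toy illustration: the compute service creating the port implicitly
    # when it contacts the Core NaaS API, instead of the customer.
    def plug_vif_implicitly(naas, net_id, vif_id):
        """Compute-side helper: create the port on demand, then plug the VIF."""
        port_id = naas.create_port(net_id)   # implicit, not customer-managed
        naas.plug(port_id, vif_id)
        return port_id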
Use case 3: Register bridge for connecting cloud network to other site
Related topology: Customer on-premise data center extending into the cloud, or interconnecting networks in distinct clouds. Although the actual implementation can be provided in several ways, we are interested in the abstract model: a single connectivity domain spanning two or more networks in distinct administrative domains.
1. Customer uses Core NaaS API to register a bridge for its network;
2. On success the Core NaaS API returns the bridge identifier;
3. Customer uses Core NaaS API to provide bridge configuration (e.g.: remote endpoint, port, credentials);
4. Customer uses Core NaaS API to create a virtual port on the network for the bridge device;
5. Customer uses the Core NaaS API to plug the bridge device into the network (a toy sketch of steps 1-5 follows).
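Again as a toy stand-in, the five steps above might map onto calls like these; bridge attributes (remote endpoint, port, credentials) follow step 3, and all names and values are illustrative.

    # Toy mapping of steps 1-5; config keys and values are illustrative.
    class ToyBridgeApi(object):
        def __init__(self):
            self.bridges = {}

        def register_bridge(self, net_id):           # steps 1-2
            bridge_id = "bridge-%d" % (len(self.bridges) + 1)
            self.bridges[bridge_id] = {"network": net_id, "config": None,
                                       "port": None}
            return bridge_id

        def configure_bridge(self, bridge_id, remote_endpoint, port, creds):
            self.bridges[bridge_id]["config"] = {    # step 3
                "remote_endpoint": remote_endpoint,
                "port": port,
                "credentials": creds}

        def plug_bridge(self, bridge_id, port_id):   # steps 4-5
            self.bridges[bridge_id]["port"] = port_id

    api = ToyBridgeApi()
    b = api.register_bridge("net-1")
    api.configure_bridge(b, "remote.example.com", 4789, creds="psk:...")
    api.plug_bridge(b, port_id="port-7")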
Plug-in use cases
In this section, we give examples of the technologies that a service provider may wish to plug in at various points in the infrastructure. Some of these technologies are commercial in nature. Where this is the case, this blueprint will include bindings for an open-source alternative, so that the NaaS API is completely supported by open-source software. It is a goal of this blueprint that a service provider may select different implementation technologies.
Category: example technologies
• Distributed virtual switches: VMware DVS, Cisco Nexus 1000V, NOX OpenFlow Controller
• Inter-datacenter tunnels: (no examples listed yet)
Requirements
R1. Add a first-class, customer-facing service for management and configuration of network infrastructure via a RESTful API. This service shall be known as openstack-NaaS, and the API that it exposes shall be known as the OpenStack NaaS API.
R2. Modify nova-compute to obtain network details by calling openstack-NaaS through its public API, rather than calling nova-network as it does today (a hedged sketch follows this list).
... lots more coming here, don't worry!
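A hedged sketch of R2: nova-compute fetching network details over the yet-to-be-defined OpenStack NaaS REST API instead of calling nova-network. The endpoint, path, and response shape are all assumptions.

    # Hypothetical compute-side call replacing today's nova-network lookup.
    import json
    import urllib.request

    def get_network_details(endpoint, token, network_id):
        req = urllib.request.Request(
            "%s/networks/%s" % (endpoint, network_id),
            headers={"Accept": "application/json", "X-Auth-Token": token})
        with urllib.request.urlopen(req) as resp:
            # Assumed shape, e.g. {"id": ..., "label": ..., "ports": [...]}
            return json.load(resp)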
Design Ideas
At this stage, any attempt to provide a design for Core NaaS would be quite unrealistic. This section should therefore be regarded as a first attempt to define general design guidelines for NaaS.
At a very high level, Core NaaS and its interactions with other entities (compute service, nova database, plugins) can be summarized as follows:

• The NaaS Core API serves requests from customers concerning networks and virtual ports. Requests may pertain to connecting VMs or to requesting network services (e.g. FW or LB). Requests to connect VMs are sent to the Cloud Controller, which coordinates with the Scheduler (see the Scheduler blueprint) on a workload (i.e. VM(s)) placement request.
• The Compute Service attempts to allocate a Worker (server); upon success, the Scheduler informs the Cloud Controller of the selected Worker. The Cloud Controller dispatches a request to the Network Service/NaaS Core along with the identity of the Worker selected for the workload (assume that NaaS Core is part of the Network Service).
• The NaaS Core API layer then dispatches the request to the core plugin implementation connected to the selected Worker's Core Plugin Network Agent (which is not part of Core NaaS). The NaaS Core Plugin and the Worker's Core Plugin Network Agent SHOULD authenticate each other. The Worker's Core Plugin Network Agent then enforces network/port configuration on the hypervisor using proprietary mechanisms. (We need to discuss how to overcome local HV allocation of MACs and local HV vSwitch configuration (including ACLs, filtering, spoofing, etc.) and how to ensure that no other configuration of SW/HW may take place.) A schematic sketch of this dispatch path follows the list.
• NaaS Core SHOULD allocate the customer's requested resources. For best coordination, NaaS Core notifies the Compute Service when the network is ready. In case of a failure, NaaS Core notifies the Cloud Controller (with the identification of the failed location/Worker). The Cloud Controller will then attempt a new location/Worker for placement of the workload.
• The Core Plugin Network Agent SHOULD support a basic vSwitch. The vSwitch's services, its interface with the vEth (VIF), and its configuration options are to be defined. The vSwitch should support checking for network connectivity, BW, QoS, HA and security, reporting to NaaS Core (via the Core Plugin) as defined in the request API.
• A secure Core Plugin Network Agent MAY be requested. A secure Core Plugin Network Agent SHOULD authenticate itself to the local Worker and SHOULD provide a Secure vSwitch.
• A Secure vSwitch MAY provide the following services:
o Strong (TPM-supported) vSwitch to Core NaaS Network Module authentication
o Core NaaS Network Module controlled ACL setting/enforcement on vSwitches (the network entry point)
o Strong (TPM-supported) authenticated ACL changes (e.g. upon migration)
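The dispatch path and the mutual-authentication requirement above can be summarized schematically as follows; the channel abstraction, certificate checks, and return value are all stand-ins for whatever the plugin and agent actually use.

    # Stand-in objects for the dispatch path: API layer -> core plugin ->
    # per-Worker network agent, refusing to act without mutual auth.
    class AgentChannel(object):
        def __init__(self, worker_id, agent_cert, core_cert):
            self.worker_id = worker_id
            self.agent_cert = agent_cert   # placeholder for agent identity
            self.core_cert = core_cert     # placeholder for NaaS Core identity

        def mutually_authenticated(self):
            # Placeholder for certificate (or TPM-backed) verification in
            # both directions, per the SHOULD clauses above.
            return self.agent_cert is not None and self.core_cert is not None

    def dispatch_plug_request(channel, network_id, port_id, vif_id):
        if not channel.mutually_authenticated():
            raise PermissionError("core plugin and agent failed mutual auth")
        # Here the agent would enforce network/port configuration on the
        # hypervisor's vSwitch using plugin-proprietary mechanisms.
        return {"worker": channel.worker_id, "network": network_id,
                "port": port_id, "vif": vif_id, "status": "plugged"}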

• The Compute service should not have any knowledge of the hypervisor's networking stack; it uses the NaaS Core API for plugging VIFs into networks (this is slightly different from the current nova design, where part of the network manager's code, namely setup_compute_network, is used by the compute service as well);
• Although the diagram supposes a plugin has a 'manager' component on the NaaS node and an 'agent' component, this might not always be true, as NaaS should be completely agnostic w.r.t. plugin implementations; however, basic services as outlined above SHOULD be supported by the plugin.
• Just like today's nova-network service, Core NaaS uses the nova DB. However, the database table for describing networks will be much simpler, as it will not contain information about higher-level services, such as VPN. Customer-network association can still be performed on a project basis, but it might be worth considering a 1:N association between networks and projects. IP information for networks is a slightly different matter, and will be discussed later in this blueprint.
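As a guess at the "much simpler" networks table, a sketch using SQLAlchemy's declarative style is shown below; the column set (and the 1:N project association) is speculative.

    # Speculative minimal schema, SQLAlchemy declarative style.
    from sqlalchemy import Column, String, create_engine
    from sqlalchemy.orm import declarative_base

    Base = declarative_base()

    class Network(Base):
        __tablename__ = "networks"
        id = Column(String(36), primary_key=True)   # UUID
        label = Column(String(255))
        project_id = Column(String(255))            # or a join table for 1:N
        # No VPN/DHCP/NAT columns: higher-level services keep their own state.

    engine = create_engine("sqlite://")             # in-memory, sketch only
    Base.metadata.create_all(engine)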
As far as IP configuration is concerned, this aspect can be regarded as borderline between the core service and a higher-layer service. On the one hand, given that the aim of the core service is to provide basic connectivity only, just as a switch provides network connectivity to servers in a rack, IP configuration should be regarded as a higher-layer service; for instance, a DHCP service is an example of a higher-level service for providing instances with IP configuration. On the other hand, it might rightly be argued that IP addressing is such a fundamental service that it should be part of Core NaaS; the NaaS Core API would then also provide functionality for configuring IP subnets in terms of CIDR, netmask, and gateway. However, the implementation of the IP API could be provided by a different plugin, and several plugins providing different IP configuration strategies could coexist on the NaaS node (e.g.: DHCP and IP injection). Each VIF on each VM will need an IP address (IPv4 or IPv6); these are given to the VM by one of the schemes described here, which can be implemented by different plugins:
Scheme 1: Agent: Use an agent inside the VM to set the IP address details. The agent receives the configuration through a virtualization-specific scheme (XenStore, VMCI, etc.) or through a more generic scheme such as a virtual CD-ROM. This scheme of course requires installation of the agent in the VM image.
Scheme 2: DHCP: Configure the VM to use DHCP (usually the default anyway) and configure a DHCP server before booting the VM. This may be a private DHCP server, visible only to that VM, or it could be a server that is shared more widely.
Scheme 3: Filesystem modification: Before booting the VM, set the IP configuration by directly modifying its filesystem.
If IP configuration is deemed part of the core API, the following goal should then be added to the list above:
Goal 7: Customers should be able to configure IP subnets for their networks. Zero, one, or more IP subnets can be associated with a network.
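If subnets do land in the core API, a subnet resource might carry little more than the following; the naive address allocator is only there to show that schemes 1-3 differ in how an address reaches the guest, not in how it is chosen. Everything here is illustrative.

    # Illustrative subnet resource with a naive sequential allocator.
    import ipaddress

    class Subnet(object):
        def __init__(self, network_id, cidr, gateway=None):
            net = ipaddress.ip_network(cidr)        # handles IPv4 and IPv6
            self.network_id = network_id
            self.cidr = str(net)
            self.netmask = str(net.netmask)
            self.gateway = gateway or str(next(net.hosts()))
            self._pool = net.hosts()                # naive: no release/reuse
            next(self._pool)                        # skip the gateway address

        def allocate(self):
            """Next free address for a VIF; schemes 1-3 above only differ
            in how this value reaches the guest, not in how it is chosen."""
            return str(next(self._pool))

    sub = Subnet("net-1", "10.0.0.0/24")
    print(sub.gateway, sub.allocate())              # 10.0.0.1 10.0.0.2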
Opens / Questions:
1) Define “Network” – L2 (MACs and VLANs)? L3 (IPv4, IPv6, Tunnel)?
2) Address allocation schemes for L2 and L3 addresses
3) Is the network aware of Zones?
4) Network to Compute Controller interaction: MAC setting, HW config, VM ID to tenant mapping


