Provide Cloudera Hadoop packages (CDH3)

Registered by Mathias Gug

As a Hadoop admin I download CDH3 packages from Cloudera and can easily run them on Ubuntu.

Blueprint information

Status: Complete
Approver: Robbie Williamson
Priority: High
Drafter: James Page
Direction: Approved
Assignee: Ubuntu Server
Definition: Approved
Series goal: Accepted for oneiric
Implementation: Implemented
Milestone target: ubuntu-11.10-beta-1
Started by: James Page
Completed by: Dave Walker

Whiteboard

Packaging Review:

See http://pad.ubuntu.com/cloudera-hadoop-packaging

UDS-O Session Notes:

First-pass packaging is done in a PPA; it needs to be checked for archive compatibility.
There are open questions over the quality of the packaging.

Target: CDH3 for 11.10

Cloudera are recommending java-[oracle|sun]
Would need to be validated on OpenJDK to be in the archive
CDH3 has issues running on OpenJDK (Fedora tested it on OpenJDK 6 and it ran OK; it doesn't run on OpenJDK 7)
 * If the requirement on [oracle|sun]jdk is real and hard, then a Partner or PPA repo is probably the best we can do
 * Otherwise, if we can get a commitment for help getting Hadoop working on OpenJDK, then we can and should target Ubuntu Universe

To determine whether we need to put CDH in Universe or main, we first need to work out how modules are made visible in Orchestra:
 - who controls the list?
 - what rules to accept new items?
 - how is it published?
If the above allows this list to contain items from a PPA or the Partner repo, then the Universe/main requirement would be fairly low

Work items (oneiric-alpha-2):
[james-page] Review packaging to date and work with iamfuzz: DONE
[kirkland] Discuss with Cloudera the build/runtime dependencies with [oracle|sun]jdk: INPROGRESS

Work items:
[negronjl] ensemble formula for deploying hadoop: DONE
