This is the documentation for CDH 4.6.0.
Documentation for other versions is available at Cloudera Documentation.

Ways To Install CDH4

You can install CDH4 in any of the following ways:

  • Automated method using Cloudera Manager (see the Cloudera Manager installation instructions). Cloudera Manager automates the installation and configuration of CDH4 on an entire cluster, provided you have root or password-less sudo SSH access to the cluster's machines.
      Note: Cloudera recommends that you use the automated method if possible.
  • Manual methods described below:
    • Download and install the CDH4 "1-click Install" package
    • Add the CDH4 repository
    • Build your own CDH4 repository
    • Install from a CDH4 tarball; see How Packaging Affects CDH4 Deployment.

The following instructions describe downloading and installing the "1-click Install" package, adding a repository, and building your own repository.

If you install manually rather than with Cloudera Manager, downloading and installing the "1-click Install" package is recommended in most cases because it is simpler than adding or building a repository.

How Packaging Affects CDH4 Deployment

Installing from Packages

Installing from a Tarball

  Note: The instructions in this Installation Guide are tailored for a package installation, as described in the sections that follow, and do not cover installation or deployment from tarballs.
  • If you install CDH4 from a tarball, you will install YARN. Read the discussion of YARN under New Features before you proceed.
  • As of CDH 4.3.0, there is no separate tarball for MRv1. Instead, the MRv1 binaries, examples, etc., are delivered in the Hadoop tarball itself. The scripts for running MRv1 are in the bin-mapreduce1 directory in the tarball, and the MRv1 examples are in the examples-mapreduce1 directory.
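
If you work from a tarball, it can help to confirm that the MRv1 directories named above are present after you extract it. The following Python sketch is illustrative only and is not part of CDH4: the function name find_mrv1_dirs and the hadoop_home argument are hypothetical, and the sketch assumes that both directories sit directly under the top-level directory of the extracted Hadoop tarball.

```python
from pathlib import Path

# Directory names taken from the description above.
MRV1_DIRS = ("bin-mapreduce1", "examples-mapreduce1")


def find_mrv1_dirs(hadoop_home):
    """Map each MRv1 directory name to its path, or to None if it is absent.

    hadoop_home is assumed to be the top-level directory of an extracted
    Hadoop tarball (a hypothetical location chosen by the caller).
    """
    root = Path(hadoop_home)
    found = {}
    for name in MRV1_DIRS:
        candidate = root / name
        # Only report the entry if it exists and is a directory.
        found[name] = candidate if candidate.is_dir() else None
    return found
```

Call find_mrv1_dirs with the path where you extracted the tarball. A value of None for either name suggests that the tarball predates CDH 4.3.0 or that the path does not point at the top-level directory.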