Overview

Cloudbreak simplifies the provisioning, management, and monitoring of on-demand HDP clusters in virtual and cloud environments. It leverages cloud infrastructure to create host instances, and uses Apache Ambari (via Ambari Blueprints) to install and manage HDP clusters.

Using the Cloudbreak Web UI, REST API, or CLI, you can launch HDP clusters on public cloud infrastructure platforms such as Microsoft Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP), and the private cloud infrastructure platform OpenStack.

Cloudbreak has two main components: the Cloudbreak Application and the Cloudbreak Deployer. The Cloudbreak Application consists of multiple microservices (Cloudbreak, Uluwatu, Sultans, and so on). The Cloudbreak Deployer helps you to deploy the Cloudbreak Application automatically with Docker support. Once the Cloudbreak Application is deployed, you can use it to provision HDP clusters in various cloud environments.

For an architectural overview of the Cloudbreak Deployer, the Cloudbreak Application, Apache Ambari, and other Cloudbreak components, see Architecture.

Installation Options

On a high level, to set up Cloudbreak and provision an HDP cluster, you need to perform the following steps:

  1. Install the Cloudbreak Deployer using one of the two available scenarios:
    • (Option #1) Install the Cloudbreak Deployer on your own VM/host, or
    • (Option #2) Instantiate one of the pre-built cloud images that include Cloudbreak Deployer pre-installed.
  2. Configure the Cloudbreak Deployer and install the Cloudbreak Application.
    Once you have installed Cloudbreak Deployer (called "cbd"), the deployer will start up several Docker containers: Cloudbreak API, Cloudbreak web UI (called "Uluwatu"), Identity Server, and supporting databases. You have successfully completed this step if you are able to log in to Cloudbreak web UI in your browser.
  3. Provision an HDP Cluster using the Cloudbreak Application.

(Option #1) Installing the Cloudbreak Deployer

You can install the Cloudbreak Deployer on your own VM/host manually.

System Requirements: RHEL / CentOS / Oracle Linux 7 (64-bit), Docker 1.9.1, 4GB RAM, 10GB disk, 2 cores recommended

Once Cloudbreak Deployer is installed, use it to set up the Cloudbreak Application. We suggest that you install the Cloudbreak Application as close to your desired HDP clusters as possible. For example, if you plan to launch clusters on AWS, install the Cloudbreak Application on AWS.

If you choose this installation option, follow the instructions for installing the Cloudbreak Deployer. Alternatively, consider using one of the pre-built cloud images that include Cloudbreak Deployer pre-installed.

IMPORTANT: If you plan to use Cloudbreak on Azure, you must use the Azure Setup instructions to configure the image.

(Option #2) Using the Pre-Built Cloud Images

We provide pre-built cloud images with Cloudbreak Deployer pre-installed. The following table includes links to provider-specific instructions for configuring and launching cbd (Cloudbreak Deployer) and then clusters.

VM Requirements: 4GB RAM, 10GB disk, 2 cores recommended

Cloud Cloud Image
AWS Follow the AWS instructions.
Azure There are no pre-built cloud images available for Azure. However, we give you an option to use Azure Resource Manager Templates instead. See the Azure Setup instructions.
GCP Follow the GCP instructions.
OpenStack Follow the OpenStack instructions.

Learn More

For more information on Cloudbreak, Ambari and Ambari blueprints, see:

Resource Description
Cloudbreak Project Visit the Hortonworks website to see Cloudbreak-related news and updates.
Cloudbreak Forums Visit the Cloudbreak Forums to get connected with the Cloudbreak community.
Apache Ambari Project Learn about the Apache Ambari Project. Apache Ambari is an operational platform for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari exposes a robust set of REST APIs and a rich web interface for cluster management.
Ambari Blueprints Learn about Ambari Bleuprints. Ambari Blueprints are a declarative definition of a Hadoop cluster that Ambari can use to create Hadoop clusters.
Edit on GitHub