Setting up a Hadoop on Azure Cluster in 15 minutes

One thing I’d like to share about the Microsoft Hadoop on Azure offering is how ridiculously easy it is to set up.

This is the setup screen for registering your cluster (click to see full size):

Fig 1: Hadoop on Azure Cluster setup screen
Fig 1: Hadoop on Azure Cluster setup screen

It’s all shiny and Metro interfaced, but the important bit is that to set up a cluster you need to choose the following:

  • A DNS Name
  • How big a Cluster you want (4 Node 2TB, 8 Node 4TB, 16 Node 8TB or 32Node 16TB)
  • Username and Password for the Cluster
  • Optional Azure details to store Hive content
  • …. and that’s it.

Enter those fields, click request cluster and you get your wait screen:

Fig 2: Hadoop on Azure Cluster Allocation wait screen
Fig 2: Hadoop on Azure Cluster Allocation wait screen

It then allocates your nodes:

Fig 3: Hadoop on Azure Node Allocation Wait Screen
Fig 3: Hadoop on Azure Node Allocation Wait Screen

Gets down to the business of Creating and Starting them:

Fig 4: Hadoop on Azure Node Creation Wait Screen
Fig 4: Hadoop on Azure Node Creation Wait Screen

The services start on the Nodes:

Fig 5: Hadoop on Azure Nodes starting
Fig 5: Hadoop on Azure Nodes starting

And then it’s cooked:

Fig 6: One Hadoop on Azure Cluster at your service
Fig 6: One Hadoop on Azure Cluster at your service

This took 15 minutes.

Let me repeat that – end to end – the process took 15 minutes.

Some days my laptop takes that long to become usable….

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>