
Spinning up a Hadoop (CDH5) environment with Docker

Cory Flanigan
Apr 30, 2015
<p>Getting a <a href="https://hadoop.apache.org/#What+Is+Apache+Hadoop%3F">Hadoop</a> environment up and running takes very little time and effort if you're willing to use <a href="http://www.docker.com/whatisdocker/">Docker</a>.</p> <p>If you don't already have Docker installed, follow the <a href="https://docs.docker.com/installation/">installation instructions</a> on the Docker site.</p> <p>Then pull down this image, which already has a fully functioning environment running CDH5 (Cloudera's Hadoop distribution, which includes Hadoop, HDFS, and a useful set of tools):</p> <p><a href="https://github.com/chali/cdh5-pseudo-distributed-cluster-docker">https://github.com/chali/cdh5-pseudo-distributed-cluster-docker</a></p> <p>Follow the instructions in the README, and you'll be up and running in no time at all!</p> <p>Extra Credit:</p> <p>Check out the documentation for <a href="http://docs.docker.com/userguide/dockervolumes/">mounting files into Docker containers</a>. With a mounted directory you can write <a href="http://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html">MapReduce</a> and other Hadoop jobs outside of the container, and run them inside!<br> <br> <img alt="Inception!" src="https://fusiondotnet.files.wordpress.com/2014/06/3q5zinc.gif" style="height:105px; width:250px"></p>
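<p>To make the Extra Credit idea concrete, here is a minimal sketch of the mount-then-run workflow. It only assembles the two Docker commands involved: <code>docker run -v</code> to start a container with a host directory mounted, and <code>docker exec</code> to run a job inside it. The image tag, container name, paths, and jar name are all hypothetical placeholders, not values from the README, so substitute whatever that project actually uses.</p>

```python
import os
import shlex


def build_run_command(host_dir, container_dir, image, name):
    """Return a `docker run` argument list that mounts host_dir at container_dir."""
    # Use an absolute host path so the bind mount does not depend on the cwd.
    host_path = os.path.abspath(host_dir)
    return [
        "docker", "run", "-d",
        "--name", name,
        "-v", f"{host_path}:{container_dir}",
        image,
    ]


def build_exec_command(name, job_command):
    """Return a `docker exec` argument list that runs job_command in a running container."""
    return ["docker", "exec", name, *job_command]


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    run_cmd = build_run_command("./jobs", "/jobs", "example/cdh5-image", "cdh5")
    job_cmd = build_exec_command("cdh5", ["hadoop", "jar", "/jobs/wordcount.jar"])

    # Print the commands rather than executing them, so they can be reviewed first.
    print(shlex.join(run_cmd))
    print(shlex.join(job_cmd))
```

<p>Printing the commands instead of running them keeps the sketch safe to try: copy the output into a shell once the placeholders match your setup, or pass the lists to <code>subprocess.run</code>. This assumes the image starts its Hadoop services by default when run with no extra command.</p>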