Purpose and Background Use Ubuntu Trusty 14.04.02 Server (without UI) or the Desktop version (flashy UI) as the base OS, then install and configure the CDH 5.4 including HDFS, YARN (MR2), HIVE, ZooKeeper, Sqoop, Oozie, and my favorite Spark. To date, I've successfully used this DIY instruction manual to build a 12-node cluster, and added [...]The post DIY Instructions for Installing Hadoop, Spark, and Hive on Ubuntu appeared first on Advanti Solutions.