
Install Hue without Cloudera

Has anyone tried/succeeded in installing Hue on Hadoop without Cloudera?

I have gotten to a point where I can reliably reproduce a Hadoop cluster with HBase and Hive and can set it all up in about 15 minutes. I'd love to have Hue along with all this without having to go back and redo my setup with Cloudera.

Aman Chawla asked Dec 14 '13 03:12



2 Answers

Check out slides #19 and #5: Hue is getting everywhere and is compatible with Hadoop 0.20 / 1.2.0 / 2.2.0: http://gethue.com/hue-goes-to-paris-hug-france/

Hue has tarball releases that you are free to install. You can also simply clone the source code from GitHub (Hue is open source and Apache-licensed), https://github.com/cloudera/hue, and build the branch you want.

Documentation is available upstream, and CDH has its own.

Hue is also packaged in Bigtop (and so is built against vanilla Hadoop).

Hue is a web server (Django-based) that acts as a view on top of Hadoop. So Hue just needs to be installed and then configured by adding the hosts of the NameNode, JobTracker, Resource Manager, Oozie, HiveServer, etc. to its hue.ini.
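To give a rough idea of what "adding the hosts" looks like, here is a small sketch that writes an example hue.ini fragment to a scratch file for review. The host names (`*.example.com`), the ports, and the output path are placeholders I made up, and the section and key names are as I recall them from the Hue 3.x sample config, so compare them with the hue.ini that ships with your version before copying anything over.

```shell
#!/bin/sh
# Write an example hue.ini fragment to a scratch file (not the live config).
# Assumptions: host names and ports are placeholders; section/key names follow
# the Hue 3.x sample config as I recall it and may differ in your version.
HUE_INI_FRAGMENT="${HUE_INI_FRAGMENT:-./hue-hosts-example.ini}"

cat > "$HUE_INI_FRAGMENT" <<'EOF'
[hadoop]
  [[hdfs_clusters]]
    [[[default]]]
      # NameNode
      fs_defaultfs=hdfs://namenode.example.com:8020
      webhdfs_url=http://namenode.example.com:50070/webhdfs/v1

  [[yarn_clusters]]
    [[[default]]]
      # Resource Manager
      resourcemanager_host=rm.example.com
      resourcemanager_port=8032
      resourcemanager_api_url=http://rm.example.com:8088

[liboozie]
  oozie_url=http://oozie.example.com:11000/oozie

[beeswax]
  # HiveServer2
  hive_server_host=hive.example.com
  hive_server_port=10000
EOF

echo "Wrote example fragment to $HUE_INI_FRAGMENT"
```

Merge the relevant values into your real hue.ini by hand; on an MR1 cluster you would fill in the JobTracker host in the corresponding mapred section instead of the YARN one.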

Also, as detailed on gethue.com/releases, the version you need might depend on your Hive version.

Note that without Cloudera's distribution your mileage may vary, but feel free to chime in on the Hue user list or gethue.com ;)

We are also looking into improving Hue setup on Amazon AWS/EMR!

Romain answered Sep 29 '22 10:09


To build and run Hue 3.6.0 with Apache Hadoop 2.4.1:

  1. Clone the repository: $ git clone https://github.com/cloudera/hue.git
     (Note: releases/tag/release-3.6.0 is unstable, so it's better to build from the latest master. I built from commit 87d6b2da1 of Aug 7, and it's stable.)

  2. $ cd hue

  3. Edit the Maven POM: $ vi maven/pom.xml
     - change hadoop.version to 2.4.1
     - replace hadoop-core with hadoop-common
     - set the hadoop-test version to 1.2.1

  4. Remove the files that need Hadoop MR1:
     $ rm desktop/libs/hadoop/java/src/main/java/org/apache/hadoop/mapred/ThriftJobTrackerPlugin.java
     $ rm desktop/libs/hadoop/java/src/main/java/org/apache/hadoop/thriftfs/ThriftJobTrackerPlugin.java

  5. Build Hue: $ make apps

  6. Configure Hue: $ vi desktop/conf/pseudo-distributed.ini

  7. Run the Hue server in dev mode: $ build/env/bin/hue runserver 0.0.0.0:8000
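
If you repeat this often, the pom.xml edits and the file removal can be scripted. The following is only a sketch: the function names `patch_pom` and `remove_mr1_plugins` are my own, and the sed patterns assume the POM spells the property as `<hadoop.version>...</hadoop.version>` and the dependency as `<artifactId>hadoop-core</artifactId>`, which you should verify in your checkout. Setting the hadoop-test version to 1.2.1 is left as a manual edit, since doing it safely needs more than a one-line substitution.

```shell
#!/bin/sh
# Sketch of the pom.xml edits and MR1 file removal described in the steps.
# Assumption: the POM contains <hadoop.version>...</hadoop.version> and
# <artifactId>hadoop-core</artifactId> written exactly like that.

# Rewrite hadoop.version to 2.4.1 and swap hadoop-core for hadoop-common.
patch_pom() {
    pom="$1"
    # -i.bak works with both GNU and BSD sed and keeps a backup as pom.xml.bak
    sed -i.bak -E \
        -e 's|<hadoop\.version>[^<]*</hadoop\.version>|<hadoop.version>2.4.1</hadoop.version>|' \
        -e 's|<artifactId>hadoop-core</artifactId>|<artifactId>hadoop-common</artifactId>|' \
        "$pom"
}

# Delete the two JobTracker plugin sources that depend on Hadoop MR1.
# $1 is the root of the hue checkout.
remove_mr1_plugins() {
    src="$1/desktop/libs/hadoop/java/src/main/java/org/apache/hadoop"
    rm -f "$src/mapred/ThriftJobTrackerPlugin.java" \
          "$src/thriftfs/ThriftJobTrackerPlugin.java"
}

# Usage, from the directory that contains the hue checkout:
#   patch_pom hue/maven/pom.xml
#   remove_mr1_plugins hue
#   # then set the hadoop-test version to 1.2.1 by hand and run `make apps`
```

The script only defines the two functions, so sourcing it has no side effects until you call them.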

x4444 answered Sep 29 '22 11:09