I want to install Hive on Ubuntu. I read this article, which helped me a lot. I followed every step except step 4, because I don't understand exactly what I am supposed to do in it.
Could you please explain this step in detail?
Step 1 : Download and Extract Hadoop
Step 2 : Set the JAVA_HOME path in conf/hadoop-env.sh //This tells Hadoop where Java is installed
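A minimal sketch of step 2, assuming you run it from the Hadoop install directory. The names CONF_DIR and JDK_DIR are only illustrative, and /usr/lib/jvm/default-java is an assumed JDK location; replace it with the real path on your machine.

```shell
#!/bin/sh
# Assumption: CONF_DIR is the Hadoop conf directory (defaults to ./conf).
CONF_DIR="${CONF_DIR:-conf}"
# Assumption: JDK_DIR is where your JDK lives; this default is only an example.
JDK_DIR="${JDK_DIR:-/usr/lib/jvm/default-java}"

# Make sure the conf directory exists, then append the JAVA_HOME export.
mkdir -p "$CONF_DIR"
echo "export JAVA_HOME=$JDK_DIR" >> "$CONF_DIR/hadoop-env.sh"

# Show the line that was just added.
tail -n 1 "$CONF_DIR/hadoop-env.sh"
```

The line is appended rather than edited in place, so an existing hadoop-env.sh keeps its other settings.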
Step 3 : conf/core-site.xml:
<configuration>
<property>
<name>fs.default.name</name> <!-- URI of the default file system, i.e. the NameNode address -->
<value>hdfs://localhost:9000</value>
</property>
</configuration>
Step 4 : conf/hdfs-site.xml:
<configuration> <!-- dfs.replication is the number of copies HDFS keeps of each block; 1 is enough on a single node, and you can raise it once you add more DataNodes -->
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
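In practice, step 4 just means creating (or editing) conf/hdfs-site.xml so that it contains the property above. dfs.replication controls how many copies of each block HDFS stores; the default is 3, but on a single machine there is only one DataNode, so 1 is the sensible value. Here is a rough sketch, assuming you run it from the Hadoop install directory. CONF_DIR is only an illustrative variable, and the script overwrites any existing hdfs-site.xml.

```shell
#!/bin/sh
# Assumption: CONF_DIR is the Hadoop conf directory (defaults to ./conf).
CONF_DIR="${CONF_DIR:-conf}"
mkdir -p "$CONF_DIR"

# Write a minimal hdfs-site.xml with a replication factor of 1.
# Note: this replaces the file if it already exists.
cat > "$CONF_DIR/hdfs-site.xml" <<'EOF'
<?xml version="1.0"?>
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
</configuration>
EOF

# Print the property name and its value as a quick sanity check.
grep -A1 '<name>dfs.replication</name>' "$CONF_DIR/hdfs-site.xml"
```

If you later add more DataNodes, you would change the value here and restart the daemons so that new files get the extra copies.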
Step 5 : conf/mapred-site.xml:
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>localhost:9001</value> <!-- replace localhost with your master host if the JobTracker runs on another machine -->
</property>
</configuration>
Step 6 : Log in to localhost over SSH and format a new distributed filesystem
bin/hadoop namenode -format
Step 7 : Start the hadoop daemons:
bin/start-all.sh
Step 8 : Check the NameNode and JobTracker web interfaces on the ports below
http://localhost:50070/ //NameNode
http://localhost:50030/ //JobTracker
// It is also a good idea to try ssh to check that your worker nodes are reachable
Step 9 : Download and Extract Hive
Step 10 : Set the environment variables below, then start the Hive shell.
export HADOOP_HOME=<hadoop-install-dir>
export HIVE_HOME=<hive-install-dir>
export PATH=$HIVE_HOME/bin:$PATH
$HIVE_HOME/bin/hive