I'm running Spark 2.1.0, Hive 2.1.1 and Hadoop 2.7.3 on Ubuntu 16.04.
I downloaded the Spark project from GitHub and built the "without hadoop" version:
./dev/make-distribution.sh --name "hadoop2-without-hive" --tgz "-Pyarn,hadoop-provided,hadoop-2.7,parquet-provided"
When I run ./sbin/start-master.sh, I get the following exception:
Spark Command: /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -cp /home/server/spark/conf/:/home/server/spark/jars/*:/home/server/hadoop/etc/hadoop/:/home/server/hadoop/share/hadoop/common/lib/:/home/server/hadoop/share/hadoop/common/:/home/server/hadoop/share/hadoop/mapreduce/:/home/server/hadoop/share/hadoop/mapreduce/lib/:/home/server/hadoop/share/hadoop/yarn/:/home/server/hadoop/share/hadoop/yarn/lib/ -Xmx1g org.apache.spark.deploy.master.Master --host ThinkPad-W550s-Lab --port 7077 --webui-port 8080
========================================
Error: A JNI error has occurred, please check your installation and try again
Exception in thread "main" java.lang.NoClassDefFoundError: org/slf4j/Logger
at java.lang.Class.getDeclaredMethods0(Native Method)
at java.lang.Class.privateGetDeclaredMethods(Class.java:2701)
at java.lang.Class.privateGetMethodRecursive(Class.java:3048)
at java.lang.Class.getMethod0(Class.java:3018)
at java.lang.Class.getMethod(Class.java:1784)
at sun.launcher.LauncherHelper.validateMainClass(LauncherHelper.java:544)
at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:526)
Caused by: java.lang.ClassNotFoundException: org.slf4j.Logger
at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:331)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
... 7 more
I edited SPARK_DIST_CLASSPATH according to the post "Where are hadoop jar files in hadoop 2?":
export SPARK_DIST_CLASSPATH=~/hadoop/share/hadoop/common/lib:~/hadoop/share/hadoop/common:~/hadoop/share/hadoop/mapreduce:~/hadoop/share/hadoop/mapreduce/lib:~/hadoop/share/hadoop/yarn:~/hadoop/share/hadoop/yarn/lib
But I'm still getting the same error.
I can see the slf4j jar file under ~/hadoop/share/hadoop/common/lib.
How can I fix this error?
Thank you!
You can run Spark without Hadoop in standalone mode. Spark and Hadoop are better together, but Hadoop is not essential to run Spark: the Spark documentation mentions that there is no need for Hadoop if you run Spark in standalone mode, since all you need is a cluster resource manager, which can be Spark's own standalone manager, YARN, or Mesos.
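For example, a standalone cluster can be brought up with the scripts that ship with Spark (a minimal sketch; the host name is a placeholder):
# Start the standalone master; it prints its spark://... URL in the master log
./sbin/start-master.sh
# Start a worker and register it with the master
./sbin/start-slave.sh spark://your-master-host:7077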
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat.
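To illustrate, the same application can be submitted to either cluster manager just by changing the --master flag (a sketch; the examples jar path assumes a stock Spark 2.1.0 build):
# Run the bundled SparkPi example on a standalone cluster...
./bin/spark-submit --master spark://your-master-host:7077 --class org.apache.spark.examples.SparkPi examples/jars/spark-examples_2.11-2.1.0.jar 100
# ...or on YARN (requires HADOOP_CONF_DIR to point at your Hadoop configuration)
./bin/spark-submit --master yarn --deploy-mode cluster --class org.apache.spark.examples.SparkPi examples/jars/spark-examples_2.11-2.1.0.jar 100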
By default, Spark does not have a storage mechanism of its own. To store data, it needs a fast and scalable file system; you can use S3, HDFS, or any other file system.
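For instance, in a local spark-shell session any of these stores can be addressed purely by URI scheme (a sketch; hosts and paths are placeholders):
# No HDFS daemon is needed to process local files
./bin/spark-shell --master local[2]
# then, inside the shell:
#   spark.read.textFile("file:///tmp/input.txt").count()                 // local file system
#   spark.read.textFile("hdfs://namenode:9000/data/input.txt").count()   // HDFS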
"Hadoop free" builds need to modify SPARK_DIST_CLASSPATH to include Hadoop's package jars. The most convenient place to do this is by adding an entry in conf/spark-env.sh:
export SPARK_DIST_CLASSPATH=$(/path/to/hadoop/bin/hadoop classpath)
See https://spark.apache.org/docs/latest/hadoop-provided.html for details.
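If you set the variable by hand instead, note that a bare directory on a Java classpath only picks up .class files, not the jars inside it, which is why the export in the question has no effect; each jar directory needs a /* wildcard. A manual sketch, assuming the Hadoop layout from the question:
# conf/spark-env.sh -- manual equivalent of $(hadoop classpath); the JVM expands dir/* to the jars in dir
export SPARK_DIST_CLASSPATH="$HOME/hadoop/share/hadoop/common/lib/*:$HOME/hadoop/share/hadoop/common/*:$HOME/hadoop/share/hadoop/mapreduce/*:$HOME/hadoop/share/hadoop/mapreduce/lib/*:$HOME/hadoop/share/hadoop/yarn/*:$HOME/hadoop/share/hadoop/yarn/lib/*"
# Sanity check: the slf4j jar should appear once the globs are expanded
/path/to/hadoop/bin/hadoop classpath --glob | tr ':' '\n' | grep -i slf4j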