How to make it easier to deploy my jar to a Spark cluster in standalone mode?

I have a small cluster with 3 machines, plus another machine for development and testing. When developing, I set the SparkContext to local. When everything is OK, I want to deploy the jar file I build to every node. Basically, I manually move the jar to the cluster and copy it to HDFS, which is shared by the cluster. Then I change the code to:

//standalone mode
val sc = new SparkContext(
  "spark://mymaster:7077",                                 // master URL
  "Simple App",                                            // app name
  "/opt/spark-0.9.1-bin-cdh4",                             // Spark home
  List("hdfs://namenode:8020/runnableJars/SimplyApp.jar")  // jar location
)

and run it from my IDE. My question: is there an easier way to move this jar to the cluster?

asked Jun 05 '14 by hakunami
People also ask

How do I run Spark submit in standalone mode?

Use spark://HOST:PORT for a standalone cluster, replacing HOST and PORT with those of your standalone master. Use local to run locally with one worker thread. Use local[k] to run locally with k worker threads, where k is typically the number of cores on your machine.
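As an illustration, here is a minimal Scala sketch of those three master URL forms (the host, port, and thread count below are placeholders):

import org.apache.spark.SparkConf

// standalone cluster: replace mymaster and 7077 with your master's host and port
val standaloneConf = new SparkConf().setMaster("spark://mymaster:7077")

// local mode with a single worker thread
val localConf = new SparkConf().setMaster("local")

// local mode with k worker threads, e.g. k = 4
val localKConf = new SparkConf().setMaster("local[4]")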

What is the difference between local and standalone mode in Spark?

The only difference between standalone and local mode is that in standalone mode you define separate "containers" (JVMs) for the Spark master and the workers on your machine, so you can, for example, run 2 workers and have your tasks distributed across the JVMs of those two workers. In local mode, everything runs in a single JVM.

What is standalone mode in Spark?

Spark's standalone mode offers a web-based user interface to monitor the cluster. The master and each worker have their own web UIs that show cluster and job statistics. By default, you can access the web UI for the master at port 8080. The port can be changed either in the configuration file or via command-line options.
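For example, a minimal sketch of both approaches for the master's UI port (SPARK_MASTER_WEBUI_PORT and --webui-port are the standard knobs in Spark's standalone scripts; 8081 is an arbitrary choice):

# in conf/spark-env.sh on the master
export SPARK_MASTER_WEBUI_PORT=8081

# or via a command-line option when starting the master
./sbin/start-master.sh --webui-port 8081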


1 Answer

In Spark, the program that creates the SparkContext is called 'the driver'. It's sufficient that the jar file with your job is available on the local file system of the driver; the driver will pick it up and ship it to the master/workers.

Concretely, your config will look like:

import org.apache.spark.{SparkConf, SparkContext}

// favor using SparkConf to configure your SparkContext
val conf = new SparkConf()
             .setMaster("spark://mymaster:7077")
             .setAppName("SimpleApp")
             .set("spark.local.ip", "172.17.0.1")  // a driver address reachable from the workers
             .setJars(Array("/local/dir/SimplyApp.jar"))

val sc = new SparkContext(conf)

Under the hood, the driver starts a server from which the workers download the jar file(s). It's therefore important (and often an issue) that the workers have network access to the driver. This can often be ensured by setting 'spark.local.ip' on the driver to an address in a network that's accessible/routable from the workers.
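Building on that, here is a hedged sketch (my own suggestion, not part of the answer above) that avoids hardcoding the jar path by asking Spark for the jar that contains the application class. SparkContext.jarOfClass returns an Option[String] in Spark 1.x+ and a Seq in 0.9.x, so the .toSeq call below works for both:

import org.apache.spark.{SparkConf, SparkContext}

object SimpleApp {
  def main(args: Array[String]): Unit = {
    // locate, on the driver's local file system, the jar that holds this class
    val jars = SparkContext.jarOfClass(this.getClass).toSeq

    val conf = new SparkConf()
      .setMaster("spark://mymaster:7077")
      .setAppName("SimpleApp")
      .setJars(jars)

    val sc = new SparkContext(conf)
    // ... your job here ...
    sc.stop()
  }
}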

answered Oct 22 '22 by maasg