The page here (http://spark.apache.org/docs/latest/programming-guide.html) indicates packages can be included when the shell is launched via:
$SPARK_HOME/bin/spark-shell --packages com.databricks:spark-csv_2.11:1.4.0
What is the syntax for including local packages (say, jars that were downloaded manually)? Something to do with Maven coordinates?
The first step is to download Spark from this link (in my case I put it in the home directory). Then unzip the folder using the command line, or by right-clicking on the *.tar file. The following figure shows my unzipped folder, from where I would run Spark.
To run an application on the Spark cluster, simply pass the spark://IP:PORT URL of the master to the SparkContext constructor. You can also pass the option --total-executor-cores <numCores> to control the number of cores that spark-shell uses on the cluster.
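For example, a minimal launch might look like the following (the master URL and core count are placeholders, so substitute your own cluster's values):
# Placeholder master URL and core count
./bin/spark-shell --master spark://192.168.1.10:7077 --total-executor-cores 4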
If the jars are already present on the master and the workers, you simply need to put them on the classpath when launching spark-shell (the same flags work for spark-submit):
spark-shell \
  --conf spark.driver.extraClassPath="/path/to/jar/spark-csv_2.11.jar" \
  --conf spark.executor.extraClassPath="spark-csv_2.11.jar"
If the jars are only present on the master and you want them to be shipped to the workers (this only works in client mode), you can add the --jars flag:
spark-shell \
  --conf spark.driver.extraClassPath="/path/to/jar/spark-csv_2.11.jar" \
  --conf spark.executor.extraClassPath="spark-csv_2.11.jar" \
  --jars "/path/to/jar/jary.jar,/path/to/other/other.jar"
Note that --jars takes a comma-separated list of paths, not a colon-separated one.
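Once the shell starts (with either of the commands above), you can check that the package is actually usable by reading a file with it. The sketch below assumes Spark 1.x with the spark-csv package; the CSV path is a placeholder:
// Inside spark-shell (Spark 1.x): sqlContext is created for you.
// The CSV path below is a placeholder.
val df = sqlContext.read
  .format("com.databricks.spark.csv")
  .option("header", "true")
  .load("/path/to/file.csv")
df.printSchema()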
For a more detailed answer, see Add jars to a Spark Job - spark-submit.
Please use:
./spark-shell --jars my_jars_to_be_included
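For example (the jar paths are placeholders; --jars takes a comma-separated list of local jar files):
# Placeholder paths to locally downloaded jars
./spark-shell --jars /path/to/spark-csv_2.11.jar,/path/to/commons-csv.jar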
There is an open question related to this: please check this question out.