 

Scala dependency on Spark installation

I am just getting started with Spark, so I downloaded the binaries for Hadoop 1 (HDP1, CDH3) from here and extracted them on an Ubuntu VM. Without installing Scala, I was able to execute the examples in the Quick Start guide from the Spark interactive shell.

  1. Does Spark come included with Scala? If yes, where are the libraries/binaries?
  2. For running Spark in other modes (distributed), do I need to install Scala on all the nodes?

As a side note, I observed that Spark has some of the best documentation among open source projects.

Praveen Sripati asked Jan 24 '14 11:01


1 Answer

Does Spark come included with Scala? If yes, where are the libraries/binaries?

The project configuration is placed in the project/ folder. In my case it is:

$ ls project/
build.properties  plugins.sbt  project  SparkBuild.scala  target

When you run sbt/sbt assembly, it downloads the appropriate version of Scala along with the other project dependencies. Check out the target/ folder, for example:

$ ls target/
scala-2.9.2  streams

Note that the Scala version is 2.9.2 in my case.
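
As a side note, the Scala version that sbt fetches is pinned inside the build definition itself. The real project/SparkBuild.scala is far more elaborate, but conceptually it boils down to something like the following minimal sketch (the project name and settings are illustrative; the version number is taken from the listing above):

import sbt._
import Keys._

// Minimal sketch of an sbt build definition pinning the Scala version.
// When you run sbt/sbt assembly, sbt downloads this Scala compiler and
// library (hence the scala-2.9.2 directory under target/) together with
// the other project dependencies.
object SparkBuild extends Build {
  lazy val root = Project(
    id = "spark",
    base = file("."),
    settings = Defaults.defaultSettings ++ Seq(
      scalaVersion := "2.9.2"
    )
  )
}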

For running Spark in other modes (distributed), do I need to install Scala on all the nodes?

Yes. You can create a single assembly jar as described in the Spark documentation:

If your code depends on other projects, you will need to ensure they are also present on the slave nodes. A popular approach is to create an assembly jar (or “uber” jar) containing your code and its dependencies. Both sbt and Maven have assembly plugins. When creating assembly jars, list Spark itself as a provided dependency; it need not be bundled since it is already present on the slaves. Once you have an assembled jar, add it to the SparkContext as shown here. It is also possible to submit your dependent jars one-by-one when creating a SparkContext.
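
To make that concrete, the "list Spark itself as a provided dependency" part is expressed in your application's own build. Below is a hedged sketch of a build.sbt, assuming sbt plus the sbt-assembly plugin and era-appropriate versions; adjust the Spark and Scala versions to whatever your cluster actually runs:

// build.sbt for your Spark application (versions are illustrative)
name := "my-spark-app"

version := "0.1"

scalaVersion := "2.10.3"

// Spark is "provided": the cluster nodes already ship it, so it is not
// bundled into the assembly ("uber") jar produced by the sbt-assembly
// plugin (which is declared separately in project/plugins.sbt).
libraryDependencies += "org.apache.spark" %% "spark-core" % "0.9.0-incubating" % "provided"

And for the "add it to the SparkContext" step, a sketch of shipping the assembled jar when the context is created (the master URL and jar path below are made up for illustration):

import org.apache.spark.{SparkConf, SparkContext}

object MyApp {
  def main(args: Array[String]) {
    val conf = new SparkConf()
      .setMaster("spark://master:7077")   // hypothetical master URL
      .setAppName("MyApp")
      // hypothetical path to the assembly jar built by sbt assembly
      .setJars(Seq("target/scala-2.10/my-spark-app-assembly-0.1.jar"))
    val sc = new SparkContext(conf)
    // ... run your jobs with sc ...
    sc.stop()
  }
}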

tuxdna answered Oct 02 '22 13:10