I'm building an Apache Spark application in Scala and I'm using SBT to build it. Here is the thing:

1. When I run the application in IntelliJ IDEA, I want Spark dependencies to be included in the classpath.
2. When I package the application with sbt-assembly, I do not want Spark dependencies to be included in the uber JAR.
3. When I run unit tests through sbt test, I want Spark dependencies to be included in the classpath (same as #1 but from SBT).

To match constraint #2, I'm declaring Spark dependencies as provided:
libraryDependencies ++= Seq(
"org.apache.spark" %% "spark-streaming" % sparkVersion % "provided",
...
)
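(For context, and as an assumption on my side rather than something stated above: this presumes the sbt-assembly plugin is enabled and sparkVersion is defined somewhere in the build, for example:)

// project/plugins.sbt -- plugin version is a placeholder
addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "0.14.10")

// build.sbt -- Spark version is a placeholder
val sparkVersion = "2.4.8"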
Then, sbt-assembly's documentation suggests adding the following line to include the dependencies for unit tests (constraint #3):
run in Compile <<= Defaults.runTask(fullClasspath in Compile, mainClass in (Compile, run), runner in (Compile, run))
That leaves me with constraint #1 not being fulfilled, i.e. I cannot run the application in IntelliJ IDEA since the Spark dependencies are not being picked up.
With Maven, I was using a specific profile to build the uber JAR. That way, I was declaring Spark dependencies as regular dependencies for the main profile (IDE and unit tests), while declaring them as provided for the fat JAR packaging. See https://github.com/aseigneurin/kafka-sandbox/blob/master/pom.xml
What is the best way to achieve this with SBT?
Use the new 'Include dependencies with "Provided" scope' option in an IntelliJ IDEA run configuration.
(Answering my own question with an answer I got from another channel...)
To be able to run the Spark application from IntelliJ IDEA, you simply have to create a main class in the src/test/scala directory (test, not main). IntelliJ will pick up the provided dependencies.

object Launch {
  def main(args: Array[String]): Unit = {
    Main.main(args)
  }
}
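A side note that is not part of the original answer: since the test classpath also includes the provided dependencies, the same Launch object can be started from the sbt shell as well (sbt 1.x slash syntax shown; on sbt 0.13 the equivalent is test:runMain Launch):

sbt "Test / runMain Launch"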
Thanks to Matthieu Blanc for pointing that out.
The main trick here is to create another subproject that will depend on the main subproject and will have all its provided libraries in compile scope. To do this I add the following lines to build.sbt:
lazy val mainRunner = project.in(file("mainRunner")).dependsOn(RootProject(file("."))).settings(
  libraryDependencies ++= spark.map(_ % "compile")
)
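The snippet above assumes spark is a sequence of Spark module IDs defined elsewhere in the same build.sbt and pulled into the root project as provided. A minimal sketch of that part (the module list is illustrative, not from the original source) could be:

// shared list of Spark modules (illustrative; reuses sparkVersion from the question)
val spark = Seq(
  "org.apache.spark" %% "spark-core"      % sparkVersion,
  "org.apache.spark" %% "spark-streaming" % sparkVersion
)

// the root project keeps Spark "provided" so it stays out of the assembly JAR
libraryDependencies ++= spark.map(_ % "provided")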
Now I refresh the project in IDEA and slightly change the previous run configuration so that it uses the new mainRunner module's classpath.
Works flawlessly for me.
Source: https://github.com/JetBrains/intellij-scala/wiki/%5BSBT%5D-How-to-use-provided-libraries-in-run-configurations
For running Spark jobs, the general solution of "provided" dependencies works: https://stackoverflow.com/a/21803413/1091436
You can then run the app from sbt, IntelliJ IDEA, or anything else.
It basically boils down to this:
run in Compile := Defaults.runTask(fullClasspath in Compile, mainClass in (Compile, run), runner in (Compile, run)).evaluated,
runMain in Compile := Defaults.runMainTask(fullClasspath in Compile, runner in (Compile, run)).evaluated
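To see where those two lines live, here is a sketch of a complete build.sbt in that style (the project name, Scala version, and Spark modules are placeholders of mine, not from the linked answer):

val sparkVersion = "2.4.8"  // placeholder

lazy val root = (project in file("."))
  .settings(
    name := "spark-app",        // placeholder
    scalaVersion := "2.12.15",  // placeholder
    libraryDependencies ++= Seq(
      "org.apache.spark" %% "spark-core"      % sparkVersion % "provided",
      "org.apache.spark" %% "spark-streaming" % sparkVersion % "provided"
    ),
    // make `run` and `runMain` use the Compile classpath, which includes "provided"
    run in Compile := Defaults.runTask(fullClasspath in Compile, mainClass in (Compile, run), runner in (Compile, run)).evaluated,
    runMain in Compile := Defaults.runMainTask(fullClasspath in Compile, runner in (Compile, run)).evaluated
  )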