While checking how to use the Cassandra connector, I saw that the documentation says to add this to the sbt file:
libraryDependencies += "com.datastax.spark" %% "spark-cassandra-connector" % "1.6.0-M1"
In general, is there an obvious, straightforward way to translate this into the corresponding:
spark-shell --packages "field1":"field2"
I've tried:
spark-shell --packages "com.datastax.spark":"spark-cassandra-connector"
and a few other variations, but none of them work.
I believe it is --packages "groupId:artifactId:version". If you have multiple packages, you can comma-separate them:
--packages "groupId1:artifactId1:version1,groupId2:artifactId2:version2"
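As a rough sketch of the comma-separated form, here is a small shell helper that joins several coordinates into one --packages argument. The function name join_packages and the second coordinate are made up for illustration; only the Cassandra connector coordinate comes from this thread.

```shell
# Hypothetical helper: join Maven coordinates with commas (no spaces),
# which is the list format --packages takes.
# The body runs in a subshell so the IFS change does not leak out.
join_packages() (
  IFS=,
  printf '%s\n' "$*"
)

# Example: one real coordinate plus a placeholder one.
join_packages \
  "com.datastax.spark:spark-cassandra-connector_2.10:1.6.0-M1" \
  "groupId2:artifactId2:version2"
# prints com.datastax.spark:spark-cassandra-connector_2.10:1.6.0-M1,groupId2:artifactId2:version2

# Intended use (not run here):
#   spark-shell --packages "$(join_packages coord1 coord2)"
```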
In sbt
val appDependencies = Seq(
"com.datastax.spark" % "spark-cassandra-connector_2.10" % "1.6.0-M1"
)
and
val appDependencies = Seq(
"com.datastax.spark" %% "spark-cassandra-connector" % "1.6.0-M1"
)
are equivalent when your project's Scala version is 2.10.
If you use the %% syntax (after the groupId) in sbt, it automatically picks the artifact built for your Scala version. So with Scala 2.10, it changes spark-cassandra-connector to spark-cassandra-connector_2.10. I'm not sure this feature exists in spark-shell, so you might need to ask for the Scala 2.10 version of your artifact explicitly, like this: --packages "com.datastax.spark:spark-cassandra-connector_2.10:1.6.0-M1"
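To make the translation mechanical, here is a minimal sketch of a shell function that turns the parts of an sbt dependency into a --packages coordinate. The name sbt_coord and its argument order are my own invention, not part of Spark or sbt. It assumes you pass the Scala binary version yourself (the 2.10 in _2.10) when the sbt line uses %%, and leave it out when the sbt line uses a single %.

```shell
# Hypothetical helper, not part of Spark or sbt.
# Usage: sbt_coord GROUP ARTIFACT VERSION [SCALA_BINARY_VERSION]
#   "g" %% "a" % "v"  ->  sbt_coord g a v 2.10   (suffix added, like %%)
#   "g" %  "a" % "v"  ->  sbt_coord g a v        (artifact used as-is)
sbt_coord() {
  if [ -n "${4:-}" ]; then
    # Append _<scala binary version> to the artifactId, as %% does in sbt.
    printf '%s:%s_%s:%s\n' "$1" "$2" "$4" "$3"
  else
    printf '%s:%s:%s\n' "$1" "$2" "$3"
  fi
}

sbt_coord com.datastax.spark spark-cassandra-connector 1.6.0-M1 2.10
# prints com.datastax.spark:spark-cassandra-connector_2.10:1.6.0-M1

# Intended use (not run here):
#   spark-shell --packages "$(sbt_coord com.datastax.spark spark-cassandra-connector 1.6.0-M1 2.10)"
```

The Scala version you pass should match the one your Spark build uses, otherwise the artifact may resolve but fail at runtime.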