I am trying to set up a Spark Streaming job that reads lines from a Kafka server but processes them using rules written in another local file. I am creating a StreamingContext for the streaming data and a SparkContext for everything else - string manipulation, reading local files, etc.
val sparkConf = new SparkConf().setMaster("local[*]").setAppName("ReadLine")
val ssc = new StreamingContext(sparkConf, Seconds(15))
ssc.checkpoint("checkpoint")
val topicMap = topics.split(",").map((_, numThreads.toInt)).toMap
val lines = KafkaUtils.createStream(ssc, zkQuorum, group, topicMap).map(_._2)
val sentence = lines.toString
val conf = new SparkConf().setAppName("Bi Gram").setMaster("local[2]")
val sc = new SparkContext(conf)
val stringRDD = sc.parallelize(Array(sentence))
But this throws the following error:
Exception in thread "main" org.apache.spark.SparkException: Only one SparkContext may be running in this JVM (see SPARK-2243). To ignore this error, set spark.driver.allowMultipleContexts = true. The currently running SparkContext was created at:
org.apache.spark.SparkContext.<init>(SparkContext.scala:82)
org.apache.spark.streaming.StreamingContext$.createNewSparkContext(StreamingContext.scala:874)
org.apache.spark.streaming.StreamingContext.<init>(StreamingContext.scala:81)
Only one StreamingContext can be active in a JVM at the same time. stop() on StreamingContext also stops the SparkContext. To stop only the StreamingContext, set the optional parameter of stop() called stopSparkContext to false.
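As that note suggests, if you ever need to tear down the StreamingContext while keeping the shared SparkContext alive for batch work, a minimal sketch (assuming ssc is your StreamingContext) is:

ssc.stop(stopSparkContext = false, stopGracefully = true) // stops only the streaming side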
SparkContext is the entry point to the Spark environment. For every Spark application you need to create a SparkContext object. In Spark 2 you can use SparkSession instead of SparkContext. SparkConf is the class that gives you the various options for providing configuration parameters.
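For example, in Spark 2 a minimal SparkSession setup (the app name here is illustrative) would be:

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
  .master("local[*]")
  .appName("MyApp") // illustrative name
  .getOrCreate()
val sc = spark.sparkContext // the underlying SparkContext, created for you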
Spark Streaming is an extension of the core Spark API that allows data engineers and data scientists to process real-time data from various sources including (but not limited to) Kafka, Flume, and Amazon Kinesis. This processed data can be pushed out to file systems, databases, and live dashboards.
One application can have only ONE SparkContext. A StreamingContext is created on top of a SparkContext, so you just need to create the StreamingContext ssc from the existing SparkContext:
val sc = new SparkContext(conf)
val ssc = new StreamingContext(sc, Seconds(15))
If you use the following constructor:
StreamingContext(conf: SparkConf, batchDuration: Duration)
it internally creates another SparkContext:
this(StreamingContext.createNewSparkContext(conf), null, batchDuration)
The SparkContext can be obtained from the StreamingContext via
ssc.sparkContext
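Applied to the question's code, a sketch with a single shared SparkContext (assuming the same Kafka 0.8 receiver API and the topics, numThreads, zkQuorum, and group values from the question) would look like:

val sparkConf = new SparkConf().setMaster("local[*]").setAppName("ReadLine")
val sc = new SparkContext(sparkConf) // the one and only SparkContext
val ssc = new StreamingContext(sc, Seconds(15)) // reuses sc instead of creating a second one
ssc.checkpoint("checkpoint")

val topicMap = topics.split(",").map((_, numThreads.toInt)).toMap
val lines = KafkaUtils.createStream(ssc, zkQuorum, group, topicMap).map(_._2)

// A DStream is not a String, so process each micro-batch's RDD instead of
// calling toString on the stream itself.
lines.foreachRDD { rdd =>
  rdd.foreach(sentence => println(sentence)) // placeholder for the rule-based processing
}

ssc.start()
ssc.awaitTermination()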
Yes, you can do it: first start a SparkSession and then use its underlying SparkContext to create the StreamingContext (remember only one can be active at a time).
val spark = SparkSession.builder()
  .appName("someappname")
  .config("spark.sql.warehouse.dir", warehouseLocation)
  .getOrCreate()
val ssc = new StreamingContext(spark.sparkContext, Seconds(1))
Simple!!!
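And since spark and ssc now share one SparkContext, the same session can also load the local rules file for use inside the stream; a sketch (the rules.txt path and names are illustrative):

val rules = spark.sparkContext.textFile("rules.txt").collect() // read the rules once on the driver
val rulesBroadcast = spark.sparkContext.broadcast(rules) // ship them efficiently to executors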