I see that SparkSession doesn't have a .parallelize() method. Do we need to use SparkContext again to create an RDD? If so, is creating both a SparkSession and a SparkContext in a single program advisable?
Once you build your SparkSession, you can fetch the underlying SparkContext that was created with it, so there is no need to construct a separate one. Assuming a SparkSession is already defined:

val spark: SparkSession = ???

you can get the SparkContext from it:

val sc = spark.sparkContext
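
For completeness, here is a minimal sketch of building such a session from scratch; the master URL and application name below are illustrative assumptions, not values from the question:

import org.apache.spark.sql.SparkSession

val spark: SparkSession = SparkSession.builder()
  .master("local[*]")        // assumption: run locally using all cores
  .appName("RddFromSession") // hypothetical application name
  .getOrCreate()

val sc = spark.sparkContext  // the SparkContext created alongside the session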
The SparkSession class exposes its SparkContext through the sparkContext method, so you can call parallelize on it directly:
val data = spark.sparkContext.parallelize(Seq(1,2,3,4))
data: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[0] at parallelize at <console>:23
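
The returned value is an ordinary RDD, so the usual transformations and actions apply. A small usage sketch:

// standard RDD operations on the result of parallelize
val doubled = data.map(_ * 2).collect()
// doubled: Array[Int] = Array(2, 4, 6, 8)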