How many RDDs does DStream generate for a batch interval?

Question

Does one batch interval of data generate one and only one RDD in DStream regardless of how big is the quantity of the data?

Mohammad Tameem · Accepted Answer

It's very late to reply to this thread. But still, It's worth adding a few more points. Number of RDDs depends upon how many receivers you have in your application. That's why "sparkContext.read" will have multiple RDDs. But if you have only one receiver or Kafka as a source (receiver-less) in that case you will get only one RDD.

How many RDDs does DStream generate for a batch interval?

Tags:

apache-spark

spark-streaming

Guo

1 Answers

Mohammad Tameem

Recent Activity

Donate For Us

How many RDDs does DStream generate for a batch interval?

Tags:

apache-spark

spark-streaming

Guo

1 Answers

Mohammad Tameem

Related questions

Recent Activity

Donate For Us