Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in spark-streaming

Spark Streaming: foreachRDD update my mongo RDD

Spark Streaming mapWithState seems to rebuild complete state periodically

How to fix "java.io.NotSerializableException: org.apache.kafka.clients.consumer.ConsumerRecord" in Spark Streaming Kafka Consumer?

What is the correct way to start/stop spark streaming jobs in yarn?

spark ssc.textFileStream is not streamining any files from directory

How to specify which java version to use in spark-submit command?

SBT Test Error: java.lang.NoSuchMethodError: net.jpountz.lz4.LZ4BlockInputStream

Amazon s3a returns 400 Bad Request with Spark

Spark off heap memory leak on Yarn with Kafka direct stream

How to create Spark RDD from an iterator?

Kafka topic partitions to Spark streaming

java.lang.NoClassDefFoundError: org/apache/spark/streaming/twitter/TwitterUtils$ while running TwitterPopularTags

Spark Dataframe validating column names for parquet writes

Spark streaming with Kafka - createDirectStream vs createStream

How to stop spark streaming when the data source has run out

AbstractMethodError creating Kafka stream

Limit Kafka batches size when using Spark Streaming

Spark Kafka Direct DStream - How many executors and RDD partitions in yarn-cluster mode if num-executors is set?

Condition in map function

Why does starting StreamingContext fail with “IllegalArgumentException: requirement failed: No output operations registered, so nothing to execute”?