Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in spark-streaming

SBT Test Error: java.lang.NoSuchMethodError: net.jpountz.lz4.LZ4BlockInputStream

Amazon s3a returns 400 Bad Request with Spark

Spark off heap memory leak on Yarn with Kafka direct stream

How to create Spark RDD from an iterator?

Kafka topic partitions to Spark streaming

java.lang.NoClassDefFoundError: org/apache/spark/streaming/twitter/TwitterUtils$ while running TwitterPopularTags

Spark Dataframe validating column names for parquet writes

Spark streaming with Kafka - createDirectStream vs createStream

How to stop spark streaming when the data source has run out

AbstractMethodError creating Kafka stream

Limit Kafka batches size when using Spark Streaming

Spark Kafka Direct DStream - How many executors and RDD partitions in yarn-cluster mode if num-executors is set?

Condition in map function

Why does starting StreamingContext fail with “IllegalArgumentException: requirement failed: No output operations registered, so nothing to execute”?

Queries with streaming sources must be executed with writeStream.start();

What's the meaning of DStream.foreachRDD function?

How do I stop a spark streaming job?

Spark DataFrame: does groupBy after orderBy maintain that order?

The value of "spark.yarn.executor.memoryOverhead" setting?

Use Spring together with Spark