New posts in spark-streaming

How to run a Python user-defined function on the partitions of RDDs using mapPartitions?

Spark Streaming: how to output streaming data to Cassandra

How to load historical data when starting a Spark Streaming process, and calculate running aggregations

Spark Streaming application fails with KafkaException: String exceeds the maximum size or with IllegalArgumentException

Hive creates multiple small files for each insert in HDFS

Using cloud services to aggregate and group real-time statistics in a time window to trigger notifications

Spark history server filter jobs by user id or time

Spark not able to find checkpointed data in HDFS after executor fails

How many times does K-means in Spark Streaming process the same data?

How to run Spark application assembled with Spark 2.1 on cluster with Spark 1.6?

Spark Structured Streaming - Read file from Nested Directories