apache-spark tutorials and guides

Pyspark Failed to find data source: kafka

Sep 05, 2022

How to configure Spark 2.4 correctly with user-provided Hadoop

Jul 16, 2022

apache-spark hadoop hive hadoop2

Spark Structured Streaming recovering from a query exception

Jul 14, 2022

scala apache-spark spark-structured-streaming

Setting textinputformat.record.delimiter in spark

Jan 21, 2021

scala hadoop mapreduce apache-spark

Apache Spark - reducebyKey - Java -

Aug 17, 2022

java apache-spark

Kafka to S3 - How to loading slices from kafka to S3

Jun 21, 2016

java scala amazon-s3 apache-spark apache-kafka

Exception while connecting to mongodb in spark

Apr 23, 2022

mongodb exception hadoop apache-spark hadoop-streaming

Compilation errors with spark cassandra connector and SBT

Jul 05, 2022

scala intellij-idea cassandra sbt apache-spark

Spark broadcast error: exceeds spark.akka.frameSize Consider using broadcast

Jul 27, 2020

scala apache-spark rdd

RDD.union vs SparkContex.union

Mar 24, 2022

apache-spark

Is it possible to use json4s 3.2.11 with Spark 1.3.0?

Feb 21, 2021

scala sbt apache-spark sbt-assembly json4s

Spark sort by key and then group by to get ordered iterable?

Aug 31, 2022

sorting apache-spark

How to compare every element in the RDD with every other element in the RDD ?

Jul 12, 2018

scala apache-spark nearest-neighbor

How do I flatMap a row of arrays into multiple rows?

Apr 16, 2022

apache-spark apache-spark-sql

UPDATE Cassandra table using spark cassandra connector

Sep 05, 2018

scala apache-spark cassandra-2.0 apache-spark-sql spark-cassandra-connector

How to add two Sparse Vectors in Spark using Python

Oct 20, 2022

python apache-spark sparse-matrix

Spark executor on yarn-client does not take executor core count configuration.

Jan 12, 2020

apache-spark hadoop-yarn

Spark DataFrame filtering: retain element belonging to a list

Aug 31, 2022

scala apache-spark dataframe apache-spark-sql apache-zeppelin

Checkpointing In ALS Spark Scala

Oct 30, 2022

scala apache-spark hdfs apache-spark-mllib

SparkSQL sql syntax for nth item in array

Aug 28, 2022

python apache-spark pyspark apache-spark-sql

New posts in apache-spark