apache-spark tutorials and guides

spark 2.1.1 : Parsed JSON values do not match with class constructor

Nov 09, 2021

How can I join a spark live stream with all the data collected by another stream during its entire life cycle?

Aug 30, 2022

apache-spark pyspark spark-streaming amazon-kinesis apache-spark-2.0

Efficient load CSV coordinate format (COO) input to local matrix spark

Oct 19, 2022

scala apache-spark matrix sparse-matrix apache-spark-ml

Spark: Reading big MySQL table into DataFrame fails

Sep 07, 2022

mysql apache-spark

SparkAppHandle Listener not getting invoked

Dec 12, 2021

scala apache-spark playframework

Spark 2.3 dynamic partitionBy not working on S3 AWS EMR 5.13.0

Nov 15, 2022

scala apache-spark amazon-s3 bigdata amazon-emr

KryoException: Unable to find class with spark structured streaming

Jun 18, 2021

apache-spark sbt-assembly kryo spark-structured-streaming

Pyspark and local variables inside UDFs

Sep 20, 2020

python apache-spark pyspark user-defined-functions

Spark watermark and windowing in Append mode

Jun 16, 2022

apache-spark spark-structured-streaming

Latent Dirichlet allocation (LDA) in Spark - replicate model

May 01, 2022

apache-spark pyspark lda

Apache Spark Executors Dead - is this the expected behaviour?

Aug 30, 2022

apache-spark hadoop-yarn

Spark concurrent writes on same HDFS location

Apr 25, 2022

apache-spark hadoop apache-spark-sql hdfs apache-nifi

Kappa architecture: when insert to batch/analytic serving layer happens

Sep 22, 2022

apache-spark architecture streaming apache-flink lambda-architecture

403 Error while accessing s3a using Spark

Sep 24, 2022

apache-spark hadoop amazon-s3 pyspark

AWS EMR: Pyspark: Rdd: mappartitions: Could not find valid SPARK_HOME while searching: Spark closures

May 22, 2022

apache-spark pyspark apache-spark-sql python-requests amazon-emr

saveAsTextFile method in spark

Nov 04, 2022

scala apache-spark

Connect to spark through a SOCKS proxy

Aug 16, 2022

scala ssh proxy apache-spark

How do I submit a Spark jar to an EMR cluster?

Dec 10, 2019

amazon-web-services mapreduce apache-spark bigdata emr

Where to download documentation for Spark?

Nov 19, 2022

apache-spark

SparkR Error in sparkR.init(master="local") in RStudio

Feb 27, 2022

apache-spark rstudio sparkr
