apache-spark tutorials and guides

Jobs are not shown on Spark WebUI

May 04, 2026

apache-spark pyspark webui

Scala module 2.12.3 requires Jackson Databind version >= 2.12.0 and < 2.13.0 but I have databind 2.12.3

May 05, 2026

java apache-spark data-binding version

Is it possible to read an ORC file into a Spark DataFrame in sparklyr?

May 05, 2026

r apache-spark sparkr sparklyr orc

Spark window function without orderBy

May 05, 2026

apache-spark apache-spark-sql

Spark convert array of structs to Vector for Euclidean distance

May 05, 2026

apache-spark apache-spark-sql user-defined-functions apache-spark-mllib

Spark structured streaming maxOffsetsPerTrigger does not seem to work

May 03, 2026

apache-spark spark-structured-streaming

How to print/log outputs within foreachBatch function?

May 05, 2026

apache-spark databricks spark-structured-streaming

Pyspark Replicate Row based on column value

May 05, 2026

apache-spark pyspark apache-spark-sql

Reading partition columns without partition column names

May 05, 2026

apache-spark amazon-s3 pyspark parquet partition

PySpark (Spark 1.6.x) ImportError: cannot import name Py4JJavaError

May 05, 2026

python apache-spark pyspark

Parsing JSON object with large number of unique keys (not a list of objects) using PySpark

May 04, 2026

python json apache-spark pyspark

How to fail a spark application when there is an error

May 03, 2026

apache-spark apache-spark-sql

Dataproc cannot unzip .gz file zipped by AWS Kinesis

May 04, 2026

apache-spark google-cloud-platform google-cloud-dataproc

How to resolve pickle error in pyspark?

May 04, 2026

python dictionary unicode apache-spark pyspark

Apache Spark: When not to use mapPartition and foreachPartition?

May 04, 2026

scala apache-spark apache-spark-sql

Spark Streaming DStream.reduceByKeyAndWindow doesn't work

May 03, 2026

apache-spark spark-streaming

New posts in apache-spark