apache-spark tutorials and guides

spark.conf.set("spark.driver.maxResultSize", '6g') is not updating the default value - PySpark

Sep 18, 2025

apache-spark pyspark azure-databricks

Spark read.parquet takes too much time

Sep 18, 2025

performance apache-spark parquet

pySpark withColumn with a function

Sep 19, 2025

apache-spark pyspark apache-spark-sql user-defined-functions

Structured Streaming error py4j.protocol.Py4JNetworkError: Answer from Java side is empty

Sep 18, 2025

apache-spark pyspark apache-kafka spark-structured-streaming

Pyspark: how to read a .csv file in google bucket?

Sep 17, 2025

python apache-spark google-cloud-platform pyspark

Pyarrow error: while running a pandas udf in pyspark

Sep 19, 2025

python pandas apache-spark pyspark apache-spark-sql

How to pull Spark jobs client logs submitted using Apache Livy batches POST method using AirFlow

Sep 18, 2025

apache-spark airflow livy

Transform column with seconds to human readable duration

Sep 18, 2025

python apache-spark apache-spark-sql pyspark

Distributed Rules Engine

Sep 19, 2025

apache-spark drools rule-engine complex-event-processing

Spark Graphframes large dataset and memory Issues

Sep 17, 2025

apache-spark pyspark amazon-emr graphframes

list S3 files in Pyspark

Sep 18, 2025

python apache-spark amazon-s3 pyspark boto3

Value split is not a member of (String, String)

Sep 18, 2025

scala apache-spark apache-kafka spark-streaming spark-submit

Generate database schema diagram for Databricks

Sep 18, 2025

apache-spark database-schema databricks diagram

Merge two tables in Scala/Spark

Sep 18, 2025

scala apache-spark

Spark/Scala load Oracle Table to Hive

Sep 18, 2025

oracle-database apache-spark hive

How to find out the driver node for my Spark?

Sep 17, 2025

apache-spark port driver hadoop-yarn

Spark:executor.CoarseGrainedExecutorBackend: Driver Disassociated disassociated

Sep 17, 2025

apache-spark rdd

SPARK: How to parse a Array of JSON object using Spark

Sep 18, 2025

json apache-spark apache-spark-sql schema

how to save data in HDFS with spark?

Sep 16, 2025

hadoop apache-spark hdfs spark-streaming

Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/spark/streaming/StreamingContext

Sep 16, 2025

scala apache-spark intellij-idea sbt spark-streaming

New posts in apache-spark