apache-spark tutorials and guides

Spark SQL is not converting timezone correctly [duplicate]

Feb 04, 2022

What's the difference between explode function and operator?

Sep 05, 2022

apache-spark apache-spark-sql

What to do with "WARN TaskSetManager: Stage contains a task of very large size"?

Dec 26, 2020

apache-spark apache-spark-1.6

Delta Lake rollback

Nov 04, 2022

apache-spark rollback databricks delta-lake

How does Spark achieve parallelism within one task on multi-core or hyper-threaded machines

Nov 06, 2022

multithreading apache-spark parallel-processing multicore

Pyspark Dataframe group by filtering

May 31, 2019

python apache-spark pyspark apache-spark-sql

Spark Dataframe Random UUID changes after every transformation/action

Apr 18, 2022

scala apache-spark dataframe uuid

How to run Scala script using spark-submit (similarly to Python script)?

Jul 01, 2021

scala apache-spark

Aggregate rows of Spark DataFrame to String after groupby

Aug 19, 2022

scala apache-spark dataframe

Read from Kafka and write to hdfs in parquet

Jun 16, 2022

hadoop apache-spark apache-kafka hdfs parquet

Spark Dataframe - Python - count substring in string

Oct 28, 2022

python string apache-spark pyspark apache-spark-sql

joda DateTime format cause null pointer error in spark RDD functions

Mar 23, 2022

scala apache-spark

TypeError: got an unexpected keyword argument

Mar 18, 2022

python apache-spark pyspark apache-spark-sql user-defined-functions

How to Launch Spark 2.0 on EC2

Dec 03, 2019

amazon-web-services apache-spark amazon-ec2

Apache Spark vs Apache Spark 2 [closed]

Oct 26, 2022

apache-spark apache-spark-2.0

How to handle an AnalysisException on Spark SQL?

Sep 05, 2022

python apache-spark pyspark apache-spark-sql databricks

What does in-memory data storage mean in the context of Apache Spark?

Nov 09, 2022

hadoop apache-spark

In Apache Spark. How to set worker/executor's environment variables?

Oct 28, 2022

amazon-web-services amazon-s3 apache-spark distributed-computing

SparkSQL error Table Not Found

Oct 22, 2018

sql scala apache-spark cassandra

NoSuchMethodException in MaxMind GeoIp dependency jackson-databind built with mvn shade

Jun 06, 2018

scala maven apache-spark jackson maxmind

New posts in apache-spark