apache-spark tutorials and guides

Getting the value of a DataFrame column in Spark

Mar 06, 2022

scala apache-spark

Apache spark error: not found: value sqlContext

Aug 21, 2022

scala apache-spark

Spark Shell "Failed to Initialize Compiler" Error on a mac

May 21, 2022

macos scala apache-spark installation

Add extra hours to timestamp columns in Pyspark data frame [duplicate]

Nov 09, 2022

python apache-spark pyspark

Spark SQL: how to cache sql query result without using rdd.cache()

Sep 16, 2022

caching query-optimization apache-spark

How to randomly sample from a Scala list or array?

Oct 31, 2022

arrays list scala apache-spark sample

How to filter based on array value in PySpark?

Nov 12, 2022

python apache-spark dataframe pyspark apache-spark-sql

How do you automate pyspark jobs on emr using boto3 (or otherwise)?

Nov 20, 2022

python amazon-s3 apache-spark pyspark amazon-emr

Spark-Shell Startup Errors

Mar 18, 2022

apache-spark derby

Amazon s3a returns 400 Bad Request with Spark

Nov 16, 2022

amazon-web-services amazon-s3 apache-spark hdfs spark-streaming

How to use groupBy to collect rows into a map?

Sep 24, 2022

apache-spark apache-spark-sql

Hadoop “Unable to load native-hadoop library for your platform” error on docker-spark?

Aug 30, 2022

hadoop apache-spark docker

AWS Glue executor memory limit

Sep 16, 2022

amazon-web-services apache-spark aws-glue

Does SparkSQL support subquery?

Mar 06, 2019

sql apache-spark subquery apache-spark-sql

Pyspark - Aggregation on multiple columns

Sep 16, 2022

python python-2.7 apache-spark pyspark

Spark, add new Column with the same value in Scala [duplicate]

Sep 16, 2022

scala apache-spark spark-dataframe

Zeppelin: How to restart sparkContext in zeppelin

Aug 28, 2022

apache-spark apache-zeppelin

How to filter column on values in list in pyspark?

Sep 16, 2022

apache-spark pyspark apache-spark-sql spark-dataframe pyspark-sql

Spark Scala: Cannot up cast from string to int as it may truncate

Feb 13, 2022

scala apache-spark apache-spark-sql

Spark SQL case insensitive filter for column conditions

Sep 16, 2022

apache-spark apache-spark-sql

New posts in apache-spark