apache-spark-sql tutorials

Spark parse string to timestamp with timezone

Mar 08, 2023

Upsert to CosmosDB from Spark error

Mar 09, 2023

scala apache-spark pyspark apache-spark-sql azure-cosmosdb

How to create an Encoder for Scala collection (to implement custom Aggregator)?

Mar 09, 2023

scala apache-spark apache-spark-sql apache-spark-encoders

Splittling list of JSON key/value pairs into columns of a row in a Dataset

Mar 08, 2023

scala apache-spark apache-spark-sql

How can I control the number of output files written from Spark DataFrame?

Mar 07, 2023

scala apache-spark apache-kafka apache-spark-sql spark-streaming

spark dataframe: explode list column

Mar 08, 2023

apache-spark apache-spark-sql

Iterate over elements of columns Scala

Mar 08, 2023

scala apache-spark apache-spark-sql

Spark Dataset/Dataframe join NULL skew key

Mar 08, 2023

apache-spark apache-spark-sql skew

How to fix "ImportError: PyArrow >= 0.8.0 must be installed; however, it was not found."?

Mar 05, 2023

apache-spark pyspark apache-spark-sql

Getting HDFS Location of Hive Table in Spark

Mar 06, 2023

scala apache-spark hive apache-spark-sql hiveql

Refresh metadata for Dataframe while reading parquet file

Mar 05, 2023

apache-spark apache-spark-sql parquet apache-spark-dataset

Add a new column to a PySpark DataFrame from a Python list

Mar 04, 2023

python apache-spark pyspark apache-spark-sql

flattening array of struct in pyspark

Mar 05, 2023

apache-spark pyspark apache-spark-sql

How to use variables in SQL queries?

Mar 04, 2023

apache-spark apache-spark-sql databricks

Writing to Google Cloud Storage with v2 algorithm safe?

Mar 04, 2023

apache-spark apache-spark-sql google-cloud-storage

Populate a column based on previous value and row Pyspark

Mar 03, 2023

apache-spark pyspark apache-spark-sql

Spark explode array column to columns

Mar 04, 2023

java arrays apache-spark pyspark apache-spark-sql

In spark SQL/Hive QL, How to select a column that is a reserved keyword

Feb 13, 2023

apache-spark hiveql apache-spark-sql

Cannot run RandomForestClassifier from spark ML on a simple example

Feb 11, 2023

scala apache-spark dataframe apache-spark-sql apache-spark-ml

Spark SQL's where clause excludes null values

Feb 11, 2023

sql apache-spark apache-spark-sql

New posts in apache-spark-sql