apache-spark-sql tutorials

PySpark: compute the row maximum of a subset of columns and add it to an existing dataframe

Sep 24, 2018

How to use Spark SQL to parse the JSON array of objects

May 20, 2022

json scala apache-spark apache-spark-sql bigdata

Sort Spark Dataframe with two columns in different order

May 26, 2022

scala sorting apache-spark dataframe apache-spark-sql

Remove an element from a Python list of lists in PySpark DataFrame

Sep 06, 2022

python apache-spark pyspark apache-spark-sql pyspark-sql

Column filtering in PySpark

Mar 07, 2017

python lambda apache-spark apache-spark-sql pyspark

How to sort a column with Date and time values in Spark?

Nov 01, 2022

apache-spark dataframe apache-spark-sql rdd

How to enable or disable Hive support in spark-shell through Spark property (Spark 1.6)?

Mar 25, 2022

apache-spark hive apache-spark-sql apache-spark-1.6

How to extract a single (column/row) value from a dataframe using PySpark?

Nov 03, 2022

pyspark apache-spark-sql

Spark SQL: How to read a TSV or CSV file into a dataframe and apply a custom schema?

Apr 26, 2022

scala apache-spark apache-spark-sql spark-dataframe

How to get the last row from DataFrame?

Oct 31, 2022

scala apache-spark apache-spark-sql spark-dataframe

Can I change the nullability of a column in my Spark dataframe?

Jun 24, 2022

python pyspark apache-spark-sql

How to convert map to dataframe?

Nov 06, 2022

scala apache-spark dictionary apache-spark-sql

Unsupported literal type class scala.runtime.BoxedUnit

Jun 25, 2022

scala apache-spark-sql datastax databricks

Getting java.lang.RuntimeException: Unsupported data type NullType when turning a dataframe into permanent hive table

Apr 22, 2022

apache-spark pyspark apache-spark-sql

Cannot convert type <class 'pyspark.ml.linalg.SparseVector'> into Vector

Sep 11, 2021

apache-spark pyspark apache-spark-sql apache-spark-mllib apache-spark-ml

Filling missing dates in spark dataframe column

Nov 14, 2022

scala datetime apache-spark apache-spark-sql

Spark in yarn-cluster: 'sc' not defined

Oct 22, 2022

python apache-spark apache-spark-sql

How to unwrap nested Struct column into multiple columns?

Sep 19, 2022

python apache-spark dataframe pyspark apache-spark-sql
