New posts in apache-spark-sql

PySpark: compute row maximum of a subset of columns and add it to an existing dataframe

How to use Spark SQL to parse the JSON array of objects

Sort Spark Dataframe with two columns in different order

Remove an element from a Python list of lists in PySpark DataFrame

Column filtering in PySpark

How to sort a column with Date and time values in Spark?

How to enable or disable Hive support in spark-shell through Spark property (Spark 1.6)?

How to extract a single (column/row) value from a dataframe using PySpark?

Spark SQL: How to read a TSV or CSV file into a dataframe and apply a custom schema?

How to get the last row from DataFrame?

Can I change the nullability of a column in my Spark dataframe?

How to convert map to dataframe?

Unsupported literal type class scala.runtime.BoxedUnit

Getting java.lang.RuntimeException: Unsupported data type NullType when turning a dataframe into permanent hive table

Cannot convert type <class 'pyspark.ml.linalg.SparseVector'> into Vector

Filling missing dates in spark dataframe column

Spark in yarn-cluster: 'sc' not defined

How to unwrap nested Struct column into multiple columns?