
New posts in apache-spark-sql

Spark SQL UNION - ORDER BY column not in SELECT

How to identify columns based on datatype and convert them in pyspark?

Why Spark SQL translates String "null" to Object null for Float/Double types?

What is the most efficient way to select distinct value from a spark dataframe?

How to create Dataset (not DataFrame) without using case class but using StructType?

Use Spark Scala to transform flat data into nested object

Exceeding `spark.driver.maxResultSize` without bringing any data to the driver

Cannot have map type columns in DataFrame which calls set operations

AnalysisException: u'Cannot resolve column name

How to combine and collect elements of an RDD into a list in pyspark

How to access global temp view in another pyspark application?

Create array containing first element of each struct in an array in a Spark dataframe field

Pyspark remove field in struct column

How to convert DataFrame to Json?

How to delete rows from dataframe?

Spark vs Hive differences with ANALYZE TABLE command

Find for each row the first non-null value in a group of columns and the column name

commenting in spark sql

Java Spark DataFrameReader java.lang.NegativeArraySizeException