Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark-sql

How to change the schema of a DataFrame (to fix the names of some nested fields)?

Pyspark - from_unixtime not showing the correct datetime

How to convert from SparkR to sparklyr?

Spark SQL - How do i set a variable within the query, to re-use throughout?

Create a column in a PySpark dataframe using a list whose indices are present in one column of the dataframe

Convert a JSON string to a struct column without schema in Spark

Adaptive Query Execution and Shuffle Partitions

How to get length of complex datatype column in hive

Comparing two array columns in Scala Spark

Read spark csv with empty values without converting to null

Window function acts not as expected when I use Order By (PySpark)

Filter column with two different schemas in spark scala

.isin() with a column from a dataframe

pyspark apache-spark-sql

Does ordering a column before partitioning make a difference

Does SparkSession always use Hive Context?

Can I use Spark DataFrame inside regular Spark map operation?

How to execute hql files with multiple SQL queries per single file?