apache-spark-sql tutorials

Avro Schema to Spark StructType

Oct 14, 2022

How to load specific Hive partition in DataFrame Spark 1.6?

Aug 26, 2022

apache-spark hive apache-spark-sql

Convert PySpark dataframe column type to string and replace the square brackets

Jan 28, 2018

python pyspark apache-spark-sql

Spark DataSet filter performance

May 26, 2022

apache-spark apache-spark-sql spark-dataframe apache-spark-dataset

Creating/accessing dataframe inside the transformation of another dataframe

Apr 20, 2022

scala apache-spark dataframe apache-spark-sql

How to concatenate a string to a column in Spark?

Nov 17, 2022

scala apache-spark apache-spark-sql concatenation

How to create a Row from a given case class?

May 16, 2022

scala apache-spark apache-spark-sql

Parquet vs Delta format in Azure Data Lake Gen 2 store

Sep 16, 2022

apache-spark apache-spark-sql azure-data-lake azure-databricks azure-data-lake-gen2

Spark SQL: automatic schema from csv

Apr 06, 2022

scala csv apache-spark apache-spark-sql

How to use countDistinct in Scala with Spark?

Mar 25, 2022

scala user-defined-functions apache-spark-sql

How to implement NOT IN for two DataFrames with different structure in Apache Spark

Oct 30, 2022

java sql apache-spark apache-spark-sql

Moving Spark DataFrame from Python to Scala within Zeppelin

Aug 22, 2022

python scala apache-spark apache-spark-sql apache-zeppelin

How to set Parquet file encoding in Spark

Apr 10, 2022

scala apache-spark apache-spark-sql parquet

jsontostructs to Row in spark structured streaming

Oct 27, 2022

java apache-spark apache-spark-sql apache-spark-2.0 spark-structured-streaming

How To Push a Spark DataFrame to Elasticsearch (PySpark)

May 03, 2022

python elasticsearch pyspark apache-spark-sql spark-dataframe

Create new column with an array of range of numbers

Aug 10, 2022

arrays scala apache-spark apache-spark-sql

Spark Dataframe Write to CSV creates _temporary directory file in Standalone Cluster Mode

Oct 03, 2022

java csv apache-spark dataframe apache-spark-sql

Spark Advanced Window with dynamic last

Sep 17, 2022

sql scala apache-spark apache-spark-sql pyspark-sql

Operation on Data Frame

Sep 25, 2022

scala apache-spark apache-spark-sql

Calculate the mode of a PySpark DataFrame column?

May 12, 2022

python apache-spark pyspark apache-spark-sql
