apache-spark-sql tutorials

Why does transform do side effects (println) only once in Structured Streaming?

Aug 24, 2022

Need to Know Partitioning Details in Dataframe Spark

Nov 18, 2019

apache-spark apache-spark-sql spark-dataframe

Sort by date an Array of a Spark DataFrame Column

May 03, 2022

scala apache-spark dataframe apache-spark-sql

Using stat.bloomFilter in Spark 2.0.0 to filter another dataframe

Dec 06, 2021

scala apache-spark apache-spark-sql apache-spark-dataset bloom-filter

How to enable Tungsten optimization in Spark 2?

Oct 25, 2019

apache-spark pyspark apache-spark-sql apache-spark-2.0

How to copy table by spark-sql

Jan 25, 2020

sql apache-spark-sql

How to use Column.isin with array column in join?

Aug 17, 2021

scala apache-spark apache-spark-sql

Convert Array into dataframe with columns and index in Scala

Jul 21, 2022

scala apache-spark-sql

Hive bucketing through sparkSQL

Nov 05, 2022

apache-spark hive apache-spark-sql data-processing

Transpose a dataframe in Pyspark

Jul 12, 2022

apache-spark pyspark apache-spark-sql

spark convert dataframe to dataset using case class with option fields

Jul 07, 2022

scala apache-spark apache-spark-sql apache-spark-dataset

How do I flatMap a row of arrays into multiple rows?

Apr 16, 2022

apache-spark apache-spark-sql

UPDATE Cassandra table using spark cassandra connector

Sep 05, 2018

scala apache-spark cassandra-2.0 apache-spark-sql spark-cassandra-connector

Spark DataFrame filtering: retain element belonging to a list

Aug 31, 2022

scala apache-spark dataframe apache-spark-sql apache-zeppelin

When registering a table using the %pyspark interpreter in Zeppelin, I can't access the table in %sql

Aug 12, 2022

apache-spark-sql apache-zeppelin

SparkSQL sql syntax for nth item in array

Aug 28, 2022

python apache-spark pyspark apache-spark-sql

How do I collect a List of Strings from spark DataFrame Column after a GroupBy operation?

Oct 02, 2022

java apache-spark apache-spark-sql

Spark remove duplicate rows from DataFrame [duplicate]

Nov 05, 2022

scala apache-spark dataframe apache-spark-sql

save dataframe as external hive table

Oct 14, 2022

apache-spark hive apache-spark-sql spark-dataframe

Apache Spark - Backend servers

Jun 05, 2022

php apache-spark apache-spark-sql

New posts in apache-spark-sql