apache-spark-sql tutorials

DataFrame partitionBy on nested columns

Sep 12, 2022

apache-spark apache-spark-sql spark-dataframe

Divide elements of column by a sum of elements (of same column) grouped by elements of another column

May 22, 2022

scala apache-spark apache-spark-sql

Implementing MERGE INTO SQL in PySpark

Oct 14, 2022

sql merge pyspark apache-spark-sql

TypeError: 'JavaPackage' object is not callable

May 25, 2021

apache-spark pyspark apache-spark-sql

Spark pulling data into RDD or dataframe or dataset

Dec 17, 2020

hadoop apache-spark apache-spark-sql spark-dataframe data-ingestion

Is there any way to get the output of Spark's Dataset.show() method as a string?

Oct 26, 2022

apache-spark apache-spark-sql

UDF causes warning: CachedKafkaConsumer is not running in UninterruptibleThread (KAFKA-1894)

Oct 25, 2022

apache-spark pyspark apache-kafka apache-spark-sql spark-streaming

Does Spark support BigInteger type?

Aug 23, 2019

java scala apache-spark apache-spark-sql

Spark: Prevent shuffle/exchange when joining two identically partitioned dataframes

Mar 04, 2022

apache-spark join pyspark apache-spark-sql pyspark-dataframes

How to set hive.metastore.warehouse.dir in HiveContext?

May 09, 2022

apache-spark apache-spark-sql spark-hive

Spark Truncated Spark Plan

May 15, 2022

scala apache-spark apache-spark-sql

Spark createDataFrame(df.rdd, df.schema) vs checkPoint for breaking lineage

Aug 31, 2022

apache-spark apache-spark-sql

SparkSQL MissingRequirementError when registering table

Oct 27, 2022

scala sbt apache-spark apache-spark-sql

Spark Exception : Task failed while writing rows

Jan 27, 2020

java hadoop apache-spark apache-spark-sql parquet

Hive SQL: dynamically get null column counts from a table

Sep 16, 2022

hive apache-spark-sql hiveql

Reading JSON files into Spark Dataset and adding columns from a separate Map

Feb 12, 2022

json scala apache-spark apache-spark-sql apache-spark-dataset

Spark 2.0 Timestamp Difference in Milliseconds using Scala

Feb 27, 2022

scala timestamp apache-spark-sql user-defined-functions apache-spark-2.0

My Spark SQL limit is very slow

Oct 27, 2022

apache-spark elasticsearch apache-spark-sql spark-submit

Spark read parquet with custom schema

Nov 09, 2022

apache-spark pyspark apache-spark-sql

Spark SQL convert dataset to dataframe

Sep 16, 2022

scala apache-spark apache-spark-sql
