apache-spark-sql tutorials

how to calculate max value in some columns per row in pyspark

Aug 10, 2022

Where is the union() method on the Spark DataFrame class?

Mar 29, 2021

java apache-spark dataframe apache-spark-sql

Dividing complex rows of dataframe to simple rows in Pyspark

Aug 28, 2022

python apache-spark dataframe pyspark apache-spark-sql

pyspark py4j.Py4JException: Method and([class java.lang.Integer]) does not exist

Mar 26, 2022

apache-spark pyspark apache-spark-sql

How to limit decimal values to 2 digits before applying agg function?

Oct 31, 2022

scala apache-spark apache-spark-sql apache-spark-1.5

Find column index by searching column header of a Dataset in Apache Spark Java

Sep 13, 2022

java apache-spark apache-spark-sql apache-spark-dataset

Spark Failure : Caused by: org.apache.spark.shuffle.FetchFailedException: Too large frame: 5454002341

Sep 26, 2022

apache-spark apache-spark-sql hadoop-yarn

Spark java.lang.ClassCastException: scala.collection.mutable.WrappedArray$ofRef cannot be cast to java.util.ArrayList

May 22, 2022

scala apache-spark apache-spark-sql

How to filter a Spark dataframe by a boolean column?

Nov 20, 2022

python apache-spark filter apache-spark-sql

Can I read a CSV represented as a string into Apache Spark using spark-csv

May 25, 2022

apache-spark apache-spark-sql spark-csv

How to calculate Median in spark sqlContext for column of data type double

May 19, 2022

apache-spark hive apache-spark-sql

How to replace NULL to 0 in left outer join in SPARK dataframe v1.6

May 24, 2022

scala apache-spark apache-spark-sql apache-spark-1.6

How to register UDF to use in SQL and DataFrame?

May 20, 2022

scala apache-spark apache-spark-sql user-defined-functions

How to check if a Hive table exists using PySpark

May 20, 2022

python-2.7 pyspark apache-spark-sql

Spark Dataset unique id performance - row_number vs monotonically_increasing_id

Jun 08, 2022

scala apache-spark apache-spark-sql apache-spark-dataset

Convert between spark.SQL DataFrame and pandas DataFrame [duplicate]

Sep 11, 2022

apache-spark apache-spark-sql apache-zeppelin

Get the last element from Apache Spark SQL split() Function

May 11, 2022

apache-spark-sql

Why does DataFrame.saveAsTable("df") save table to different HDFS host?

Dec 21, 2019

hadoop apache-spark hdfs apache-spark-sql

Adding 12 hours to datetime column in Spark

Jan 15, 2022

apache-spark apache-spark-sql

New posts in apache-spark-sql