apache-spark tutorials and guides

Filter array column content

Jul 17, 2019

apache-spark pyspark pyspark-sql

Spark Advanced Window with dynamic last

Sep 17, 2022

sql scala apache-spark apache-spark-sql pyspark-sql

How to create an unique autogenerated Id column in a spark dataframe

Feb 04, 2022

apache-spark

Using Jackson 2.9.9 in java Spark

Apr 04, 2022

java apache-spark jackson apache-spark-mllib

Spark dataframe checkpoint cleanup

Oct 19, 2022

scala apache-spark hive

List (or iterator) of tuples returned by MAP (PySpark)

Jul 23, 2016

python apache-spark

MLlib to Breeze vectors/matrices are private to org.apache.spark.mllib scope?

Nov 02, 2022

apache-spark apache-spark-mllib scala-breeze

How to use map-function in SPARK with Java

Nov 05, 2022

java csv apache-spark

How to include file in production mode for Play framework

May 22, 2018

scala intellij-idea playframework apache-spark

Operation on Data Frame

Sep 25, 2022

scala apache-spark apache-spark-sql

stop-all.sh in Spark sbin/ folder is not stopping all slave nodes

Feb 04, 2022

linux hadoop apache-spark

How to compute the inverse of a RowMatrix in Apache Spark?

Sep 08, 2021

scala apache-spark linear-algebra distributed-computing

system cannot find the path specified in spark-shell

Jun 09, 2020

apache-spark

Reducing potentially empty RDD's

Oct 14, 2022

scala apache-spark

Calculate the mode of a PySpark DataFrame column?

May 12, 2022

python apache-spark pyspark apache-spark-sql

How to read specific lines from sparkContext

May 09, 2022

java text apache-spark line

Read file on remote machine in Apache Spark using ftp

Mar 24, 2021

scala apache-spark ftp

Scalaz Type Classes for Apache Spark RDDs

Nov 14, 2022

scala apache-spark functional-programming rdd scalaz

Scala case class ignoring import in the Spark shell

Jun 01, 2020

scala apache-spark apache-spark-2.0

Do we still have to make a fat jar for submitting jobs in Spark 2.0.0?

Apr 28, 2020

apache-spark jar uberjar

New posts in apache-spark