apache-spark tutorials and guides

How to solve "Exception in thread "main" org.apache.spark.SparkException: Application application finished with failed status"?

Feb 09, 2023

apache-spark spark-streaming

Define spark udf by reflection on a String

Feb 09, 2023

scala apache-spark spark-dataframe udf scala-reflect

How to filter data using window functions in spark

Feb 09, 2023

scala apache-spark spark-dataframe window-functions

Why does SparkSession execute twice for one action?

Feb 08, 2023

java apache-spark apache-spark-sql

Spark: Removing rows which occur less than N times

Feb 09, 2023

apache-spark pyspark

NullPointerException in Spark RDD map when submitted as a spark job

Feb 08, 2023

scala hadoop apache-spark distributed-computing bigdata

Why extracting an argument in spark to local variable is considered safer?

Feb 09, 2023

scala function apache-spark distributed-computing bigdata

Transformation process in Apache Spark

Feb 09, 2023

apache-spark rdd

Spark doesnt print outputs on the console within the map function

Feb 08, 2023

scala apache-spark spark-streaming

Aggregate a Spark data frame using an array of column names, retaining the names

Feb 07, 2023

scala apache-spark apache-spark-sql aggregate-functions

Mongo Spark connector and mongo 3.2, root user cannot read database

Feb 08, 2023

mongodb apache-spark

PySpark PCA: how to convert dataframe rows from multiple columns to a single column DenseVector?

Feb 08, 2023

apache-spark pyspark apache-spark-mllib pca apache-spark-ml

RDD to DataFrame in pyspark (columns from rdd's first element)

Feb 07, 2023

python-2.7 apache-spark pyspark rdd pyspark-sql

Check equality for two Spark DataFrames in Scala

Feb 08, 2023

scala unit-testing apache-spark spark-dataframe

Why sortBy() cannot sort the data evenly in Spark?

Feb 08, 2023

python apache-spark pyspark rdd

convert string data in dataframe into double

Feb 08, 2023

scala apache-spark apache-spark-sql

RestAPI service call from Spark Streaming

Feb 07, 2023

scala rest apache-spark spark-streaming

How to create a schema from CSV file and persist/save that schema to a file?

Feb 07, 2023

scala apache-spark schema

How to convert all column of dataframe to numeric spark scala?

Feb 07, 2023

scala apache-spark apache-spark-sql

Starting Ipython with Spark 2

Feb 07, 2023

apache-spark ipython

New posts in apache-spark