Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

How to solve "Exception in thread "main" org.apache.spark.SparkException: Application application finished with failed status"?

Define spark udf by reflection on a String

How to filter data using window functions in spark

Why does SparkSession execute twice for one action?

Spark: Removing rows which occur less than N times

apache-spark pyspark

NullPointerException in Spark RDD map when submitted as a spark job

Why extracting an argument in spark to local variable is considered safer?

Transformation process in Apache Spark

apache-spark rdd

Spark doesnt print outputs on the console within the map function

Aggregate a Spark data frame using an array of column names, retaining the names

Mongo Spark connector and mongo 3.2, root user cannot read database

mongodb apache-spark

PySpark PCA: how to convert dataframe rows from multiple columns to a single column DenseVector?

RDD to DataFrame in pyspark (columns from rdd's first element)

Check equality for two Spark DataFrames in Scala

Why sortBy() cannot sort the data evenly in Spark?

convert string data in dataframe into double

RestAPI service call from Spark Streaming

How to create a schema from CSV file and persist/save that schema to a file?

scala apache-spark schema

How to convert all column of dataframe to numeric spark scala?

Starting Ipython with Spark 2

apache-spark ipython