
New posts in apache-spark

How to get nth row of Spark RDD?

hadoop apache-spark rdd

Removing punctuation marks from text in Scala - Spark

Add a new column to a DataFrame. I want the new column to hold generated UUIDs

The SPARK_HOME env variable is set but Jupyter Notebook doesn't see it. (Windows)

How to improve broadcast Join speed with between condition in Spark

How to use lag and rangeBetween functions on timestamp values?

Spark: Joining with array

Disable parquet metadata summary in Spark

apache-spark parquet

How to read JSON with a schema in Spark DataFrames/Spark SQL

KStreams + Spark Streaming + Machine Learning

Spark Dataframe column with last character of other column

Adding constant value column to spark dataframe

Count the number of missing values in a dataframe Spark

spark-submit "Service 'Driver' could not bind on port" error

apache-spark word-count

Why does pyspark fail with "Unable to locate hive jars to connect to metastore. Please set spark.sql.hive.metastore.jars."?

apache-spark pyspark

Error: Invalid or corrupt jarfile sbt/sbt-launch-0.13.5.jar

scala apache-spark

MinMax Normalization in Scala

Spark 2.1 - Error while instantiating HiveSessionState

apache-spark

Spark configuration priority

apache-spark hadoop-yarn

How to set and get static variables from spark?