Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

Apache Spark running spark-shell on YARN error

Sparse Vector pyspark

value toDS is not a member of org.apache.spark.rdd.RDD

How to enable or disable Hive support in spark-shell through Spark property (Spark 1.6)?

Null values from a csv on Scala and Apache Spark

convert epoch to datetime in Scala / Spark

Pyspark: groupby and then count true values

apache-spark pyspark

Spark-SQL : How to read a TSV or CSV file into dataframe and apply a custom schema?

How to get the last row from DataFrame?

Filter dataframe on non-empty WrappedArray

How to convert map to dataframe?

Apache spark and python lambda

python apache-spark

Redis on Spark:Task not serializable

scala redis apache-spark

Getting java.lang.RuntimeException: Unsupported data type NullType when turning a dataframe into permanent hive table

Killing Spark job using command Prompt

apache-spark

Spark throws java.io.IOException: Failed to rename when saving part-xxxxx.gz

apache-spark amazon-s3 io rdd

Error while installing Spark on Google Colab

Saving as Text in Spark 1.30 using Dataframes in Scala

sql scala apache-spark

When specifying local[n1,n2,n3] for spark master, what are the three parameters?

apache-spark

OutofMemoryErrory creating fat jar with sbt assembly

jar cassandra apache-spark sbt