New posts in apache-spark

Unable to run Spark with Mesos

scala apache-spark mesos

Scalac: Assertion failed while running ScalaTest in IDEA

Why does Spark Cassandra Connector fail with NoHostAvailableException?

Apache Spark: "failed to create any local dir"

Running nosetests for pyspark

How to report JMX from Spark Streaming on EC2 to VisualVM?

How to invoke a Spark job in the context of a REST web service?

java rest jersey apache-spark

Read JSON key-values with Hive/SQL and Spark

Spark Streaming with JMS - No API

apache-spark

In Spark, "INFO metrics.MetricsSaver: Saved 10:24 records to ...."

apache-spark

Spark Streaming example calls updateStateByKey with additional parameters

How does Spark Streaming identify new files?

How to increase Java heap space on Spark Amazon EC2 cluster?

Why is HDFS not preferred for applications that require low latency?

hadoop apache-spark hdfs hawq

Using Spark Shell (CLI) in standalone mode on distributed files

How to keep the Spark web UI alive?

apache-spark

Group Spark DataFrame by date

How does partitioning work in Spark?

apache-spark partitioning

What are the likely causes of org.apache.spark.shuffle.MetadataFetchFailedException: Missing an output location for shuffle?

Does a join of co-partitioned RDDs cause a shuffle in Apache Spark?