Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

java.lang.StackOverflowError when using Kryo to serialize objects with references to each other

java apache-spark kryo kryonet

In Spark Streaming, how to detect for an empty batch?

apache-spark

Spark Streaming Bug - Window of Windowed DStream does not work

Getting java.lang.IllegalArgumentException: requirement failed while calling Sparks MLLIB StreamingKMeans from java application

Batch Size in Spark Streaming

Partitions not being pruned in simple SparkSQL queries

Multiple windows of different durations in Spark Streaming application

Failed to load class for data source: com.databricks.spark.csv

apache-spark

Spark JoinWithCassandraTable on TimeStamp partition key STUCK

Using TestHiveContext/HiveContext in unit tests

Locally change the log level for the zookeeper C client

Spark mapWithState shuffles all data to one node

How to give predicted and label columns in BinaryClassificationMetrics evaluation for Naive Bayes model

Not able to fetch result from hive transaction enabled table through spark-sql

How to write dataframe (obtained from hive table) into hadoop SequenceFile and RCFile?

How to convert RDD to DataFrame in Spark Streaming, not just Spark

Apache Toree and Spark Scala Not Working in Jupyter

Spark never finishes jobs and stages, JobProgressListener crash

apache-spark

The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwx--------- (on Linux)

How to implement a ScalaTest FunSuite to avoid boilerplate Spark code and import implicits