Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark-2.0

Out of Memory Error when Reading large file in Spark 2.1.0

spark job keep showing TaskCommitDenied (Driver denied task commit)

How to pivot streaming dataset?

How to cast a WrappedArray[WrappedArray[Float]] to Array[Array[Float]] in spark (scala)

Spark 2.0 Timestamp Difference in Milliseconds using Scala

Livy Server: return a dataframe as JSON?

SparkSession initialization error - Unable to use spark.read

Reading Avro messages from Kafka with Spark 2.0.2 (structured streaming)

Spark 2.0.0 Error: PartitioningCollection requires all of its partitionings have the same numPartitions

Avoid starting HiveThriftServer2 with created context programmatically

Pass system property to spark-submit and read file from classpath or custom path

How to convert RDD of dense vector into DataFrame in pyspark?

Apache Spark vs Apache Spark 2 [closed]

Why does using cache on streaming Datasets fail with "AnalysisException: Queries with streaming sources must be executed with writeStream.start()"?

Spark fails to start in local mode when disconnected [Possible bug in handling IPv6 in Spark??]

Timeout Exception in Apache-Spark during program Execution

dynamically bind variable/parameter in Spark SQL?

spark off heap memory config and tungsten