Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

Why does partition parameter of SparkContext.textFile not take effect?

scala apache-spark rdd

SBT cannot import Kafka encoder/decoder classes

How to limit decimal values to 2 digits before applying agg function?

Find column index by searching column header of a Dataset in Apache Spark Java

convert RDD to Dataset in Java Spark

java apache-spark

Spark Failure : Caused by: org.apache.spark.shuffle.FetchFailedException: Too large frame: 5454002341

Spark java.lang.ClassCastException: scala.collection.mutable.WrappedArray$ofRef cannot be cast to java.util.ArrayList

How to fix 'Unsupported class file major version 55' while executing 'org.apache.spark.sql.DataSet.collectAsList()'

How to filter a Spark dataframe by a boolean column?

Can I read a CSV represented as a string into Apache Spark using spark-csv

Error: not found: value lit/when - spark scala

Spark : Average of values instead of sum in reduceByKey using Scala

scala apache-spark

PySpark Will not start - ‘python’: No such file or directory

python apache-spark pyspark

Writing to HBase via Spark: Task not serializable

scala apache-spark hbase

RDD partitioning in spark Streaming

Creating hive table using parquet file metadata

How to calculate Median in spark sqlContext for column of data type double

How to replace NULL to 0 in left outer join in SPARK dataframe v1.6

How to register UDF to use in SQL and DataFrame?

apache spark: akka version error by build jar with all dependencies