Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in apache-spark

How to specify a missing value in a dataframe

Spark joinWithCassandraTable() on map multiple partition key ERROR

Spark + Python - Java gateway process exited before sending the driver its port number?

java python apache-spark

How do you add a numpy.array as a new column to a pyspark.SQL DataFrame?

Apache Spark MLlib Model File Format

Excessive partitioning (too many tasks) on Apache Spark/Cassandra cluster

SparkStreaming - ExitCodeException exitCode=13

Spark-shell connecting to Mesos stuck at sched.cpp

apache-spark mesos

Why does pyspark give "we couldn't find any external IP address" on macOS?

python apache-spark pyspark

SQLContext implicits

scala apache-spark

Spark job restarted after showing all jobs completed and then fails (TimeoutException: Futures timed out after [300 seconds])

Using Spark Kernel on Jupyter

How to select a subset of fields from an array column in Spark?

Why is my Spark App running in only 1 executor?

Spark UDAF: java.lang.InternalError: Malformed class name

dynamically changing library dependencies in sbt build file from provided etc

Drop first row of Spark DataFrame

Towards limiting the big RDD

How can I know spark-core version?

Is python smart enough to replace function calls with constant result?