New posts in apache-spark

Spark Connection refused for BlockManager process

Spark saveAsTextFile to Azure Blob creates a blob instead of a text file

Compatibility issue with Scala and Spark for compiled jars

Exception in thread "main" java.lang.IllegalAccessError: class org.apache.spark.storage.StorageUtils$

How to spark-submit to ZooKeeper-managed Mesos cluster (gives java.net.UnknownHostException: zk for mesos://zk:// master URL)?

apache-spark mesos

Dataproc CPU usage too low even though all the cores got used

How to use groupBy, collect_list, arrays_zip, and explode together in PySpark to solve a certain business problem

apache-spark pyspark

Oozie Spark action failed for kerberos environment

Spark streaming job doesn't delete shuffle files

Spark RDD: How to calculate statistics most efficiently?

Explode column with array of arrays - PySpark

Caching DataFrame in Spark Thrift Server

Spark dense_rank window function - without a partitionBy clause

How to delete documents (records) with the Mongo-Hadoop connector for Spark

Spark Streaming Kafka Stream batch execution

Why does a Spark application fail with java.lang.NoClassDefFoundError: com/sun/jersey/api/client/config/ClientConfig even though the jar exists?

scala apache-spark pyspark

Executing a Zeppelin notebook without running it manually