Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Do Parquet Metadata Files Need to be Rolled-back?
Oct 26, 2022
apache-spark
spark-streaming
parquet
Spark EC2 SSH connection error SSH return code 255
Oct 24, 2022
ssh
amazon-ec2
apache-spark
Spark program gives odd results when ran on standalone cluster
Oct 23, 2022
python
apache-spark
pyspark
bigdata
How many partitions does Spark create when a file is loaded from S3 bucket?
Oct 01, 2022
apache-spark
hadoop
amazon-s3
rdd
Structured streaming won't write DF to file sink citing /_spark_metadata/9.compact doesn't exist
Sep 27, 2022
apache-spark
amazon-s3
amazon-emr
spark-structured-streaming
Does Spark use data locality?
May 20, 2018
hadoop
cassandra
hbase
apache-spark
spark executor lost failure
Aug 12, 2022
scala
apache-spark
out-of-memory
executor
Apache Spark Streaming, How to handle Downstream dependency failures
Nov 13, 2022
apache-spark
spark-streaming
Reliability issues with Checkpointing/WAL in Spark Streaming 1.6.0
Nov 19, 2022
scala
apache-spark
spark-streaming
amazon-kinesis
checkpointing
How to solve this error org.apache.spark.sql.catalyst.errors.package$TreeNodeException
Apr 16, 2022
apache-spark
datastax-enterprise
cassandra-3.0
databricks
Spark Streaming: Could not compute split, block not found
Aug 20, 2022
apache-spark
spark-streaming
Parquet error when saving from Spark
Oct 25, 2022
apache-spark
parquet
How to change the attributes order in Apache SparkSQL `Project` operator?
Oct 02, 2021
scala
apache-spark
apache-spark-sql
Hive partitioned table reads all the partitions despite having a Spark filter
Apr 11, 2022
scala
apache-spark
hive
apache-spark-sql
Creating a large dictionary in pyspark
Mar 10, 2022
python
apache-spark
How to cache a Spark data frame and reference it in another script
Oct 07, 2017
apache-spark
pyspark
apache-spark-sql
pyspark-sql
Evaluating Spark DataFrame in loop slows down with every iteration, all work done by controller
Aug 30, 2022
apache-spark
pyspark
pyspark-sql
Spark DataFrame mapPartitions
Oct 27, 2022
python
apache-spark
pyspark
apache-spark-sql
Apache Spark SQL UDAF over window showing odd behaviour with duplicate input
Sep 24, 2021
apache-spark
apache-spark-sql
Add a header before text file on save in Spark
May 27, 2019
apache-spark
« Newer Entries
Older Entries »