apache-spark tutorials and guides

Spark DataSet filter performance

May 26, 2022

Dynamic Allocation for Spark Streaming

Nov 10, 2022

apache-spark spark-streaming dynamic-allocation apache-spark-2.0 apache-spark-1.6

Only one SparkContext may be running in this JVM - [SPARK]

Aug 29, 2022

java apache-spark twitter stream jvm

How to use dataset to groupby

Aug 27, 2022

apache-spark dataset apache-spark-2.0

Apache Spark - Why are executor being removed? What does 'Idle' mean?

Apr 11, 2022

apache-spark

Structured streaming : watermark vs. exactly-once semantics

Sep 15, 2022

apache-spark apache-kafka spark-structured-streaming

Creating/accessing dataframe inside the transformation of another dataframe

Apr 20, 2022

scala apache-spark dataframe apache-spark-sql

How can I count the average from Spark RDD?

Oct 26, 2022

scala apache-spark rdd

How to pattern match on Row with null values?

Jan 28, 2022

scala apache-spark

How to use both dataset.select and selectExpr in apache spark

Aug 20, 2022

apache-spark apache-spark-dataset

UnsupportedOperationException When Inserting into Map

Oct 25, 2022

apache-spark collections hashmap

How to concatenate a string to a column in Spark?

Nov 17, 2022

scala apache-spark apache-spark-sql concatenation

How to create a Row from a given case class?

May 16, 2022

scala apache-spark apache-spark-sql

Converting timestamp to UTC in Spark Scala

Feb 07, 2022

scala apache-spark timestamp

AWS Glue: How to add a column with the source filename in the output?

Oct 07, 2022

amazon-web-services apache-spark pyspark aws-glue

SparkLauncher. java.lang.NoSuchMethodError: org.yaml.snakeyaml.Yaml.<init>

Sep 05, 2022

java docker apache-spark spring-boot yaml

Write spark dataframe to single parquet file

Feb 15, 2022

apache-spark pyspark pyspark-sql

Problem with saving spark DataFrame as Hive table

Apr 03, 2022

python apache-spark hive pyspark

spark possible to split dataframe into parts for topandas

Oct 30, 2022

python pandas apache-spark

PySpark pandas_udfs java.lang.IllegalArgumentException error

May 03, 2022

pandas apache-spark pyspark pyarrow

New posts in apache-spark