apache-spark tutorials and guides

error when run zepplin connecting aws glue

Dec 12, 2022

How does Spark use Netty?

Dec 11, 2022

netty apache-storm apache-spark

Sortby in Javardd

Dec 11, 2022

java apache-spark

Spark SQL - loading csv/psv files with some malformed records

Dec 09, 2022

csv apache-spark apache-spark-sql parquet

spark-shell cannot parse Scala lines that start with dot / period

Dec 10, 2022

scala apache-spark

spark - Exception in thread "main" java.sql.SQLException: No suitable driver

Dec 10, 2022

scala apache-spark jdbc maven-assembly-plugin

How could I write the right entry point in Spark 2.0 program (Actually pyspark 2.0)?

Dec 09, 2022

apache-spark pyspark

What is the difference between SPARK Partitions and Worker Cores?

Dec 10, 2022

java hadoop apache-spark

Apache spark SQL group data by range

Dec 10, 2022

sql scala apache-spark apache-spark-sql

Read JSON file as Pyspark Dataframe using PySpark?

Dec 10, 2022

python apache-spark pyspark apache-spark-sql

Spark throwing ArrayIndexOutOfBoundsException when parallelizing list

Dec 10, 2022

java arrays list apache-spark indexoutofboundsexception

How to integrate Palantir Foundry with Amazon S3 or HDFS

Dec 09, 2022

apache-spark amazon-s3 palantir-foundry foundry-data-connection

Pyspark merge multiple columns into a json column

Dec 10, 2022

python dataframe apache-spark pyspark

Spark cannot read files stored on AWS S3 in Frankfurt region (Ireland region works fine)

Dec 08, 2022

amazon-web-services amazon-s3 apache-spark

Reading from google storage gs:// filesystem from local spark instance

Dec 08, 2022

apache-spark google-cloud-storage google-cloud-platform

spark-shell error on Windows - can it be ignored if not using hadoop?

Dec 07, 2022

apache-spark

Apache Spark: Convert column with a JSON String to new Dataframe in Scala spark [duplicate]

Dec 07, 2022

json scala apache-spark apache-spark-sql

Read XML in spark

Dec 08, 2022

xml apache-spark dataframe pyspark apache-spark-xml

the difference between "one Executor per Core vs one Executor with multiple Core"

Dec 08, 2022

apache-spark pyspark

Apache spark job failed immediately without retry, setting maxFailures doesn't work

Dec 06, 2022

apache-spark failover self-healing

New posts in apache-spark