How to More Efficiently Load Parquet Files in Spark (pySpark v1.2.0)

I'm loading in high-dimensional parquet files but only need a few columns. My current code looks like:

dat = sqc.parquetFile(path) \
          .filter(lambda r: len(r.a)>0) \
          .map(lambda r: (r.a, r.b, r.c))

My mental model of what's happening is that it's loading in all the data, then throwing out the columns I don't want. I'd obviously prefer it to not even read in those columns, and from what I understand about parquet that seems to be possible.

So there are two questions:

  1. Is my mental model wrong? Or is the Spark compiler smart enough to only read in columns a, b, and c in the example above?
  2. How can I force sqc.parquetFile() to read in data more efficiently?
asked Apr 22 '15 by jarfa


1 Answer

You should use the Spark DataFrame API: https://spark.apache.org/docs/1.3.0/sql-programming-guide.html#dataframe-operations

Something like

dat.select("a", "b", "c").filter("length(a) > 0")
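
With select(), the column pruning happens in the Parquet reader itself, so the columns you don't ask for are never read off disk; Spark can't look inside a Python lambda, which is why the original filter/map version has to materialize whole rows first. A minimal end-to-end sketch, assuming Spark 1.3+ (where parquetFile returns a DataFrame), that sqc is a SQLContext, and that the SQL length function used in this answer is available in your context; the path and column names are placeholders taken from the question:

from pyspark import SparkContext
from pyspark.sql import SQLContext

sc = SparkContext(appName="parquet-column-pruning")
sqc = SQLContext(sc)

path = "/data/wide_table.parquet"  # hypothetical location of the wide Parquet data

# Only the Parquet footers/schema are touched here; row data is read lazily.
dat = sqc.parquetFile(path)

# select() restricts the scan to columns a, b and c, and filter() takes a
# SQL expression string (or a Column), not a Python lambda.
result = dat.select("a", "b", "c").filter("length(a) > 0")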

Or you can use Spark SQL:

dat.registerTempTable("dat")
sqc.sql("select a, b, c from dat where length(a) > 0")
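
Either way, you can check that the pruning actually happens by printing the query plan; a quick sketch under the same assumptions as above, with dat registered as a temp table named "dat":

pruned = sqc.sql("select a, b, c from dat where length(a) > 0")
# The physical plan should show a Parquet scan that lists only columns
# a, b and c, confirming that the remaining columns are never read.
pruned.explain(True)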
answered Sep 17 '22 by kostya