I'm trying to get a big number of data (about 3M rows) and I have only two options to do that.
I didn't test any of these solutions yet to tell which one is best in terms of speed.
CSV should generally be the fastest to write, JSON the easiest for a human to understand and Parquet the fastest to read. CSV is the defacto standard of a lot of data and for fair reasons; it's [relatively] easy to comprehend for both users and computers and made more accessible via Microsoft Excel.
1 Answer. Show activity on this post. This is for a simple reason that the JSON has a lot more than the CSV. If you look at the generated JSON you will see that it has a lot of keys like field1, field2 etc.
If you want to retrieve simple data as lists or rows with some columns the option #2 is the good one, you can read below a set of advantages and disadvantages:
Pros
Cons
If the data will contain complex data as tuples, arrays and structures JSON are better because:
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With