Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Can parquet support concurrent write operations?

Tags:

parquet

is it possible to perform distributed concurrent writes to parquet format?

And is it possible to read parquet files while they are being written?

If there are methods for concurrent read/writes I'd be interested to learn about.

Thanks in advance for you help.

like image 543
Loic Avatar asked Aug 09 '15 22:08

Loic


1 Answers

I eventually had an answer from Parquet developers: answer is no to both questions:

Parquet writers are not thread-safe and files cannot be read or written by different readers or writers concurrently. Parquet doesn't expose flush/sync operations to the user (for good reason) so there isn't a way to reliably do this anyway.

like image 67
Loic Avatar answered Jan 04 '23 08:01

Loic