New posts in fastparquet

Streaming parquet file python and only downsampling

Why index name always appears in the parquet file created with pandas?

How to read multiple parquet files (with same schema) from multiple directories with dask/fastparquet

dask fastparquet

Fastparquet giving "TypeError: expected str, bytes or os.PathLike object, not _io.BytesIO" while using dataframe.to_parquet()

pandas to_parquet fails on large datasets

How to open huge parquet file using Pandas without enough RAM

filtering with dask read_parquet method gives unwanted results

Unable to read a parquet file

error with snappy while importing fastparquet in python

Pandas dataframe type datetime64[ns] is not working in Hive/Athena

pyarrow.lib.ArrowInvalid: ('Could not convert X with type Y: did not recognize Python value type when inferring an Arrow data type')

Is saving a HUGE dask dataframe into parquet possible?

Does any Python library support writing arrays of structs to Parquet files?

Decompression 'SNAPPY' not available with fastparquet

A comparison between fastparquet and pyarrow?

How to read partitioned parquet files from S3 using pyarrow in python