I understand that Pandas can read and write Parquet files using two different backends: pyarrow and fastparquet.

I have a Conda environment based on the Intel distribution and "it works": I can use pandas.DataFrame.to_parquet. However, I do not have pyarrow installed, so I guessed that fastparquet was being used, but I cannot find that package either.

Is there a way to identify which backend is used?
Pandas provides a beautiful Parquet interface. Pandas leverages the PyArrow library to write Parquet files, but you can also write Parquet files directly from PyArrow.
The to_parquet() function writes a DataFrame to the binary Parquet format. Its first argument is a file path, or a root directory path when writing a partitioned dataset.
Reported benchmarks show pyarrow to be faster than fastparquet, so it is little wonder that pyarrow is the default engine used in Dask.
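To answer the original question of which backend your environment actually has, one standard-library-only check is to see which engine packages are importable; pandas resolves engine="auto" by trying pyarrow first, then fastparquet:

```python
import importlib.util

# Whichever of these is importable is what engine="auto" will use;
# pyarrow takes priority if both are present.
for name in ("pyarrow", "fastparquet"):
    spec = importlib.util.find_spec(name)
    print(name, "available" if spec is not None else "missing")
```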
Just execute these two commands in a Linux shell/bash:
pip install pyarrow
pip install fastparquet
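After installing, you can confirm which backend versions ended up in the environment; this sketch uses the standard-library importlib.metadata:

```python
from importlib import metadata

# Print the installed version of each Parquet backend, if present.
for pkg in ("pyarrow", "fastparquet"):
    try:
        print(pkg, metadata.version(pkg))
    except metadata.PackageNotFoundError:
        print(pkg, "not installed")
```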