Read all but last line of CSV file in pandas

Tags:

I have CSV files which I read in in pandas with:

#!/usr/bin/env python  import pandas as pd import sys  filename = sys.argv[1] df = pd.read_csv(filename)

Unfortunately, the last line of these files is often corrupt (has the wrong number of commas). Currently I open each file in a text editor and remove the last line.

Is it possible to remove the last line in the same python/pandas script that loads the CSV to save having to take this extra non-automated step?

209

asked Nov 13 '15 09:11

graffe

1 Answers

pass error_bad_lines=False and it will skip this line automatically

df = pd.read_csv(filename, error_bad_lines=False)

The advantage of error_bad_lines is it will skip and not bork on any erroneous lines but if the last line is always duff then skipfooter=1 is better

Thanks to @DexterMorgan for pointing out that skipfooter option forces the engine to use the python engine which is slower than the c engine for parsing a csv.

165

answered Sep 17 '22 21:09

EdChum

Related questions
                            
                                pandas rename index values
                            
                                What does the function control_dependencies do?
                            
                                Error detected while processing BufRead Auto commands for "*.py"
                            
                                what does axes.flat in matplotlib do?
                            
                                Is there a difference between capital and lowercase string prefixes?
                            
                                Matplotlib: TypeError: 'AxesSubplot' object is not subscriptable [duplicate]
                            
                                How to sort one list based on another? [duplicate]
                            
                                Django model class methods for predefined values
                            
                                ctypes loading a c shared library that has dependencies
                            
                                Exploitable Python Functions [closed]
                            
                                Overflow in exp in scipy/numpy in Python?
                            
                                Remove Max and Min values from python list of integers
                            
                                Python: How to get group ids of one username (like id -Gn )
                            
                                How to convert an image from np.uint16 to np.uint8?
                            
                                Why does json.dumps(list(np.arange(5))) fail while json.dumps(np.arange(5).tolist()) works
                            
                                How to set and get a parent class attribute from an inherited class in Python?
                            
                                Animate a rotating 3D graph in matplotlib
                            
                                Android Market API - Python ImportError: No module named google.protobuf
                            
                                Is "norm" equivalent to "Euclidean distance"?
                            
                                Impute entire DataFrame (all columns) using Scikit-learn (sklearn) without iterating over columns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Read all but last line of CSV file in pandas

Tags:

python

pandas

dataframe

graffe

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us