How can I fix "Error tokenizing data" on pandas csv reader?

Tags:

I'm trying to read a csv file with pandas.

This file actually has only one row but it causes an error whenever I try to read it.

Something wrong seems happening in line 8 but I could hardly find the 8th line since there's clearly only one row on it.

I do like:

with codecs.open("path_to_file", "rU", "Shift-JIS", "ignore") as file:

df = pd.read_csv(file, header=None, sep="\t")
df

Then I get:

ParserError: Error tokenizing data. C error: Expected 1 fields in line 8, saw 3

I don't get what's really going on, so any of your advice will be appreciated.

595

asked Nov 12 '18 04:11

user9191983

2 Answers

I struggled with this almost a half day , I opened the csv with notepad and noticed that separate is TAB not comma and then tried belo combination.

df = pd.read_csv('C:\\myfile.csv',sep='\t', lineterminator='\r')

118

answered Sep 21 '22 21:09

Hietsh Kumar

Try df = pd.read_csv(file, header=None, error_bad_lines=False)

answered Sep 23 '22 21:09

Po Xin

Related questions
                            
                                Python: Pandas Concatenate each row into a string
                            
                                How do I change a button label created with 'interact_manual' from 'ipywidgets'? and how do I change the size and color of that button?
                            
                                Converting a column of minutes to hours and minutes python
                            
                                Keras, TensorFlow : "TypeError: Cannot interpret feed_dict key as Tensor"
                            
                                joblib parallel processing of a multiple return values function
                            
                                Remove top row from a dataframe
                            
                                Find the similarity between two string columns of a DataFrame
                            
                                propagate conditional column value in pandas
                            
                                Pandas to_sql() to update unique values in DB?
                            
                                How to filter logs from gunicorn?
                            
                                Paho MQTT Python Client: No exceptions thrown, just stops
                            
                                Finding maximum weighted edge in a networkx graph in python
                            
                                Why can I repeat the + in Python arbitrarily in a calculation?
                            
                                Numpy Random Choice not working for 2-dimentional list
                            
                                Correct way to use GeoPy Nominatim
                            
                                How to implement "positional-only parameter" in a user defined function in python?
                            
                                Create pandas dataframe from string (in csv format)
                            
                                Perspective transform with Python PIL using src / target coordinates
                            
                                Replace string in PySpark
                            
                                Align text in the putText() in OpenCV

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I fix "Error tokenizing data" on pandas csv reader?

Tags:

python

pandas

csv

tokenize

user9191983

People also ask

2 Answers

Hietsh Kumar

Po Xin

Recent Activity

Donate For Us