Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Pandas - Is it possible to read_csv with no quotechar?

I am trying to read a csv file that has single instances of " in certain lines, ex:

car,"plane,jet jet,ski,"hat 

When I use pandas read_csv to read this file, it recognizes " as a quote character and does not correctly read in lines such as the one above. I would like to not have any quote character at all when I use read_csv.

I tried setting quotechar=None and quotechar='' but both spits out an error since quotechar has to be a string of length 1. Is it possible to not have a quotechar at all when using read_csv?

Thanks!

like image 524
killajoule Avatar asked Feb 02 '15 19:02

killajoule


People also ask

What is the difference between read_table and read_csv in pandas?

The difference between read_csv() and read_table() is almost nothing. In fact, the same function is called by the source: read_csv() delimiter is a comma character. read_table() is a delimiter of tab \t .

What does parse_dates do in read_csv?

By default, date columns are represented as object when loading data from a CSV file. To read the date column correctly, we can use the argument parse_dates to specify a list of date columns.


1 Answers

From the Pandas Documentation

quoting : int or csv.QUOTE_* instance, default None Control field quoting behavior per csv.QUOTE_* constants. Use one of QUOTE_MINIMAL (0), QUOTE_ALL (1), QUOTE_NONNUMERIC (2) or QUOTE_NONE (3). Default (None) results in QUOTE_MINIMAL behavior.

So you'll want to include quoting=3 as a parameter to your read_csv().

like image 59
MrAlexBailey Avatar answered Sep 19 '22 19:09

MrAlexBailey