Pandas read csv, trim last two characters

Question

A csv file looks like this:

a,b,c
1,2,3, 
4,5,6, 
a,b,c,

When I read this file with pandas read_csv, the data frame looks like this:

   |---------------|
   |   | a | b | c |
   |---------------|
   | 1 | 2 | 3 |   |
   | 4 | 5 | 6 |   |
   | a | b | c |   |
   |---------------|

I think the problem is in the data: each data row ends with a comma and a space (1,2,3,space), so pandas sees four fields per row, one more than the header, and treats the first one as an unnamed column. Is there any way I can change this to:

   |-----------|
   | a | b | c |
   |-----------|
   | 1 | 2 | 3 |
   | 4 | 5 | 6 |
   | a | b | c |
   |-----------|

These files have around 50 million rows and there are many files. Is there a way to do this with minimal run time?

Scott Boston · Accepted Answer

Use the usecols parameter of pd.read_csv to read only the first three columns of the csv file.

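from io import StringIO

import pandas as pd

# Sample data from the question: each data row ends with ", "
csvtext = StringIO("""a,b,c
1,2,3, 
4,5,6, 
a,b,c, """)

# usecols keeps only the first three columns, dropping the trailing field
df = pd.read_csv(csvtext, usecols=[0,1,2])
df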

Output:

   a  b  c
0  1  2  3
1  4  5  6
2  a  b  c
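Since the question mentions many files, here is a minimal sketch of the same usecols idea that does not hard-code the number of columns. It reads only the header line to count the named columns, then passes those positions to usecols. The helper name read_header_width_csv and the temporary demo file are hypothetical, introduced only for illustration, and the sketch assumes the first line of every file is the header.

```python
import csv
import os
import tempfile

import pandas as pd


def read_header_width_csv(path):
    """Read a CSV, keeping only as many columns as the header line names.

    Hypothetical helper, not part of pandas.
    """
    # Read just the first line to learn how many named columns exist.
    with open(path, newline="") as handle:
        header = next(csv.reader(handle))
    # Selecting columns by position drops the extra trailing field, so the
    # first field of each row is no longer used as the index.
    return pd.read_csv(path, usecols=list(range(len(header))))


# Demo on a temporary file holding the sample from the question.
sample = "a,b,c\n1,2,3, \n4,5,6, \na,b,c, \n"
with tempfile.NamedTemporaryFile("w", suffix=".csv", delete=False) as tmp:
    tmp.write(sample)
    demo_path = tmp.name

try:
    df = read_header_width_csv(demo_path)
finally:
    os.remove(demo_path)

print(df)
```

For a directory of files, the helper could be called once per path and the resulting frames combined with pd.concat if a single data frame is needed.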

Tags:

python

python-3.x

pandas

python-2.7

Venkata Gogu
