read_csv doesn't read the column names correctly on this file?

Tags:

I have a csv file as follows:

I want to save it as a dataframe with x,y axes as names, then plot it. However when I assign x,y I get a messed up DataFrame, what is happening?

column_names = ['x','y']
x = pd.read_csv('csv-file.csv', header = None, names = column_names)
print(x)

          x   y
0   0 5 NaN
1  1 10 NaN
2  2 15 NaN
3  3 20 NaN
4  4 25 NaN

I've tried without specifying None for header, to no avail.

714

asked May 31 '16 18:05

Vyraj

1 Answers

Add parameter sep="\s+" or delim_whitespace=True to read_csv:

import pandas as pd

temp=u"""0 5
1 10
2 15
3 20
4 25"""
#after testing replace io.StringIO(temp) to filename
column_names = ['x','y']
df = pd.read_csv(pd.compat.StringIO(temp), sep="\s+", header = None, names = column_names)

print (df)
   x   y
0  0   5
1  1  10
2  2  15
3  3  20
4  4  25

Or:

column_names = ['x','y']
df = pd.read_csv(pd.compat.StringIO(temp),
                 delim_whitespace=True, 
                 header = None, 
                 names = column_names)

print (df)
   x   y
0  0   5
1  1  10
2  2  15
3  3  20
4  4  25

112

answered Nov 07 '22 14:11

jezrael

Related questions
                            
                                Flask blueprint unit-testing
                            
                                Sphinx apidoc section titles for Python module/package names
                            
                                Is Python incorrectly handling this "arbitrary precision integer"?
                            
                                C array vs NumPy array
                            
                                python NameError: name '__file__' is not defined [duplicate]
                            
                                how to unstack (or pivot?) in pandas
                            
                                Dealing with the class imbalance in binary classification
                            
                                How to find and replace nth occurrence of word in a sentence using python regular expression?
                            
                                FAILED: No config file 'alembic.ini' found
                            
                                Serve image stored in SQLAlchemy LargeBinary column
                            
                                Select everything but a list of columns from pandas dataframe
                            
                                How to turn off INFO from logs in PySpark with no changes to log4j.properties?
                            
                                python re.sub, only replace part of match [duplicate]
                            
                                Retrieving public dns of EC2 instance with BOTO3
                            
                                Sqlalchemy: subquery in FROM must have an alias
                            
                                Using getattr in Jinja2 gives me an error (jinja2.exceptions.UndefinedError: 'getattr' is undefined)
                            
                                Getting csv.Sniffer to work with quoted values
                            
                                How to access Enum types in Django templates
                            
                                Django rest auth email instead of username
                            
                                Calculate max draw down with a vectorized solution in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

read_csv doesn't read the column names correctly on this file?

Tags:

python

pandas

dataframe

csv

Vyraj

People also ask

1 Answers

jezrael

Recent Activity

Donate For Us