Prevent pandas from reading None as Nan

Tags:

I have cleaned a dataset and had to replace a lot of NaN values with None. After that I saved it to a new csv file, when I read the cleaned dataset back using pandas.read_csv, all the None values are represented as NaN, how can I avoid this?

460

asked Feb 03 '17 15:02

Effective_cellist

1 Answers

You can use parameter keep_default_na and na_values in read_csv and then replace strings None to values None:

import pandas as pd
from pandas.compat import StringIO

temp=u"""a,b
None,NaN
a,8"""
#after testing replace 'StringIO(temp)' to 'filename.csv'
df = pd.read_csv(StringIO(temp),keep_default_na=False,na_values=['NaN'])

print (df)
      a    b
0  None  NaN
1     a  8.0

print (type(df.a.iloc[0]))
<class 'str'>

df = df.replace({'None':None})
print (df)
      a    b
0  None  NaN
1     a  8.0

print (type(df.a.iloc[0]))
<class 'NoneType'>

111

answered Sep 20 '22 00:09

jezrael

Related questions
                            
                                Error: pip install scipy
                            
                                Conda: Package missing in current win-64 channels
                            
                                The specifics of adding a header to an api call with a swagger codegen client in python are unclear
                            
                                Why does a semicolon return an empty string in IPython? [duplicate]
                            
                                How can I convert django project to exe?
                            
                                psycopg2 copy_expert() - how to copy in a gzipped csv file?
                            
                                Pandas, Pivot error - cannot label index with null key
                            
                                Why Pocket API returns 403 Forbidden always?
                            
                                How to override queryset count() method in Django's admin list
                            
                                Create Pandas dataframe with list as values in rows
                            
                                What is the use of tf.select
                            
                                What are the metaphors underlying python's packaging vocabulary?
                            
                                Convert numpy array with floats to binary (0 or 1 integers)
                            
                                Simultaneously iterate over multiple list and capture difference in values
                            
                                Making arctan2() continuous beyond 2pi
                            
                                how to split numpy array and perform certain actions on split arrays [Python]
                            
                                Python - read 1000 lines from a file at a time
                            
                                Matplotlib 3D surface plot from 2D pandas dataframe
                            
                                Google-oauth inside Jupyter Notebook
                            
                                GIMP on Windows - executing a python-fu script from the command line

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Prevent pandas from reading None as Nan

Tags:

python

pandas

csv

nan

numpy

Effective_cellist

People also ask

1 Answers

jezrael

Recent Activity

Donate For Us