I am new to Pandas for Python and am busy reading a csv file. Unfortunately the Excel file has some cells with #VALUE! and #DIV/0! in them. I cannot fix this in Excel because the data is pulled from other sheets. Pandas turns these columns into <code>objects</code> instead of <code>numpy64</code>, so I cannot plot from them. I want to replace the #VALUE! and #DIV/0! strings with NaN entries in Pandas, however i cannot find how to do this. I have tried the following (my code runs, but it changes nothing): <pre class="prettyprint"><code>import pandas as pd import numpy as np df = pd.read_csv('2013AllData.csv') df.replace('#DIV/0!', np.nan) </code></pre>

Rather than replacing after loading, just set the param <code>na_values</code> when reading the csv in and it will convert them to <code>NaN</code> values when the df is created: <pre class="prettyprint"><code>df = pd.read_csv('2013AllData.csv', na_values=['#VALUE!', '#DIV/0!']) </code></pre> Check the docs: http://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_csv.html#pandas.read_csv

Pandas read csv replacing #DIV/0! and #VALUE! with NaN

Tags:

python

replace

pandas

csv

excel

I am new to Pandas for Python and am busy reading a csv file. Unfortunately the Excel file has some cells with #VALUE! and #DIV/0! in them. I cannot fix this in Excel because the data is pulled from other sheets. Pandas turns these columns into objects instead of numpy64, so I cannot plot from them. I want to replace the #VALUE! and #DIV/0! strings with NaN entries in Pandas, however i cannot find how to do this. I have tried the following (my code runs, but it changes nothing):

import pandas as pd
import numpy as np
df = pd.read_csv('2013AllData.csv')
df.replace('#DIV/0!', np.nan)

876

asked Jan 20 '15 10:01

nicolejane33

1 Answers

Rather than replacing after loading, just set the param na_values when reading the csv in and it will convert them to NaN values when the df is created:

df = pd.read_csv('2013AllData.csv', na_values=['#VALUE!', '#DIV/0!'])

Check the docs: http://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_csv.html#pandas.read_csv

187

answered Oct 01 '22 06:10

EdChum

Related questions
                            
                                Count and print number of files in subfolders using Python
                            
                                making a stacked barchart in pandas
                            
                                How to check if a timestamp is a whole hour
                            
                                How to get a brief summary or exactly the number of errors and warnings using pylint in python
                            
                                Display new window in second monitor, opencv
                            
                                AttributeError: 'list' object has no attribute 'split'
                            
                                PyQt - get list of all checked in QTreeWidget
                            
                                Fill matrix with transposed version
                            
                                Django rest framework custom filter for POST request
                            
                                NaNs comparing equal in Numpy
                            
                                Handling maximum recursion depth exceeded
                            
                                Pythonic way of getting all consecutive 2-tuples from list
                            
                                Project Scipy Voronoi diagram from 3d to 2d
                            
                                how do I create a python list with a negative index
                            
                                Getting text of a table quickly in Selenium
                            
                                Scrapy with selenium, webdriver failing to instantiate
                            
                                Get month by week, day and year
                            
                                Set Working Directory to Notebook Directory
                            
                                Celery pickle type content disallowed error
                            
                                How to run a single deploy when Travis builds succeeds?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With