Pandas dataframe fillna() only some columns in place

Tags: python, pandas, dataframe

I am trying to fill None values in a Pandas dataframe with 0, but only for a subset of columns.

When I do:

    import pandas as pd

    df = pd.DataFrame(data={'a':[1,2,3,None],'b':[4,5,None,6],'c':[None,None,7,8]})
    print(df)
    df.fillna(value=0, inplace=True)
    print(df)

The output:

         a    b    c
    0  1.0  4.0  NaN
    1  2.0  5.0  NaN
    2  3.0  NaN  7.0
    3  NaN  6.0  8.0

         a    b    c
    0  1.0  4.0  0.0
    1  2.0  5.0  0.0
    2  3.0  0.0  7.0
    3  0.0  6.0  8.0

It replaces every None with 0. What I want is to replace None only in columns a and b, but not in c.

What is the best way of doing this?

265

asked Jun 30 '16 22:06

Sait

2 Answers

You can select your desired columns and do it by assignment:

    df[['a', 'b']] = df[['a', 'b']].fillna(value=0)

The resulting output is as expected:

         a    b    c
    0  1.0  4.0  NaN
    1  2.0  5.0  NaN
    2  3.0  0.0  7.0
    3  0.0  6.0  8.0
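To check that only the selected columns were touched, here is a minimal sketch of the same assignment followed by a count of the remaining missing values. The names `cols` and `remaining` are my own, chosen for illustration, and the frame is the one from the question.

```python
import pandas as pd

# Same example frame as in the question.
df = pd.DataFrame({'a': [1, 2, 3, None],
                   'b': [4, 5, None, 6],
                   'c': [None, None, 7, 8]})

# 'cols' is a hypothetical name for the subset of columns to fill.
cols = ['a', 'b']
df[cols] = df[cols].fillna(0)

# Count the missing values left in each column.
remaining = df.isna().sum()
print(remaining)
```

Columns a and b should report no missing values, while c keeps its two NaNs.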

186

answered Oct 11 '22 20:10

root

You can pass a dict to fillna to give a different fill value for each column. Columns that are not keys in the dict are left untouched:

    df.fillna({'a':0,'b':0})
    Out[829]:
         a    b    c
    0  1.0  4.0  NaN
    1  2.0  5.0  NaN
    2  3.0  0.0  7.0
    3  0.0  6.0  8.0

Then assign the result back to df:

    df=df.fillna({'a':0,'b':0})
    df
    Out[831]:
         a    b    c
    0  1.0  4.0  NaN
    1  2.0  5.0  NaN
    2  3.0  0.0  7.0
    3  0.0  6.0  8.0
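Since the question asks for an in-place fill, the dict form can also be combined with `inplace=True`, as far as I know. The sketch below assumes that, and uses -1 for column b purely to show that each column can get its own value; that value is my own choice and not from the original answer.

```python
import pandas as pd

# Same example frame as in the question.
df = pd.DataFrame({'a': [1, 2, 3, None],
                   'b': [4, 5, None, 6],
                   'c': [None, None, 7, 8]})

# Fill a and b in place; c is not a key in the dict, so it is skipped.
# The -1 for column b is an illustrative value only.
df.fillna({'a': 0, 'b': -1}, inplace=True)

print(df)
```

The return value of the in-place call is deliberately not used here, so the sketch relies only on df itself having been modified.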

answered Oct 11 '22 21:10

BENY
