I have a Dataframe df like this: <pre class="prettyprint"><code> A B C D 2 1 O s h 4 2 P 7 3 Q 9 4 R h m </code></pre> I have a function f to calculate C and D based on B for a row: <pre class="prettyprint"><code>def f(p): #p is the value of column B for a row. return p+'k', p+'n' </code></pre> How can I populate the missing values for row 4&7 by applying the function f to the Dataframe? The expected outcome is like below: <pre class="prettyprint"><code> A B C D 2 1 O s h 4 2 P Pk Pn 7 3 Q Qk Qn 9 4 R h m </code></pre> The function f has to be used as the real function is very complicated. Also, the function only needs to be applied to the rows missing C and D

Maybe there is a more elegant way, but I would do in this way: <pre class="prettyprint"><code>df['C'] = df['B'].apply(lambda x: f(x)[0]) df['D'] = df['B'].apply(lambda x: f(x)[1]) </code></pre> Applying the function to the columns and get the first and the second value of the outputs. It returns: <pre class="prettyprint"><code> A B C D 0 1 O Ok On 1 2 P Pk Pn 2 3 Q Qk Qn 3 4 R Rk Rn </code></pre> EDIT: In a more concise way, thanks to this answer: <pre class="prettyprint"><code>df[['C','D']] = df['B'].apply(lambda x: pd.Series([f(x)[0],f(x)[1]])) </code></pre>

Pandas Dataframe: How to update multiple columns by applying a function?

Tags:

python

pandas

I have a Dataframe df like this:

   A   B   C    D
2  1   O   s    h
4  2   P    
7  3   Q
9  4   R   h    m

I have a function f to calculate C and D based on B for a row:

def f(p): #p is the value of column B for a row. 
     return p+'k', p+'n'

How can I populate the missing values for row 4&7 by applying the function f to the Dataframe?

The expected outcome is like below:

   A   B   C    D
2  1   O   s    h
4  2   P   Pk   Pn
7  3   Q   Qk   Qn
9  4   R   h    m

The function f has to be used as the real function is very complicated. Also, the function only needs to be applied to the rows missing C and D

458

asked Sep 16 '15 08:09

John Smith

1 Answers

Maybe there is a more elegant way, but I would do in this way:

df['C'] = df['B'].apply(lambda x: f(x)[0])
df['D'] = df['B'].apply(lambda x: f(x)[1])

Applying the function to the columns and get the first and the second value of the outputs. It returns:

   A  B   C   D
0  1  O  Ok  On
1  2  P  Pk  Pn
2  3  Q  Qk  Qn
3  4  R  Rk  Rn

EDIT:

In a more concise way, thanks to this answer:

df[['C','D']] = df['B'].apply(lambda x: pd.Series([f(x)[0],f(x)[1]]))

167

answered Oct 21 '22 07:10

Fabio Lamanna

Related questions
                            
                                Time a while loop python
                            
                                Handling with multiple domains in Flask
                            
                                Scrapy: Define items dynamically
                            
                                Why does S3 (using with boto and django-storages) give signed url even for public files?
                            
                                Selenium webdriver and unicode
                            
                                Python/PIL affine transformation
                            
                                Detect key input in Python
                            
                                Django template for loop
                            
                                Resetting the expiration time for a cookie in Flask
                            
                                How to make markers on lines smaller in matplotlib?
                            
                                Python - Conversion of list of arrays to 2D array
                            
                                How to iterate through a module's functions [duplicate]
                            
                                How to filter filter_horizontal in Django admin?
                            
                                whitespace in regular expression
                            
                                PDB: How to inspect local variables of functions in nested stack frames?
                            
                                matplotlib animation movie: quality of movie decreasing with time
                            
                                sklearn: use Pipeline in a RandomizedSearchCV?
                            
                                How to make two markers share the same label in the legend using matplotlib?
                            
                                Print exception with stack trace to file
                            
                                Error with Sklearn Random Forest Regressor

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With