Suppose we have this simple pandas.DataFrame: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame( columns=['quantity', 'value'], data=[[1, 12.5], [3, 18.0]] ) >>> print(df) quantity value 0 1 12.5 1 3 18.0 </code></pre> I would like to create a new column, say <code>modified_value</code>, that applies a function N times to the <code>value</code> column, N being the <code>quantity</code> column. Suppose that function is <code>new_value = round(value/2, 1)</code>, the expected result would be: <pre class="prettyprint"><code> quantity value modified_value 0 1 12.5 6.2 # applied 1 time 1 3 9.0 1.1 # applied 3 times, 9.0 -> 4.5 -> 2.2 -> 1.1 </code></pre> What would be an elegant/vectorized way to do so?

Use <code>reduce</code>: <pre class="prettyprint"><code>from functools import reduce def repeated(f, n): def rfun(p): return reduce(lambda x, _: f(x), range(n), p) return rfun </code></pre> <hr> <pre class="prettyprint"><code>def myfunc(value): return round(value/2, 1) </code></pre> <hr> <pre class="prettyprint"><code>df['modified_valued'] = df.apply(lambda x: repeated(myfunc, int(x['quantity']))(x['value']), axis=1) </code></pre> We can also use list comprehension instead <code>apply</code> <pre class="prettyprint"><code>df['modified_valued'] = [repeated(myfunc, int(quantity))(value) for quantity, value in zip (df['quantity'], df['value'])] </code></pre> Output <pre class="prettyprint"><code> quantity value modified_valued 0 1 12.5 6.2 1 3 18.0 2.2 </code></pre>

Applying/Composing a function N times to a pandas column, N being different for each row

Tags:

python

pandas

Suppose we have this simple pandas.DataFrame:

import pandas as pd

df = pd.DataFrame(
  columns=['quantity', 'value'],
  data=[[1, 12.5], [3, 18.0]]
)

>>> print(df)
   quantity  value
0         1   12.5
1         3   18.0

I would like to create a new column, say modified_value, that applies a function N times to the value column, N being the quantity column. Suppose that function is new_value = round(value/2, 1), the expected result would be:

   quantity  value  modified_value
0         1   12.5            6.2   # applied 1 time
1         3   9.0             1.1   # applied 3 times, 9.0 -> 4.5 -> 2.2 -> 1.1

What would be an elegant/vectorized way to do so?

809

asked Mar 31 '20 14:03

pierre_loic

2 Answers

You can write a custom repeat function, then use apply:

def repeat(func, x, n):
    ret = x
    for i in range(int(n)):
        ret = func(ret)

    return ret

def my_func(val): return round(val/2, 1)

df['new_col'] = df.apply(lambda x: repeat(my_func, x['value'], x['quantity']), 
                         axis=1)

# or without apply
# df['new_col'] = [repeat(my_func, v, n) for v,n in zip(df['value'], df['quantity'])]

156

answered Oct 19 '22 18:10

Quang Hoang

Use reduce:

from functools import reduce
def repeated(f, n):
    def rfun(p):
        return reduce(lambda x, _: f(x), range(n), p)
    return rfun

def myfunc(value): return  round(value/2, 1)

df['modified_valued'] = df.apply(lambda x: repeated(myfunc,
                                                    int(x['quantity']))(x['value']),
                                 axis=1)

We can also use list comprehension instead apply

df['modified_valued'] = [repeated(myfunc, int(quantity))(value) 
                         for quantity, value in zip (df['quantity'], df['value'])]

Output

   quantity  value  modified_valued
0         1   12.5              6.2
1         3   18.0              2.2

answered Oct 19 '22 19:10

ansev

Related questions
                            
                                How to define date in GORM
                            
                                Vuetify v-treeview - How to open programmatically a node?
                            
                                Applying WebACL to API Gateway
                            
                                How can I prioritize the y scale I get from geom_histogram?
                            
                                How to convert string to DateTime in C# EF Core query
                            
                                Angular reactive forms - validate on blur but update model while typing
                            
                                How can I make Radio Buttons and Check Boxes change size relative to the system resolution?
                            
                                How to induce reactivity when updating multiple props in an object using VueJS?
                            
                                Why can you only use expressions in JSX, not statements?
                            
                                What can be a dependency for React hooks?
                            
                                How to add greek letters to a label in geom_text() label in ggplot2 [duplicate]
                            
                                How to detect failure and reset/restart webassembly Module?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With