Divide two pandas columns of lists by each other

Tags: python, pandas

I have a df like this:

col1        col2
[1,3,4,5]   [3,3,6,2]
[1,4,5,5]   [3,8,4,3]
[1,3,4,8]   [8,3,7,2]

I'm trying to divide each list in col1 by the matching list in col2, element by element, to get what's in the result column:

col1        col2        result
[1,3,4,5]   [3,3,6,2]   [.33,1,.67,2.5]
[1,4,5,5]   [3,8,4,3]   [.33,.5,1.25,1.67]
[1,3,4,8]   [8,3,7,2]   [.125,1,.57,4]

I've tried a lot of different approaches, but I always get an error.

Attempts:

#attempt1
df['col1'].div(df['col2'], axis=0)

#attempt2
from operator import truediv

for i in df.col1:
     a = np.array(df['col1'])
     for t in df.col2:
         b = np.array(df['col2'])
         x = a/b
         print(x)


#attempt3
for i in df.index:
    a = col1
    b = col2
    x = map(truediv, a, b)

#attempt4
a = col1
b = col2
result = [x/y for x, y in zip(a, b)]
#then apply to df

#attempt5
a = col1
b = col2
result = a/b
print(percent_matched)
#then #apply to df

>>>TypeError: unsupported operand type(s) for /: 'list' and 'list'

Any ideas?


asked Aug 25 '20 22:08

max

2 Answers

- Use .applymap to convert the lists in both columns to np.arrays.
- Then use .div to divide the columns.
- If result must be rounded, chain .apply(lambda x: np.round(x, 3)) (see np.round()) when calculating that column:
  - df['result'] = df.col1.div(df.col2).apply(lambda x: np.round(x, 3))

import numpy as np
import pandas as pd

data = {'col1': [[1,3,4,5], [1,4,5,5], [1,3,4,8]], 'col2': [[3,3,6,2], [3,8,4,3], [8,3,7,2]]}

df = pd.DataFrame(data)

# convert columns to arrays
df = df.applymap(np.array)

# divide the columns
df['result'] = df.col1.div(df.col2)
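
As a minimal sketch of the same steps with the rounding included, assuming the sample data from the question. It converts each column with Series.apply instead of applymap; that is a swapped-in choice rather than what the answer above uses (newer pandas releases deprecate DataFrame.applymap in favor of DataFrame.map).

```python
import numpy as np
import pandas as pd

data = {'col1': [[1, 3, 4, 5], [1, 4, 5, 5], [1, 3, 4, 8]],
        'col2': [[3, 3, 6, 2], [3, 8, 4, 3], [8, 3, 7, 2]]}
df = pd.DataFrame(data)

# convert every list to a NumPy array, one column at a time
for col in ['col1', 'col2']:
    df[col] = df[col].apply(np.array)

# the columns have object dtype, so .div applies `/` to each pair of cells;
# arrays support `/`, whereas plain lists raise the TypeError from the question
df['result'] = df['col1'].div(df['col2']).apply(lambda x: np.round(x, 3))

print(df['result'])
```

Each cell of result is then a NumPy array rather than a list; calling .tolist() on a cell converts it back if plain lists are needed.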


answered Sep 28 '22 07:09

Trenton McKinney

You can use a list comprehension with apply. This assumes that the two lists in each row have the same length.

df['result'] = df.apply(lambda x: [np.round(x['col1'][i]/x['col2'][i], 2) for i in range(len(x['col1']))], axis = 1)

    col1            col2            result
0   [1, 3, 4, 5]    [3, 3, 6, 2]    [0.33, 1.0, 0.67, 2.5]
1   [1, 4, 5, 5]    [3, 8, 4, 3]    [0.33, 0.5, 1.25, 1.67]
2   [1, 3, 4, 8]    [8, 3, 7, 2]    [0.12, 1.0, 0.57, 4.0]
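
The comprehension above could also be written with zip, which avoids indexing by position. The sketch below assumes the sample data from the question; the helper name divide_lists is hypothetical, and the explicit length check is an added assumption about how mismatched rows should be handled (zip alone would silently truncate to the shorter list).

```python
import pandas as pd

df = pd.DataFrame({'col1': [[1, 3, 4, 5], [1, 4, 5, 5], [1, 3, 4, 8]],
                   'col2': [[3, 3, 6, 2], [3, 8, 4, 3], [8, 3, 7, 2]]})


def divide_lists(a, b, ndigits=2):
    """Divide two equal-length lists element by element (hypothetical helper)."""
    if len(a) != len(b):
        raise ValueError(f"length mismatch: {len(a)} != {len(b)}")
    return [round(x / y, ndigits) for x, y in zip(a, b)]


# pair up the cells of the two columns row by row
df['result'] = [divide_lists(a, b) for a, b in zip(df['col1'], df['col2'])]

print(df)
```

A zero in col2 would raise ZeroDivisionError here, since this version uses plain Python division rather than NumPy.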

Edit: As @TrentonMcKinney suggested, this can be done without a list comprehension. This solution takes advantage of NumPy's vectorized operations:

df.apply(lambda x: np.round(np.array(x['col1']) / np.array(x['col2']), 3), axis=1)
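
If every list in both columns has the same length, the row-wise apply can probably be skipped altogether by stacking each column into a 2-D array and dividing once. This is a sketch under that assumption, using the sample data from the question; ragged rows would not stack into a regular array.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'col1': [[1, 3, 4, 5], [1, 4, 5, 5], [1, 3, 4, 8]],
                   'col2': [[3, 3, 6, 2], [3, 8, 4, 3], [8, 3, 7, 2]]})

# stack the lists of each column into a 2-D float array (rows x list length)
a = np.array(df['col1'].tolist(), dtype=float)
b = np.array(df['col2'].tolist(), dtype=float)

# one vectorized division for the whole frame, then back to one list per row
df['result'] = np.round(a / b, 3).tolist()

print(df)
```

Here result holds plain Python lists again, matching the type of the input columns.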

answered Sep 28 '22 06:09

Vaishali
