This is my dataframe: <pre class="prettyprint"><code>> df a b 0 1 set([2, 3]) 1 2 set([2, 3]) 2 3 set([4, 5, 6]) 3 1 set([1, 34, 3, 2]) </code></pre> Now when I <code>groupby</code>, I want to update sets. If it was a <code>list</code> there was no problem. But the output of my command is: <pre class="prettyprint"><code>> df.groupby('a').sum() a b 1 NaN 2 set([2, 3]) 3 set([4, 5, 6]) </code></pre> What should I do in groupby to update sets? The output I'm looking for is as below: <pre class="prettyprint"><code>a b 1 set([2, 3, 1, 34]) 2 set([2, 3]) 3 set([4, 5, 6]) </code></pre>

This might be close to what you want <pre class="prettyprint"><code>df.groupby('a').apply(lambda x: set.union(*x.b)) </code></pre> In this case it takes the union of the sets. If you need to keep the column names you could use: <pre class="prettyprint"><code>df.groupby('a').agg({'b':lambda x: set.union(*x)}).reset_index('a') </code></pre> Result: <pre class="prettyprint"><code> a b 0 1 set([1, 2, 3, 34]) 1 2 set([2, 3]) 2 3 set([4, 5, 6]) </code></pre>

how to concat sets when using groupby in pandas dataframe?

Tags:

python

pandas

This is my dataframe:

Click to copy

> df
       a             b
    0  1         set([2, 3])
    1  2         set([2, 3])
    2  3      set([4, 5, 6])
    3  1  set([1, 34, 3, 2])

Now when I groupby, I want to update sets. If it was a list there was no problem. But the output of my command is:

Click to copy

> df.groupby('a').sum()

a         b                
1             NaN
2     set([2, 3])
3  set([4, 5, 6])

What should I do in groupby to update sets? The output I'm looking for is as below:

Click to copy

a         b                
1     set([2, 3, 1, 34])
2     set([2, 3])
3     set([4, 5, 6])

712

asked Oct 06 '15 10:10

Alireza

1 Answers

This might be close to what you want

Click to copy

df.groupby('a').apply(lambda x: set.union(*x.b))

In this case it takes the union of the sets.

If you need to keep the column names you could use:

Click to copy

df.groupby('a').agg({'b':lambda x: set.union(*x)}).reset_index('a')

Result:

Click to copy

    a   b
0   1   set([1, 2, 3, 34])
1   2   set([2, 3])
2   3   set([4, 5, 6])

196

answered Nov 08 '22 18:11

matt_s

Related questions
                            
                                Python operator precedence - and vs greater than
                            
                                Read/write values using Ethernet/IP
                            
                                Using Python Virtual Environments with Terminator
                            
                                How to change background color in ttk.Combobox's listview?
                            
                                How do I change the plot size of a regplot in Seaborn?
                            
                                Python while loop condition check for string
                            
                                How to use a specific data structure as the default_factory for a defaultdict?
                            
                                Resizing matplotlib figure with set_fig(width/height) doesn't work
                            
                                Converting python string to datetime obj with AM/PM
                            
                                WTForms : How to add "autofocus" attribute to a StringField
                            
                                String conditional formatting "equal to" in Excel using Python's xlsxwriter
                            
                                Adding a 1-D Array to a 3-D array in Numpy
                            
                                Hide ipython notebook prompt
                            
                                python matplotlib: drawing 3D sphere with circumferences
                            
                                Slicing outside numpy array
                            
                                Python unittesting for a class method
                            
                                Remove NaN and convert to float32 in Python Pandas
                            
                                Django 1.8 getting kwargs in Serializer
                            
                                setattr and getattr with methods
                            
                                Why is it "easier to ask forgiveness than it is to get permission" in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to concat sets when using groupby in pandas dataframe?

Tags:

python

pandas

Alireza

People also ask

1 Answers

matt_s

Recent Activity

Donate For Us