Using pandas groupby().apply(list) on multiple columns at once [duplicate]

I'm trying to combine multiple rows of a dataframe into one row, with the columns whose values differ between rows being collected into lists. There are several such columns.

df.groupby('a')['b'].apply(list) works well if only one column ('b' in this instance) needs to be turned into a list, but I can't figure out how to do it for multiple columns.

Dataframe:

   a  b  c       d
0  1  b  1   first
1  1  b  2  second
2  2  c  1   third
3  2  c  2  fourth
4  2  c  3   fifth
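
To reproduce this, the frame above can be built roughly as follows. This is a minimal sketch: the dtypes are assumed from the printed output, and the single-column call uses 'c' instead of 'b' only so the resulting lists are easier to tell apart.

```python
import pandas as pd

# Sample frame matching the printed table (dtypes are an assumption).
df = pd.DataFrame({
    'a': [1, 1, 2, 2, 2],
    'b': ['b', 'b', 'c', 'c', 'c'],
    'c': [1, 2, 1, 2, 3],
    'd': ['first', 'second', 'third', 'fourth', 'fifth'],
})

# Single-column case that already works: one list of 'c' values per value of 'a'.
c_lists = df.groupby('a')['c'].apply(list)
print(c_lists)
```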

Preferred dataframe after the operation:

   a  b          c                       d
0  1  b     [1, 2]         [first, second]
1  2  c  [1, 2, 3]  [third, fourth, fifth]

Is there an easy way to do this?

774

asked May 13 '19 09:05

MvR

1 Answer

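import pandas as pd
# reset_index() moves the group keys 'a' and 'b' back into columns, so the four names below match
df = df.groupby(['a','b']).apply(lambda x: [list(x['c']), list(x['d'])]).apply(pd.Series).reset_index()
df.columns = ['a','b','c','d']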

Output

   a  b          c                       d
0  1  b     [1, 2]         [first, second]
1  2  c  [1, 2, 3]  [third, fourth, fifth]
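
As an alternative to the lambda above, the same result can probably be reached with agg, passing list as the aggregation for each column to collect. This is a sketch rather than the answer's own method; it assumes 'a' and 'b' are constant within each group (as in the sample data) and rebuilds the sample frame so it runs on its own.

```python
import pandas as pd

# Sample frame from the question (dtypes are an assumption).
df = pd.DataFrame({
    'a': [1, 1, 2, 2, 2],
    'b': ['b', 'b', 'c', 'c', 'c'],
    'c': [1, 2, 1, 2, 3],
    'd': ['first', 'second', 'third', 'fourth', 'fifth'],
})

# Group on the columns that stay constant within a group, collect the
# remaining columns into lists, then turn the group keys back into columns.
result = (
    df.groupby(['a', 'b'])
      .agg({'c': list, 'd': list})
      .reset_index()
)
print(result)
```

Listing the columns explicitly in the dict keeps the intent visible; with many columns, building that dict from df.columns may be more convenient.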

141

answered Sep 22 '22 19:09

iamklaus

Tags: python, pandas, dataframe, pandas-groupby, apply
