Pandas group by on groupby to list of lists

Tags: python, pandas, dataframe

Given a dataframe structured like:

rule_id | ordering | sequence_id
   1    |    0     |     12     
   1    |    1     |     13
   1    |    1     |     14
   2    |    0     |     1
   2    |    1     |     2
   2    |    2     |     12

I need to transform it into:

rule_id |  sequences
   1    |  [[12],[13,14]]
   2    |  [[1],[2],[12]]

That seems like a simple groupby-within-a-groupby-to-list operation, but I cannot make it work in pandas.

df.groupby(['rule_id', 'ordering'])['sequence_id'].apply(list)

leaves me with

rule_id  ordering
1        0               [12]
         1            [13,14]
2        0                [1]
         1                [2]
         2               [12]

How can I apply a second groupby to combine these results into a single list of lists per rule_id?

asked Apr 03 '18 08:04

blahblah

1 Answer

Use a second groupby on the first level of the MultiIndex:

df.groupby(['rule_id', 'ordering'])['sequence_id'].apply(list).groupby(level=0).apply(list)
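For reference, here is a minimal self-contained sketch of the same approach, run against the sample data from the question. The intermediate names (inner, sequences, result) and the final reset_index step are illustrative assumptions, not part of the original answer.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "rule_id": [1, 1, 1, 2, 2, 2],
    "ordering": [0, 1, 1, 0, 1, 2],
    "sequence_id": [12, 13, 14, 1, 2, 12],
})

# Step 1: one list of sequence_id values per (rule_id, ordering) pair.
inner = df.groupby(["rule_id", "ordering"])["sequence_id"].apply(list)

# Step 2: group the resulting Series by the first MultiIndex level (rule_id)
# and collect the inner lists into an outer list.
sequences = inner.groupby(level=0).apply(list)

# Optional: turn the Series back into a two-column DataFrame.
# The column name "sequences" is chosen to match the desired output.
result = sequences.reset_index(name="sequences")
print(result)
```

Because groupby sorts group keys by default, the inner lists appear in ascending ordering within each rule_id, and the values inside each inner list keep their original row order.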

answered Oct 20 '22 08:10

jezrael
