Pandas groupby multiple columns, list of multiple columns

Tags:

I have the following data:

Invoice NoStockCode Description                         Quantity    CustomerID  Country
536365  85123A      WHITE HANGING HEART T-LIGHT HOLDER  6           17850       United Kingdom
536365  71053       WHITE METAL LANTERN                 6           17850       United Kingdom
536365  84406B      CREAM CUPID HEARTS COAT HANGER      8           17850       United Kingdom

I am trying to do a groupby so i have the following operation:

df.groupby(['InvoiceNo','CustomerID','Country'])['NoStockCode','Description','Quantity'].apply(list)

I want to get the output

|Invoice |CustomerID |Country        |NoStockCode              |Description                                                                                 |Quantity       
|536365| |17850      |United Kingdom |85123A, 71053, 84406B    |WHITE HANGING HEART T-LIGHT HOLDER, WHITE METAL LANTERN, CREAM CUPID HEARTS COAT HANGER     |6, 6, 8

Instead I get:

|Invoice |CustomerID |Country        |0         
|536365| |17850      |United Kingdom |['NoStockCode','Description','Quantity']

I have tried agg and other methods, but I haven't been able to get all of the columns to join as a list. I don't need to use the list function, but in the end I want the different columns to be lists.

488

asked Jul 29 '18 20:07

GrandmasLove

1 Answers

I can't reproduce your code right now, but I think that:

print (df.groupby(['InvoiceNo','CustomerID','Country'], 
                  as_index=False)['NoStockCode','Description','Quantity']
          .agg(lambda x: list(x)))

would give you the expected output

142

answered Sep 28 '22 03:09

Ben.T

Related questions
                            
                                sqlite3.OperationalError: no such column:
                            
                                Sort dictionary alphabetically when the key is a string (name)
                            
                                How to print BASE_DIR from settings.py from django app in terminal?
                            
                                Django Rest Framework 3.1 breaks pagination.PaginationSerializer
                            
                                Return SQLAlchemy results as dicts instead of lists
                            
                                Using pandas.Dataframe.groupby without alphabetical ordering
                            
                                Elegant way to match a string to a random color matplotlib
                            
                                psycopg2 on elastic beanstalk - can't deploy app
                            
                                why is logged_out.html not overriding in django registration?
                            
                                Difference between encoding utf-8 and utf8 in Python 3.5
                            
                                Python's closure - local variable referenced before assignment
                            
                                Terminate a Python multiprocessing program once a one of its workers meets a certain condition
                            
                                Flask session variable not persisting between requests
                            
                                PySpark — UnicodeEncodeError: 'ascii' codec can't encode character
                            
                                Dropping foreign keys in Alembic downgrade?
                            
                                Remove Multiple Blanks In DataFrame
                            
                                Check if column value is in other columns in pandas
                            
                                How to change values in a dataframe Python
                            
                                Pandas slicing excluding the end
                            
                                CondaHTTPError: HTTP 000 CONNECTION FAILED for url <https://repo.continuum.io/pk gs/r/win-64/repodata.json.bz2>

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas groupby multiple columns, list of multiple columns

Tags:

python

pandas

pandas-groupby

GrandmasLove

People also ask

1 Answers

Ben.T

Recent Activity

Donate For Us