Working with set_index in Pandas DataFrame

Tags:

pandas

Using an imported CSV file, I indexed the DataFrame like this...

 rdata.set_index(['race_date', 'track_code', 'race_number', 'horse_name'])

This is what a section of the DataFrame looks like...

 race_date  track_code race_number horse_name          work_date  work_track
 2007-08-24 BM         8           Count Me Twice     2007-05-31         PLN
                                   Count Me Twice     2007-06-09         PLN
                                   Count Me Twice     2007-06-16         PLN
                                   Count Me Twice     2007-06-23         PLN
                                   Count Me Twice     2007-08-05         PLN
                                   Judge's Choice     2007-06-07          BM
                                   Judge's Choice     2007-06-14          BM
                                   Judge's Choice     2007-07-08          BM
                                   Judge's Choice     2007-08-18          BM

Why isn't the 'horse_name' column being grouped like the date, track and race? Perhaps it's by design, thus how can I slice this larger DataFrame by race to have a new DataFrame with 'horse_name' as its index?

778

asked Aug 06 '13 03:08

TravisVOX

1 Answers

It's not a bug. This is exactly how it's intended to work.

DataFrame has to show show every single item in it's data. So if the index has one level, that level will be fully expanded. If it has two levels, first level will be grouped and the second will be fully expanded, if it has tree levels, first two will be grouped and the third will be expanded, and so on.

So this is why the horse name is not grouped. How would you be able to see all the items in the DataFrame if you group also by the horse name :)

Try doing:

 rdata.set_index(['race_date', 'track_code', 'race_number'])

or:

 rdata.set_index(['race_date', 'track_code'])

You'll see that the last level of the index is always fully expanded, to enable you to see all the items in the DataFrame.

157

answered Oct 01 '22 05:10

Viktor Kerkez

Related questions
                            
                                How to connect to Facebook Graph API from Python using Requests if I do not need user access token?
                            
                                Python dependencies between groups using argparse
                            
                                Dumping HTTP requests with Flask
                            
                                python efficiency and large objects in memory
                            
                                Scrape using Beautiful Soup preserving &nbsp; entities
                            
                                Conditional replacement in pandas
                            
                                Enable Python to utilize all cores for fitting scikit-learn models
                            
                                Pandas group by operations on a data frame
                            
                                Python Pandas Accessing values from second index in multi-indexed dataframe
                            
                                Need a thread-safe asynchronous message queue
                            
                                python print vs __str__?
                            
                                locking camera in mayavi
                            
                                French and lxml text
                            
                                Numpy: outer product of n vectors
                            
                                Dynamically build a lambda function in python
                            
                                python: why does os.makedirs cause WindowsError?
                            
                                how to use QuerySelectField in flask?
                            
                                how to clean up incomplete alembic run
                            
                                Updating the x-axis values using matplotlib animation
                            
                                Convert a BaseClass object into a SubClass object idiomatically?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With