Set level values in MultiIndex

Tags:

python

pandas

How can I set the level values of a Series, either by using a dictionary to replace the values, or just with a list of values as long as the series?

Here's a sample DataFrame:

     sector from_country to_country           0
0  Textiles          FRA        AUS   47.502096
1  Textiles          FRA        USA  431.890710
2  Textiles          GBR        AUS   83.500590
3  Textiles          GBR        USA  324.836158
4      Wood          FRA        AUS   27.515607
5      Wood          FRA        USA  276.501148
6      Wood          GBR        AUS    1.406096
7      Wood          GBR        USA    8.996177

Now set the index:

df = df.set_index(['sector', 'from_country', 'to_country']).squeeze()

For example, if I wanted to change based on the following key/value pairs:

In [69]: replace_dict = {'FRA':'France', 'GBR':'UK'}
In [70]: new_vals = [replace_dict[x] for x in df.index.get_level_values('from_country')]

I would like the output to look like:

In [68]: df.index.set_level_values(new_vals, level='from_country')
Out[68]: 
sector    from_country  to_country
Textiles  France        AUS            47.502096
                        USA           431.890710
          UK            AUS            83.500590
                        USA           324.836158
Wood      France        AUS            27.515607
                        USA           276.501148
          UK            AUS             1.406096
                        USA             8.996177

I currently do this, but it seems pretty dumb to me:

def set_index_values(df_or_series, new_values, level):
    """
    Replace the MultiIndex level `level` with `new_values`

    `new_values` must be the same length as `df_or_series`
    """
    levels = df_or_series.index.names
    retval = df_or_series.reset_index(level)
    retval[level] = new_values
    retval = retval.set_index(level, append=True).reorder_levels(levels).sortlevel().squeeze()
    return retval

598

asked Oct 01 '15 16:10

LondonRob

1 Answers

Slightly hacky, but you can do this with .index.set_levels:

In [11]: df1.index.levels[1]
Out[11]: Index(['FRA', 'GBR'], dtype='object', name='from_country')

In [12]: df1.index.levels[1].map(replace_dict.get)
Out[12]: array(['France', 'UK'], dtype=object)

In [13]: df1.index = df1.index.set_levels(df1.index.levels[1].map(replace_dict.get), "from_country")

In [14]: df1
Out[14]:
sector    from_country  to_country
Textiles  France        AUS            47.502096
                        USA           431.890710
          UK            AUS            83.500590
                        USA           324.836158
Wood      France        AUS            27.515607
                        USA           276.501148
          UK            AUS             1.406096
                        USA             8.996177
Name: 0, dtype: float64

Note: There is a way to get the level number from the name, but I don't recall it.

131

answered Sep 22 '22 13:09

Andy Hayden

Related questions
                            
                                OpenCV error with AdaptiveThreshold
                            
                                Drop the date from a matplotlib time series plot
                            
                                Export pip packages
                            
                                sklearn - model keeps overfitting
                            
                                How to tell the HTTP server to not send chunked encoding
                            
                                How to run WordCountTopology from storm-starter in Intellij
                            
                                Is there an alternative to Ansible on Python3
                            
                                How to prevent key creation through d[key] = val
                            
                                How to raise an error / return a {"foo":["This field is required."]} response in Django REST
                            
                                Multiple results for each individual row (one-to-many) with Pandas
                            
                                How to create .exe using py2exe(or pyinstaller) on Ubuntu
                            
                                Detecting unicode private use area characters with python
                            
                                Is it possible to make a module iterable in Python?
                            
                                How to get sql query from peewee?
                            
                                How to convert Pandas Index to month name
                            
                                Missing conf file for sphinx command
                            
                                How to get the number of <p> tags inside div in scrapy?
                            
                                python xgboost on mac install
                            
                                Better way to check a list for specific elements - python
                            
                                Matplotlib Navigation Toolbar: remove "Edit curves lines and axes parameters"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With