Converting Index into MultiIndex (hierarchical index) in Pandas

Tags:

pandas

In the data I am working with the index is compound - i.e. it has both item name and a timestamp, e.g. [email protected]|2013-05-07 05:52:51 +0200.

I want to do hierarchical indexing, so that the same e-mails are grouped together, so I need to convert a DataFrame Index into a MultiIndex (e.g. for the entry above - ([email protected], 2013-05-07 05:52:51 +0200)).

What is the most convenient method to do so?

487

asked Jul 23 '13 19:07

Piotr Migdal

1 Answers

Once we have a DataFrame

import pandas as pd
df = pd.read_csv("input.csv", index_col=0)  # or from another source

and a function mapping each index to a tuple (below, it is for the example from this question)

def process_index(k):
    return tuple(k.split("|"))

we can create a hierarchical index in the following way:

df.index = pd.MultiIndex.from_tuples([process_index(k) for k,v in df.iterrows()])

An alternative approach is to create two columns then set them as the index (the original index will be dropped):

df['e-mail'] = [x.split("|")[0] for x in df.index] 
df['date'] = [x.split("|")[1] for x in df.index]
df = df.set_index(['e-mail', 'date'])

or even shorter

df['e-mail'], df['date'] = zip(*map(process_index, df.index))
df = df.set_index(['e-mail', 'date'])

answered Sep 26 '22 06:09

Piotr Migdal

Related questions
                            
                                Any tips on writing testing-friendly code?
                            
                                How to output coverage XML with nosetests?
                            
                                How to use variables already defined in ConfigParser
                            
                                Use ImageMagick with python. (on a linux system) [duplicate]
                            
                                how to change 39.54484700000000 to 39.54 and using python [duplicate]
                            
                                Python: Sort list with parallel list
                            
                                setting color range in matplotlib patchcollection
                            
                                Why doesn't the operator module have a function for logical or?
                            
                                Regex/code for removing "FWD", "RE", etc, from email subject
                            
                                Python sax to lxml for 80+GB XML
                            
                                Handling an undefined template variable in Tornado
                            
                                Reading 3 bytes as an integer
                            
                                python multi-threading slower than serial?
                            
                                Using include to dynamically point to HTML
                            
                                Python - converting sock.recv to string
                            
                                prevent subprocess.Popen from displaying output in python
                            
                                Get the "bits" of a float in Python?
                            
                                Is concurrent.futures a medicine of the GIL?
                            
                                Reversed array in numpy?
                            
                                Python 2: AttributeError: 'list' object has no attribute 'strip'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Converting Index into MultiIndex (hierarchical index) in Pandas

Tags:

python

pandas

Piotr Migdal

People also ask

1 Answers

Piotr Migdal

Recent Activity

Donate For Us