How to convert pandas dataframe to hierarchical dictionary

Tags:

python

pandas

I have the following pandas dataframe:

df1 = pd.DataFrame({'date': [200101,200101,200101,200101,200102,200102,200102,200102],'blockcount': [1,1,2,2,1,1,2,2],'reactiontime': [350,400,200,250,100,300,450,400]})

I am trying to create a hierarchical dictionary, with the values of the embedded dictionary as lists, that looks like this:

{200101: {1:[350, 400], 2:[200, 250]}, 200102: {1:[100, 300], 2:[450, 400]}}

How would I do this? The closest I get is using this code:

df1.set_index('date').groupby(level='date').apply(lambda x: x.set_index('blockcount').squeeze().to_dict()).to_dict()

Which returns:

{200101: {1: 400, 2: 250}, 200102: {1: 300, 2: 400}}

858

asked Jan 20 '20 02:01

alechay

1 Answers

Here is another way using pivot_table:

d = df1.pivot_table(index='blockcount',columns='date',
     values='reactiontime',aggfunc=list).to_dict()

print(d)

{200101: {1: [350, 400], 2: [200, 250]},
 200102: {1: [100, 300], 2: [450, 400]}}

200

answered Sep 22 '22 18:09

anky

Related questions
                            
                                Python Connection to Hive
                            
                                How do you get the first 3 elements in Python OrderedDict?
                            
                                What is the checkmark icon next to my project in PyCharm?
                            
                                Assertion failure : size.width>0 && size.height>0 in function imshow
                            
                                Python Pandas slice multiindex by second level index (or any other level)
                            
                                Custom double star operator for a class?
                            
                                cx_Oracle: How can I receive each row as a dictionary?
                            
                                Fit mixture of two gaussian/normal distributions to a histogram from one set of data, python
                            
                                how should i read a csv file without the 'unnamed' row with pandas? [duplicate]
                            
                                Custom padding for convolutions in TensorFlow
                            
                                ValueError: num must be 1 <= num <= 2, not 3
                            
                                Identify if list has consecutive elements that are equal
                            
                                Apply a list of Python functions in order elegantly
                            
                                python SyntaxError: invalid syntax %matplotlib inline
                            
                                Equivalent to time.sleep for a PyQt application
                            
                                Google & Oauthlib - Scope has changed
                            
                                Openpyxl: How to add filters to all columns
                            
                                "error: Unable to find vcvarsall.bat" when compiling Cython code
                            
                                spaCy and spaCy models in setup.py
                            
                                Finding highest value in a dictionary

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With