Converting a dataframe to dictionary with multiple values

Tags:

I have a dataframe like

Sr.No   ID       A         B          C         D
 1     Tom     Earth    English      BMW
 2     Tom     Mars     Spanish      BMW       Green          
 3     Michael Mercury  Hindi        Audi      Yellow
 4     John    Venus    Portugese    Mercedes  Blue
 5     John             German       Audi      Red

I am trying to convert this to a dictionary by ID like :

{'ID' : 'Tom', 'A' : ['Earth', 'Mars'], 'B' : ['English', 'Spanish'], 'C' : 
                                                ['BMW', 'BMW'], 'D':['Green'] }, 

{'ID' : 'Michael', 'A' : ['Mercury'], 'B' : ['Hindi'], 'C' : ['Audi'],
                                                               'D':['Yellow']},

{'ID' : 'John', 'A' : ['Venus'], 'B' : ['Portugese', 'German'], 'C' : 
                                     ['Mercedes', 'Audi'], 'D':['Blue', 'Red'] }

This is somewhat similar to what I want.

I also tried ,

df.set_index('ID').to_dict()

but this gives me dictionary of length 5 instead of 3. Any help would be appreciated.

234

asked Aug 22 '16 11:08

Ronak Shah

2 Answers

Grouping by 'ID' and apply to_dict to each group with orient='list' comes pretty close:

df.groupby('ID').apply(lambda dfg: dfg.to_dict(orient='list')).to_dict()
Out[25]: 
{'John': {'A': ['Venus', nan],
  'B': ['Portugese', 'German'],
  'C': ['Mercedes', 'Audi'],
  'D': ['Blue', 'Red'],
  'ID': ['John', 'John'],
  'Sr.No': [4, 5]},
 'Michael': {'A': ['Mercury'],
  'B': ['Hindi'],
  'C': ['Audi'],
  'D': ['Yellow'],
  'ID': ['Michael'],
  'Sr.No': [3]},
 'Tom': {'A': ['Earth', 'Mars'],
  'B': ['English', 'Spanish'],
  'C': ['BMW', 'BMW'],
  'D': [nan, 'Green'],
  'ID': ['Tom', 'Tom'],
  'Sr.No': [1, 2]}}

It should just be a matter of formatting the result slightly.

Edit: to remove 'ID' from the dictionaries:

df.groupby('ID').apply(lambda dfg: dfg.drop('ID', axis=1).to_dict(orient='list')).to_dict()
Out[5]: 
{'John': {'A': ['Venus', nan],
  'B': ['Portugese', 'German'],
  'C': ['Mercedes', 'Audi'],
  'D': ['Blue', 'Red'],
  'Sr.No': [4, 5]},
 'Michael': {'A': ['Mercury'],
  'B': ['Hindi'],
  'C': ['Audi'],
  'D': ['Yellow'],
  'Sr.No': [3]},
 'Tom': {'A': ['Earth', 'Mars'],
  'B': ['English', 'Spanish'],
  'C': ['BMW', 'BMW'],
  'D': [nan, 'Green'],
  'Sr.No': [1, 2]}}

179

answered Sep 21 '22 10:09

IanS

You can use groupby with orient of to_dict as list and convert the resultant series to a dictionary.

df.set_index('Sr.No', inplace=True)
df.groupby('ID').apply(lambda x: x.to_dict('list')).reset_index(drop=True).to_dict()

{0: {'C': ['Mercedes', 'Audi'], 'ID': ['John', 'John'], 'A': ['Venus', nan],  
     'B': ['Portugese', 'German'], 'D': ['Blue', 'Red']}, 
 1: {'C': ['Audi'], 'ID': ['Michael'], 'A': ['Mercury'], 'B': ['Hindi'], 'D': ['Yellow']}, 
 2: {'C': ['BMW', 'BMW'], 'ID': ['Tom', 'Tom'], 'A': ['Earth', 'Mars'], 
     'B': ['English', 'Spanish'], 'D': [nan, 'Green']}}

Inorder to remove ID, you can also do:

df.groupby('ID')['A','B','C','D'].apply(lambda x: x.to_dict('list'))  \
                                 .reset_index(drop=True).to_dict()

answered Sep 20 '22 10:09

Nickil Maveli

Related questions
                            
                                Import pandas on docker with tensorflow
                            
                                How can one configure flask to be accessible via public IP interface? [duplicate]
                            
                                How can I conditionally update multiple columns in a panda dataframe
                            
                                how to get the shifted index value of a dataframe in Pandas?
                            
                                How to set the build description via Jenkins REST API or Python?
                            
                                How does the indexing of subplots work
                            
                                python flask can't find '__main__' module in ''
                            
                                Python at Synology, how to get Python3 modules installed and where is Python2.7 installed?
                            
                                how to convert column names into column values in pandas - python
                            
                                Splitting a string in pandas and join it to the old data
                            
                                Pandas, conditional column assignment based on column values
                            
                                Pandas: drop rows based on duplicated values in a list
                            
                                Add UUID's to pandas DF
                            
                                Why is matplotlib's notched boxplot folding back on itself?
                            
                                Error when creating executable file with pyinstaller
                            
                                assertRaises for method with optional parameters
                            
                                Using replace() method in python by index [duplicate]
                            
                                Django Channels
                            
                                How to create a new color image with python Imaging?
                            
                                UnicodeDecodeError on python3 [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Converting a dataframe to dictionary with multiple values

Tags:

python

dictionary

pandas

dataframe

Ronak Shah

People also ask

2 Answers

IanS

Nickil Maveli

Recent Activity

Donate For Us