Convert a column containing a list of dictionaries to multiple columns in pandas dataframe

Tags:

python

pandas

I have a Pandas dataframe like :

pd.DataFrame({'a':[1,2], 'b':[[{'c':1,'d':5},{'c':3, 'd':7}],[{'c':10,'d':50}]]})
Out[2]: 
   a                                         b
0  1  [{u'c': 1, u'd': 5}, {u'c': 3, u'd': 7}]
1  2                    [{u'c': 10, u'd': 50}]

And I want to expand the 'b' column and repeat 'a' column if there are more than one element in 'b' as follow:

Out[2]: 
   a   c   d
0  1   1   5
1  1   3   7
2  2  10  50

I tried to use apply function on each row but I was not successful, apparently apply convert one row to one row.

708

asked Jul 26 '17 09:07

Ali Mirzaei

1 Answers

You can use concat with list comprehension:

df = pd.concat([pd.DataFrame(x) for x in df['b']], keys=df['a'])
       .reset_index(level=1, drop=True).reset_index()

print (df)
   a   c   d
0  1   1   5
1  1   3   7
2  2  10  50

EDIT:

If index is unique, then is possible use join for all columns:

df1 = pd.concat([pd.DataFrame(x) for x in df['b']], keys=df.index)
        .reset_index(level=1,drop=True)
df = df.drop('b', axis=1).join(df1).reset_index(drop=True)
print (df)
   a   c   d
0  1   1   5
1  1   3   7
2  2  10  50

I try simplify solution:

l = df['b'].str.len()
df1 = pd.DataFrame(np.concatenate(df['b']).tolist(), index=np.repeat(df.index, l))
df = df.drop('b', axis=1).join(df1).reset_index(drop=True)
print (df)
   a   c   d
0  1   1   5
1  1   3   7
2  2  10  50

182

answered Oct 24 '22 19:10

jezrael

Related questions
                            
                                Kivy - change FileChooser defaul location
                            
                                Unable to read MAT file with scipy
                            
                                Import .dat file in Python 3
                            
                                Is there any way to make a "for" loop in python double my index value after each iteration?
                            
                                How to parse xml from local file or url with lxml?
                            
                                Communication between Python and C#
                            
                                Pandas DataFrame to drop rows in the groupby
                            
                                Multiple async calls blocking
                            
                                Horizontal barplot in Seaborn using dataframe
                            
                                Python- Removing items
                            
                                Why matplotlib doesn't update in Anaconda to the 2.0 version
                            
                                Python how to use defaultdict fromkeys to generate a dictionary with predefined keys and empty lists
                            
                                How to play audio from outside static folder in Flask?
                            
                                Pytest with mock/pytest-mock
                            
                                Is there a way to specify a conditional type hint in Python?
                            
                                How to create a SECRET_HASH for AWS Cognito using boto3?
                            
                                How to convert a pandas dataframe into one dimensional array?
                            
                                Python tqdm and print weird printout order [duplicate]
                            
                                How to plot int to datetime on x axis using seaborn?
                            
                                Python: running pygame through Bash on Ubuntu on Windows

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With