Pandas concat dictionary to dataframe

Tags:

I have an existing dataframe and I'm trying to concatenate a dictionary where the length of the dictionary is different from the dataframe

>>> df
         A        B        C
0  0.46324  0.32425  0.42194
1  0.10596  0.35910  0.21004
2  0.69209  0.12951  0.50186
3  0.04901  0.31203  0.11035
4  0.43104  0.62413  0.20567
5  0.43412  0.13720  0.11052
6  0.14512  0.10532  0.05310

and

test = {"One": [0.23413, 0.19235, 0.51221], "Two": [0.01293, 0.12235, 0.63291]}

I'm trying to add test to df, while changing the keys to "D" and "C" and I've had a look at

http://pandas.pydata.org/pandas-docs/stable/merging.html and http://pandas.pydata.org/pandas-docs/stable/generated/pandas.concat.html

which indicates that I should be able to concatenate the dictionary to the dataframe

I've tried:

pd.concat([df, test], axis=1, ignore_index=True, keys=["D", "E"])
pd.concat([df, test], axis=1, ignore_index=True)

but I'm not having any luck, the result I'm trying to achieve is

df
         A        B        C        D        E
0  0.46324  0.32425  0.42194  0.23413  0.01293  
1  0.10596  0.35910  0.21004  0.19235  0.12235
2  0.69209  0.12951  0.50186  0.51221  0.63291
3  0.04901  0.31203  0.11035      NaN      NaN
4  0.43104  0.62413  0.20567      NaN      NaN 
5  0.43412  0.13720  0.11052      NaN      NaN
6  0.14512  0.10532  0.05310      NaN      NaN

366

asked Apr 01 '16 21:04

Lukasz

2 Answers

The only way you can do that is with:

df.join(pd.DataFrame(test).rename(columns={'One':'D','Two':'E'}))

          A       B       C       D       E
0   0.46324 0.32425 0.42194 0.23413 0.01293
1   0.10596 0.35910 0.21004 0.19235 0.12235
2   0.69209 0.12951 0.50186 0.51221 0.63291
3   0.04901 0.31203 0.11035     NaN     NaN
4   0.43104 0.62413 0.20567     NaN     NaN
5   0.43412 0.13720 0.11052     NaN     NaN
6   0.14512 0.10532 0.05310     NaN     NaN

because as @Alexander mentioned correctly the number of rows being concatenated should match. Otherwise, as in your case, missing rows will be filled with NaN

answered Oct 13 '22 21:10

Sergey Bushmanov

Assuming you want to add them as rows:

>>> pd.concat([df, pd.DataFrame(test.values(), columns=df.columns)], ignore_index=True)
         A        B        C
0  0.46324  0.32425  0.42194
1  0.10596  0.35910  0.21004
2  0.69209  0.12951  0.50186
3  0.04901  0.31203  0.11035
4  0.43104  0.62413  0.20567
5  0.43412  0.13720  0.11052
6  0.14512  0.10532  0.05310
7  0.01293  0.12235  0.63291
8  0.23413  0.19235  0.51221

If added as new columns:

df_new = pd.concat([df, pd.DataFrame(test.values()).T], ignore_index=True, axis=1)
df_new.columns = \
    df.columns.tolist() + [{'One': 'D', 'Two': 'E'}.get(k) for k in test.keys()]

>>> df_new
         A        B        C        E        D
0  0.46324  0.32425  0.42194  0.01293  0.23413
1  0.10596  0.35910  0.21004  0.12235  0.19235
2  0.69209  0.12951  0.50186  0.63291  0.51221
3  0.04901  0.31203  0.11035      NaN      NaN
4  0.43104  0.62413  0.20567      NaN      NaN
5  0.43412  0.13720  0.11052      NaN      NaN
6  0.14512  0.10532  0.05310      NaN      NaN

Order is not guaranteed in dictionaries (e.g. test), so the new column names actually need to be mapped to the keys.

answered Oct 13 '22 20:10

Alexander

Related questions
                            
                                Conditional field requirement with DjangoRestFramework serializer
                            
                                Save XLSX file to a specified location using OpenPyXL
                            
                                How to properly update requests in Ubuntu 14.04
                            
                                How can SciKit-Learn Random Forest sub sample size may be equal to original training data size?
                            
                                Pandas : vectorized operations on maximum values per row
                            
                                Google Cloud vision API: "Request had insufficient authentication scopes."
                            
                                Replace some specific values in pandas column based on conditions in other column
                            
                                how to choose initial centroids for k-means clustering
                            
                                finding the max of a column in an array
                            
                                Sorting child elements with lxml based on attribute value
                            
                                Contourf on the faces of a Matplotlib cube
                            
                                How Does String Conversion Between PyUnicode String and C String Work? [closed]
                            
                                How to remove username field in the register form on django admin?
                            
                                In SQLAlchemy, how do I query composite primary keys?
                            
                                Google Drive Python API; upload a media object which is NOT a file
                            
                                Python equivalent to Matlab funciton 'imfill' for grayscale?
                            
                                TypeError: object is not subscriptable
                            
                                How can I get only the latest file/files created/modified on S3 location through python
                            
                                How to use oauth2 to access StackExchange API?
                            
                                Cloning a private repo using HTTPS with gitpython

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas concat dictionary to dataframe

Tags:

python

dictionary

pandas

Lukasz

People also ask

2 Answers

Sergey Bushmanov

Alexander

Recent Activity

Donate For Us