Best way to add dictionary to dataframe

Tags:

I have a Pandas Dataframe and want to add the data from a dictionary uniformly to all rows in my dataframe. Currently I loop over the dictionary and set the value to my new columns. Is there a more efficient way to do this?

notebook

# coding: utf-8    
import pandas as pd

df = pd.DataFrame({'age' : [1, 2, 3],'name' : ['Foo', 'Bar', 'Barbie']}) 
d = {"blah":42,"blah-blah":"bar"}
for k,v in d.items():
    df[k] = v
df

934

asked Apr 13 '18 13:04

Rutger Hofste

2 Answers

Use assign if all keys are not numeric:

df = df.assign(**d)
print (df)
   age    name  blah blah-blah
0    1     Foo    42       bar
1    2     Bar    42       bar
2    3  Barbie    42       bar

If possible numeric join working nice:

d = {8:42,"blah-blah":"bar"}
df = df.join(pd.DataFrame(d, index=df.index))
print (df)

   age    name   8 blah-blah
0    1     Foo  42       bar
1    2     Bar  42       bar
2    3  Barbie  42       bar

109

answered Oct 17 '22 19:10

jezrael

The answer in my opinion is no. Looping through key,values in a dict is already efficient and assigning columns with df[k] = v is more readable. Remember that in the future you just want to remember why you did something and you won't care much if you spare some microseconds. The only thing missing is a comment why you do it.

d = {"blah":42,"blah-blah":"bar"}

# Add columns to compensate for missing values in document XXX
for k,v in d.items():
    df[k] = v

Timings (but the error is too big... I'd say they are equivalent in speed):

Your solution:

809 µs ± 70 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

df.assign():

893 µs ± 24.2 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

answered Oct 17 '22 19:10

Anton vBR

Related questions
                            
                                Return index value as string
                            
                                self-join with Pandas
                            
                                Cast NumPy array to/from custom C++ Matrix-class using pybind11
                            
                                Writing a python method that refers to both the instance and the class
                            
                                How to name a Pandas Series
                            
                                Does a plotly dash dashboard publish data online?
                            
                                Converting HDF5 to Parquet without loading into memory
                            
                                Comparison: import statement vs __import__ function
                            
                                Pandas add column from one dataframe to another based on a join
                            
                                How to find pyspark dataframe memory usage?
                            
                                How can I tell if a dataframe is of mixed type?
                            
                                Statsmodels seasonal_decompose - what is naive about it?
                            
                                Check if elements occur together in all lists?
                            
                                How to create a square dataframe/matrix given 3 columns - Python
                            
                                multiprocessing.Pipe is even slower than multiprocessing.Queue?
                            
                                How to implement SMOTE in cross validation and GridSearchCV
                            
                                python: perform gdalwarp in memory with gdal bindings
                            
                                Do I need to add my project directory to the system path in every script to import a function from another directory?
                            
                                python how to run process in detached mode
                            
                                How to run a coroutine and wait it result from a sync func when the loop is running?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Best way to add dictionary to dataframe

Tags:

python

pandas

Rutger Hofste

People also ask

2 Answers

jezrael

Anton vBR

Recent Activity

Donate For Us