Iteration over the rows of a Pandas DataFrame as dictionaries

Tags: performance, python, pandas

I need to iterate over a pandas DataFrame and pass each row as the arguments of a function (actually, a class constructor) using **kwargs. This means that each row should behave as a dictionary whose keys are the column names and whose values are the corresponding values in that row.

This works, but it performs very badly:

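import pandas as pd


def myfunc(**kwargs):
    # Compute the area from the 'length' and 'width' keyword arguments.
    try:
        area = kwargs.get('length', 0) * kwargs.get('width', 0)
        return area
    except TypeError:
        return 'Error : length and width should be int or float'


df = pd.DataFrame({'length': [1, 2, 3], 'width': [10, 20, 30]})

# Slow: positional indexing builds a new Series for every row.
for i in range(len(df)):
    # Parenthesized single-argument print works on Python 2.7 and Python 3.
    print(myfunc(**df.iloc[i]))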

Any suggestions on how to make this faster? I have tried iterating with df.iterrows(), but I get the following error:

TypeError: myfunc() argument after ** must be a mapping, not tuple

I have also tried df.itertuples() and df.values, but either I am missing something, or I have to convert each tuple / np.array to a pd.Series or dict, which will also be slow. My constraint is that the script has to work with Python 2.7 and pandas 0.14.1.

578

asked Nov 14 '18 09:11

Matina G

2 Answers

One clean option is to convert the DataFrame to a list of row dictionaries:

for row_dict in df.to_dict(orient="records"):
    print(row_dict['column_name'])
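
Applied to the question's example, this could look like the sketch below. It assumes a simplified version of the asker's myfunc (without the error handling) and a pandas version in which to_dict accepts the orient keyword. Each element of the returned list is a plain dict, so it can be unpacked directly with **.

```python
import pandas as pd


def myfunc(**kwargs):
    # Simplified version of the question's function, for illustration only.
    return kwargs.get('length', 0) * kwargs.get('width', 0)


df = pd.DataFrame({'length': [1, 2, 3], 'width': [10, 20, 30]})

# Convert the whole frame once, then unpack each plain dict as keyword arguments.
records = df.to_dict(orient="records")
areas = [myfunc(**row_dict) for row_dict in records]
```

The conversion happens once for the whole DataFrame, so the loop itself only handles ordinary dictionaries. Whether that is faster than the iloc loop for a given dataset is worth measuring rather than assuming.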

158

answered Oct 16 '22 05:10

avloss

You can try:

for k, row in df.iterrows():
    myfunc(**row)

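Here k is the DataFrame index label and row is a pandas Series, which behaves like a dict in this context: it can be unpacked with ** and you can access any column with row["my_column_name"]. Note that iterrows() yields (index, row) tuples, so the tuple has to be unpacked in the for statement; passing the tuple itself to ** raises the TypeError shown in the question.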
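
As a minimal sketch of that unpacking, again assuming a simplified myfunc, the block below runs the iterrows() loop on the question's data. It also shows one possible alternative, not taken from either answer: building plain dicts from itertuples(index=False) with zip, which avoids creating a Series per row.

```python
import pandas as pd


def myfunc(**kwargs):
    # Simplified version of the question's function, for illustration only.
    return kwargs.get('length', 0) * kwargs.get('width', 0)


df = pd.DataFrame({'length': [1, 2, 3], 'width': [10, 20, 30]})

# iterrows() yields (index, Series) pairs; unpack the pair before using **.
areas_iterrows = [myfunc(**row) for _, row in df.iterrows()]

# Alternative: pair the column names with each row tuple to get a plain dict.
columns = list(df.columns)
areas_tuples = [
    myfunc(**dict(zip(columns, values)))
    for values in df.itertuples(index=False)
]
```

Both loops pass the same keyword arguments to myfunc. The second one only touches tuples and dicts inside the loop, which may matter for large frames, but timing it on real data is the only reliable check.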

answered Oct 16 '22 04:10

stellasia
