How to loop over grouped Pandas dataframe?

People also ask

Can you loop through pandas DataFrame?

DataFrame Looping (iteration) with a for statement. You can loop over a pandas dataframe, for each column row by row. Below pandas. Using a DataFrame as an example.

How do you iterate over rows in a data frame?

In case you still want/have to iterate over a DataFrame or Series, you can use iterrows() or itertuples() methods.

How do you loop a panda series?

iteritems() function iterates over the given series object. the function iterates over the tuples containing the index labels and corresponding value in the series. Example #1: Use Series. iteritems() function to iterate over all the elements in the given series object.

df.groupby('l_customer_id_i').agg(lambda x: ','.join(x)) does already return a dataframe, so you cannot loop over the groups anymore.

In general:

df.groupby(...) returns a GroupBy object (a DataFrameGroupBy or SeriesGroupBy), and with this, you can iterate through the groups (as explained in the docs here). You can do something like:
```
grouped = df.groupby('A')

for name, group in grouped:
    ...
```
When you apply a function on the groupby, in your example df.groupby(...).agg(...) (but this can also be transform, apply, mean, ...), you combine the result of applying the function to the different groups together in one dataframe (the apply and combine step of the 'split-apply-combine' paradigm of groupby). So the result of this will always be again a DataFrame (or a Series depending on the applied function).

Here is an example of iterating over a pd.DataFrame grouped by the column atable. For this sample, "create" statements for an SQL database are generated within the for loop:

import pandas as pd

df1 = pd.DataFrame({
    'atable':     ['Users', 'Users', 'Domains', 'Domains', 'Locks'],
    'column':     ['col_1', 'col_2', 'col_a', 'col_b', 'col'],
    'column_type':['varchar', 'varchar', 'int', 'varchar', 'varchar'],
    'is_null':    ['No', 'No', 'Yes', 'No', 'Yes'],
})

df1_grouped = df1.groupby('atable')

# iterate over each group
for group_name, df_group in df1_grouped:
    print('\nCREATE TABLE {}('.format(group_name))

    for row_index, row in df_group.iterrows():
        col = row['column']
        column_type = row['column_type']
        is_null = 'NOT NULL' if row['is_null'] == 'No' else ''
        print('\t{} {} {},'.format(col, column_type, is_null))

    print(");")

You can iterate over the index values if your dataframe has already been created.

df = df.groupby('l_customer_id_i').agg(lambda x: ','.join(x))
for name in df.index:
    print name
    print df.loc[name]

Related questions
                            
                                Getting individual colors from a color map in matplotlib
                            
                                What is the best way to exit a function (which has no return value) in python before the function ends (e.g. a check fails)?
                            
                                ImportError: No module named matplotlib.pyplot
                            
                                How to run an .ipynb Jupyter Notebook from terminal?
                            
                                Display a decimal in scientific notation
                            
                                Common xlabel/ylabel for matplotlib subplots
                            
                                Replacing Pandas or Numpy Nan with a None to use with MysqlDB
                            
                                Python "SyntaxError: Non-ASCII character '\xe2' in file" [duplicate]
                            
                                How to retrieve inserted id after inserting row in SQLite using Python?
                            
                                What does the slash mean in help() output?
                            
                                What is the maximum float in Python?
                            
                                Windows path in Python
                            
                                TypeError: a bytes-like object is required, not 'str' in python and CSV
                            
                                How can I return two values from a function in Python?
                            
                                Encoding an image file with base64
                            
                                Lists in ConfigParser
                            
                                What is the difference between a pandas Series and a single-column DataFrame?
                            
                                Python 3.x rounding behavior
                            
                                Usage of sys.stdout.flush() method
                            
                                How to do a scatter plot with empty circles in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to loop over grouped Pandas dataframe?

Tags:

python

iteration

pandas

dataframe

pandas-groupby

People also ask

Recent Activity

Donate For Us