Using Pandas to Iteratively Add Columns to a Dataframe

Tags:

I have some relatively simple code that I'm struggling to put together. I have a CSV that I've read into a dataframe. The CSV is panel data (i.e., unique company and year observations for each row). I have two columns that I want to perform a function on and then I want to create new variables based on the output of the function.

Here's what I have so far with code:

#Loop through rows in a CSV file
for index, rows in df.iterrows():
    #Start at column 6 and go to the end of the file
    for row in rows[6:]:
        data = perform_function1( row )
        output =  perform_function2(data)    
        df.ix[index, 'new_variable'] = output
        print output

I want this code to iterate starting in column 6 and then going to the end of the file (e.g., I have two columns I want to perform the function on Column6 and Column7) and then create new columns based on the functions that were performed (e.g., Output6 and Output7). The code above returns the output for Column7, but I can't figure out how to create a variable that allows me to capture the outputs from both columns (i.e., a new variable that isn't overwritten by loop). I searched Stackoverflow and didn't see anything that immediately related to my question (maybe because I'm too big of a noob?). I would really appreciate your help.

Thanks,

P.S. I'm not sure if I've provided enough detail. Please let me know if I need to provide more.

835

asked Jun 12 '15 20:06

TaterTots

2 Answers

Operating iteratively doesn't take advantage of Pandas' capabilities. Pandas' strength is in applying operations efficiently across the whole dataframe, rather than in iterating row by row. It's great for a task like this where you want to chain a few functions across your data. You should be able to accomplish your whole task in a single line.

df["new_variable"] = df.ix[6:].apply(perform_function1).apply(perform_function2)

perform_function1 will be applied to each row, and perform_function2 will be applied to the results of the first function.

115

answered Oct 19 '22 12:10

ASGM

If you want to apply function to certain columns in a dataframe

# Get the Series
colmun6 = df.ix[:, 5]  
# perform_function1 applied to each row
output6 = column6.apply(perform_function1)  
df["new_variable"] = output6

answered Oct 19 '22 11:10

GeauxEric

Related questions
                            
                                Reshaping pandas DataFrame from Meshgrid
                            
                                SQLAlchemy quoting of table names - Can't redefine 'quote' or 'quote_schema' arguments
                            
                                CSRF verification fails when trying to login in an already logged in application Django
                            
                                gdata spreadsheet library for python not working anymore?
                            
                                Getting method calls and their arguments from method object
                            
                                Algorithm equalivence from Matlab to Python
                            
                                Multiline comments in Kivy
                            
                                NumPy: How to avoid this loop?
                            
                                List of coordinates to matrix of distances
                            
                                How to Pivot in Google BigQuery [duplicate]
                            
                                Unable to access files from public s3 bucket with boto
                            
                                openpyxl returning empty cell values for formula series
                            
                                python pandas read_excel returns UnicodeDecodeError on describe()
                            
                                Matplotlib Legend Guide basic examples
                            
                                How to get an arbitrary element from a frozenset?
                            
                                How to convert a binary (string) into a float value?
                            
                                Create and download a CSV file from a Flask view
                            
                                Cycle through list starting at a certain element
                            
                                PyQt4 File select widget
                            
                                how to return None if constructor arguments invalid

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using Pandas to Iteratively Add Columns to a Dataframe

Tags:

python

loops

pandas

dataframe

TaterTots

People also ask

2 Answers

ASGM

GeauxEric

Recent Activity

Donate For Us