How can one go about finding the last occurring non zero element in every column of a dataframe? Input <pre class="prettyprint"><code> A B 0 0 1 1 0 2 2 9 0 3 10 0 4 0 0 5 0 0 </code></pre> Output <pre class="prettyprint"><code> A B 0 10 2 </code></pre>

You can convert <code>0</code> to missing values, use forward filling and select last row by indexing, last cast to integer: <pre class="prettyprint"><code>df = df.mask(df==0).ffill().iloc[[-1]].astype(int) print (df) A B 5 10 2 </code></pre>

Something like: <pre class="prettyprint"><code>results = {} for column in df.columns: results[column] = df.loc[df[column]!=0, column].iloc[-1] </code></pre> This will make a dictionary with all columns as keys and they last non-zero values as values. EDIT: If you want it in a dataframe, plus dict comprehension for one-liner: <pre class="prettyprint"><code>results = pd.DataFrame({column:[df.loc[df[column]!=0, column].iloc[-1]] for column in df.columns}) </code></pre>

Loop over the columns then the rows and store the last non zero variable <pre class="prettyprint"><code>list = []* number_of_columns for i in range(len(df)): dfcolumn = df[:,i] for item in dfcolumn: if item != 0: list[i] = [i, item] print(list) </code></pre>

How to find the last non zero element in every column throughout dataframe?

Tags:

python

pandas

dataframe

How can one go about finding the last occurring non zero element in every column of a dataframe?

Input

Output

    A  B
0  10  2

830

asked Jun 19 '19 11:06

deeraf

4 Answers

You can convert 0 to missing values, use forward filling and select last row by indexing, last cast to integer:

df = df.mask(df==0).ffill().iloc[[-1]].astype(int)
print (df)
    A  B
5  10  2

129

answered Oct 20 '22 15:10

jezrael

Here's one approach using ndarray.argmax and advanced indexing:

first_max = df.values[df.ne(0).values.argmax(0), range(df.shape[1])]
out = pd.DataFrame([first_max], columns=df.columns)

df = pd.DataFrame({'A': [0,0,0,10,0,0] , 'B': [0,2,0,0,0,0]})

first_max = df.values[df.ne(0).values.argmax(0), range(df.shape[1])]
# array([10,  2])
pd.DataFrame([first_max], columns=df.columns)

    A  B
0  10  2

Update

In order to find the last nonzero:

row_ix = df.shape[0]-df.ne(0).values[::-1].argmax(0)-1
first_max = df.values[row_ix, range(df.shape[1])]
out = pd.DataFrame([first_max], columns=df.columns)

answered Oct 20 '22 14:10

yatu

Something like:

results = {}
for column in df.columns:
    results[column] = df.loc[df[column]!=0, column].iloc[-1]

This will make a dictionary with all columns as keys and they last non-zero values as values.

EDIT: If you want it in a dataframe, plus dict comprehension for one-liner:

results = pd.DataFrame({column:[df.loc[df[column]!=0, column].iloc[-1]] for column in df.columns})

answered Oct 20 '22 13:10

Jim Eisenberg

Loop over the columns then the rows and store the last non zero variable

list = []* number_of_columns
for i in range(len(df)):
    dfcolumn = df[:,i]
    for item in dfcolumn:
        if item !=  0:
            list[i] = [i, item]

print(list)

answered Oct 20 '22 13:10

Jkind9

Related questions
                            
                                Flask-sqlalchemy disable autoflush for the whole session
                            
                                Extracting Pylint Score
                            
                                Python: Accessing YAML values using "dot notation"
                            
                                pandas remove seconds from datetime index
                            
                                How to install numpy+mkl for python 2.7 on windows 64 bit?
                            
                                Trained Machine Learning model is too big
                            
                                How to get rid of warning "DeprecationWarning generator 'ngrams' raised StopIteration"
                            
                                Converting list of Arrays to list of Lists?
                            
                                Sum a list of Pandas DataFrames
                            
                                Specific way of requiring one of two fields in django model definition
                            
                                __str__ method not working when objects are inside a list or dict
                            
                                Pandas group by weekday (M/T/W/T/F/S/S)
                            
                                PyQt5 QImage from Numpy Array
                            
                                Alexa Skill Development using flask-ask and ngrok
                            
                                Python installer for Windows: disable path length limit option not available
                            
                                How to stop my pandas data table from being truncated when printed?
                            
                                Return Pandas multiindex as list of tuples?
                            
                                Store Excel file exported from Pandas in AWS
                            
                                Let's Encrypt certbot-auto fails because a Python / pip problem
                            
                                Append values from dataframe column to list

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With