Why doesn't first and last in a groupby give me first and last

Tags: python, pandas, group-by, pandas-groupby

I'm posting this because the topic just got brought up in another question/answer and the behavior isn't very well documented.

Consider the dataframe df

import numpy as np
import pandas as pd

df = pd.DataFrame(dict(
    A=list('xxxyyy'),
    B=[np.nan, 1, 2, 3, 4, np.nan]
))

   A    B
0  x  NaN
1  x  1.0
2  x  2.0
3  y  3.0
4  y  4.0
5  y  NaN

I wanted to get the first and last rows of each group defined by column 'A'.

I tried

df.groupby('A').B.agg(['first', 'last'])

   first  last
A             
x    1.0   2.0
y    3.0   4.0

However, this doesn't give me the np.nan values that I expected.

How do I get the actual first and last values in each group?

asked Aug 17 '17 20:08

piRSquared

1 Answer

As noted here by @unutbu:

The groupby.first and groupby.last methods return the first and last non-null values respectively.
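
A small sketch of what that sentence means in practice, assuming the same df as in the question: 'first' gives the same result as dropping the null rows and then taking the element at position 0 in each group. The names skipping and dropped_then_first are only illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(dict(
    A=list('xxxyyy'),
    B=[np.nan, 1, 2, 3, 4, np.nan]
))

# Built-in 'first' skips nulls inside each group
skipping = df.groupby('A').B.first()

# The same idea spelled out: drop null rows, then take position 0 per group
dropped_then_first = (
    df.dropna(subset=['B'])
      .groupby('A').B
      .agg(lambda s: s.iloc[0])
)

print(skipping.tolist())            # [1.0, 3.0]
print(dropped_then_first.tolist())  # [1.0, 3.0]
```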

To get the actual first and last values, do:

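def h(x):
    # Positionally first value in the group, NaN included
    return x.values[0]

def t(x):
    # Positionally last value in the group, NaN included
    return x.values[-1]

df.groupby('A').B.agg([h, t])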

     h    t
A          
x  NaN  2.0
y  3.0  NaN
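
Since the question asks for the first and last rows of each group, another option worth sketching is head(1) and tail(1) on the groupby. These select by position, so null values are kept, and they return whole rows with the original index rather than one row per group key. The names first_rows and last_rows are only illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(dict(
    A=list('xxxyyy'),
    B=[np.nan, 1, 2, 3, 4, np.nan]
))

g = df.groupby('A')

# Positional selection: the first and last row of each group, NaN kept
first_rows = g.head(1)
last_rows = g.tail(1)

print(first_rows.index.tolist())  # [0, 3]
print(last_rows.index.tolist())   # [2, 5]
```

If you want the group key as the index, first_rows.set_index('A') gets you there. As far as I know, pandas 2.2.1 and later also accept skipna=False on first and last; check the documentation for your installed version before relying on it.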

answered Sep 28 '22 17:09

piRSquared
