pandas get data for the end day of month?

Tags:

The data is given as following:

             return 
2010-01-04  0.016676    
2010-01-05  0.003839
...
2010-01-05  0.003839
2010-01-29  0.001248
2010-02-01  0.000134
...

What I want get is to extract all value that is the last day of month appeared in the data .

2010-01-29  0.00134
2010-02-28  ......

If I directly use pandas.resample, i.e., df.resample('M).last(). I would select the correct rows with the wrong index. (it automatically use the last day of the month as the index)

2010-01-31  0.00134
2010-02-28  ......

How can I get the correct answer in a Pythonic way?

408

asked May 18 '18 18:05

MTANG

1 Answers

An assumption made here is that your date data is part of the index. If not, I recommend setting it first.

Single Year

I don't think the resampling or grouper functions would do. Let's group on the month number instead and call DataFrameGroupBy.tail.

df.groupby(df.index.month).tail(1)

Multiple Years

If your data spans multiple years, you'll need to group on the year and month. Using a single grouper created from dt.strftime—

df.groupby(df.index.strftime('%Y-%m')).tail(1)

Or, using multiple groupers—

df.groupby([df.index.year, df.index.month]).tail(1)

Note—if your index is not a DatetimeIndex as assumed here, you'll need to replace df.index with pd.to_datetime(df.index, errors='coerce') above.

163

answered Sep 18 '22 04:09

cs95

Related questions
                            
                                Flask database migrations on heroku
                            
                                BeautifulSoup and class with spaces
                            
                                django.db.utils.IntegrityError: duplicate key value violates unique constraint "auth_permission_pkey"
                            
                                How to bind enter key to a tkinter button
                            
                                Why is a computation much slower within a Dask/Distributed worker?
                            
                                'function' object has no attribute 'assert_called_once_with'
                            
                                additional row colors in seaborn cluster map
                            
                                Python: Lib to use epoll if available, fallback to select
                            
                                Convert Google Vision API response to JSON
                            
                                Longest Common Subsequence in Python
                            
                                What's the difference between data time major and batch major?
                            
                                User input boolean in python
                            
                                Pandas split on regex
                            
                                map function run into infinite loop in 3.X
                            
                                How to open a Chrome Profile through Python
                            
                                Vectorized way to count occurrences of string in either of two columns
                            
                                get index of the first block of at least n consecutive False values in boolean array
                            
                                convert dict of dict to dataframe in pandas
                            
                                understanding level =0 and group_keys
                            
                                How fetch latest records using find_one in pymongo

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas get data for the end day of month?

Tags:

python

pandas

dataframe