I have a dataframe of surface weather observations (<code>fzraHrObs</code>) organized by a station identifier code and date. <code>fzraHrObs</code> has several columns of weather data. The station code and date (datetime objects) look like: <pre class="prettyprint"><code>usaf dat 716270 2014-11-23 12:00:00 2015-12-20 08:00:00 2015-12-20 09:00:00 2015-12-21 04:00:00 2015-12-28 03:00:00 716280 2015-12-19 08:00:00 2015-12-19 08:00:00 </code></pre> I would like to get a count of the number of unique dates (days) per year for each station - i.e. the number of days of obs per year at each station. In my example above this would give me: <pre class="prettyprint"><code> usaf Year Count 716270 2014 1 2015 3 716280 2014 0 2015 1 </code></pre> I've tried using groupby and grouping by station, year, and date: <code>grouped = fzraHrObs['dat'].groupby(fzraHrObs['usaf'], fzraHrObs.dat.dt.year, fzraHrObs.dat.dt.date])</code> Count, size, nunique, etc. on this just gives me the number of obs on each date, not the number of dates themselves per year. Any suggestions on getting what I want here?

Could be something like this, group the date by <code>usaf</code> and <code>year</code> and then count the number of unique values: <pre class="prettyprint"><code>import pandas as pd df.dat.apply(lambda dt: dt.date()).groupby([df.usaf, df.dat.apply(lambda dt: dt.year)]).nunique() # usaf dat # 716270 2014 1 # 2015 3 # 716280 2015 1 # Name: dat, dtype: int64 </code></pre>

The following should work: <pre class="prettyprint"><code>df.groupby(['usaf', df.dat.dt.year])['dat'].apply(lambda s: s.dt.date.nunique()) </code></pre> What I did differently is group by two levels only, then use the <code>nunique</code> method of pandas series to count the number of unique dates in each group.

Count unique dates in pandas dataframe

Tags:

python

pandas

I have a dataframe of surface weather observations (fzraHrObs) organized by a station identifier code and date. fzraHrObs has several columns of weather data. The station code and date (datetime objects) look like:

usaf      dat
716270    2014-11-23 12:00:00
          2015-12-20 08:00:00
          2015-12-20 09:00:00
          2015-12-21 04:00:00
          2015-12-28 03:00:00
716280    2015-12-19 08:00:00
          2015-12-19 08:00:00

I would like to get a count of the number of unique dates (days) per year for each station - i.e. the number of days of obs per year at each station. In my example above this would give me:

    usaf      Year     Count
    716270    2014     1
              2015     3
    716280    2014     0
              2015     1

I've tried using groupby and grouping by station, year, and date: grouped = fzraHrObs['dat'].groupby(fzraHrObs['usaf'], fzraHrObs.dat.dt.year, fzraHrObs.dat.dt.date])

Count, size, nunique, etc. on this just gives me the number of obs on each date, not the number of dates themselves per year. Any suggestions on getting what I want here?

659

asked Aug 10 '16 14:08

MeteoMtl

2 Answers

Could be something like this, group the date by usaf and year and then count the number of unique values:

import pandas as pd
df.dat.apply(lambda dt: dt.date()).groupby([df.usaf, df.dat.apply(lambda dt: dt.year)]).nunique()

#   usaf   dat 
# 716270  2014    1
#         2015    3
# 716280  2015    1
# Name: dat, dtype: int64

115

answered Sep 25 '22 10:09

Psidom

The following should work:

df.groupby(['usaf', df.dat.dt.year])['dat'].apply(lambda s: s.dt.date.nunique())

What I did differently is group by two levels only, then use the nunique method of pandas series to count the number of unique dates in each group.

answered Sep 26 '22 10:09

IanS

Related questions
                            
                                Non-monotonic memory consumption in Python2 dictionaries
                            
                                Can I insert a line into ruamel.yaml's CommentedMap?
                            
                                How to mock a property inside a class in Python
                            
                                How to install firefoxdriver webdriver for python3 selenium on ubuntu?
                            
                                concatenation of two or more base64 strings in python
                            
                                inserting millions of documents - mongo / pymongo - insert_many
                            
                                Django-registration resend activation Email with new code
                            
                                pip freeze: show only packages installed via pip
                            
                                ipython autoreload doesn't work
                            
                                AttributeError, 'dict' object has no attribute 'iteritems'; Flask-SQLAlchemy error while committing to database
                            
                                How does a Python module that contains class of same name work when imported?
                            
                                PySpark: retrieve mean and the count of values around the mean for groups within a dataframe
                            
                                How to resolve this error? "RestartFreqExceeded: 5 in 1s" in django+celery+rabbitmq+mysql+redis
                            
                                get encoding specified in magic line / shebang (from within module)
                            
                                Packages missing in current osx-64 and channels
                            
                                Pyglet HUD text location / scaling
                            
                                S3Cmd doesn't work with S3 Ninja
                            
                                Passing session from template view to python requests api call
                            
                                Is it possible to add a value named 'None' to enum type?
                            
                                How can I make np.save work for an ndarray subclass?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With