Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Seaborn regplot using datetime64 as the x axis

I have a dataframe looks like this:

date         score  
2017-06-04    90
2017-06-03    80
2017-06-02    70

When I tried this:

sns.regplot(x=date, y=score, data=df)

I got an error:

TypeError: reduction operation 'mean' not allowed for this dtype

The dtype for date is datetime64[ns], and int64 for the score column.

How can I covert the date column so that regplot will work?

like image 961
Cheng Avatar asked Jun 04 '17 13:06

Cheng


1 Answers

Seaborn doesn't support datetimes in regplot but here's an ugly hack:

df = df.sort_values('date')
df['date_f'] = pd.factorize(df['date'])[0] + 1
mapping = dict(zip(df['date_f'], df['date'].dt.date))

ax = sns.regplot('date_f', 'score', data=df)
labels = pd.Series(ax.get_xticks()).map(mapping).fillna('')
ax.set_xticklabels(labels)

produces

enter image description here

This is the main approach used in time-series regression. If you have daily data, you code day 1 as 1 and increase the number as the days go by. This assumes you have a regularly-spaced time series.

like image 149
ayhan Avatar answered Sep 17 '22 12:09

ayhan