Renaming months from number to name in pandas

Tags: python, python-3.x, pandas

I have the following dataframe:

                  High          Low         Open        Close        Volume    Adj Close    year    pct_day
month day
1     1            NaN          NaN          NaN          NaN           NaN          NaN  2010.0   0.000000
      2    7869.853149  7718.482498  7779.655014  7818.089966  7.471689e+07  7818.089966  2010.0   0.007826
      3    7839.965652  7719.758224  7775.396255  7777.940002  8.185879e+07  7777.940002  2010.0   0.002582
      4    7747.175260  7624.540007  7691.152083  7686.288672  1.018877e+08  7686.288672  2010.0  -0.000744
      5    7348.487095  7236.742135  7317.313616  7287.688546  1.035424e+08  7287.688546  2010.0  -0.002499
...   ...          ...          ...          ...          ...           ...          ...     ...        ...
12    27   7849.846680  7760.222526  7810.902051  7798.639258  4.678145e+07  7798.639258  2009.5  -0.000833
      28   7746.209996  7678.152204  7713.497907  7710.449358  4.187133e+07  7710.449358  2009.5   0.000578
      29   7357.001540  7291.827806  7319.393874  7338.938345  4.554891e+07  7338.938345  2009.5   0.003321
      30   7343.726938  7276.871507  7322.123779  7302.545316  3.967812e+07  7302.545316  2009.5  -0.000312
      31           NaN          NaN          NaN          NaN           NaN          NaN  2009.5   0.000000

In case the pasted dataframe above is hard to read, here is a snapshot:

[image: screenshot of the dataframe]

The months are shown as 1, 2, 3, ... Is it possible to rename the month index to the Jan, Feb, Mar format?

Edit:

I am having a hard time implementing the example by @ChihebNexus.

My code is as follows, since the index is a datetime:

full_dates = pd.date_range(start, end)
data = data.reindex(full_dates)
data['year'] = data.index.year
data['month'] = data.index.month
data['week'] = data.index.week
data['day'] = data.index.day
data.set_index('month',append=True,inplace=True)
data.set_index('week',append=True,inplace=True)
data.set_index('day',append=True,inplace=True)
df = data.groupby(['month', 'day']).mean()

asked May 16 '20 19:05

Slartibartfast

2 Answers

I would do it using calendar and pd.CategoricalDtype to ensure sorting works correctly.

import calendar

import numpy as np
import pandas as pd

# Create a dummy dataframe indexed by (month number, day of month)
dateindx = pd.date_range('2019-01-01', '2019-12-31', freq='D')

df = pd.DataFrame(np.random.randint(0, 1000, (len(dateindx), 5)),
                  index=pd.MultiIndex.from_arrays([dateindx.month, dateindx.day]),
                  columns=['High', 'Low', 'Open', 'Close', 'Volume'])

# Use the calendar library for abbreviations and order: {0: '', 1: 'Jan', ...}
dd = dict(enumerate(calendar.month_abbr))

# Rename level zero of the MultiIndex
df = df.rename(index=dd, level=0)

# Create an ordered calendar-month dtype so that sorting follows calendar order
cal_dtype = pd.CategoricalDtype(list(calendar.month_abbr), ordered=True)

# Change the dtype of the level-zero index
df.index = df.index.set_levels(df.index.levels[0].astype(cal_dtype), level=0)
df

Output:

        High  Low  Open  Close  Volume
Jan 1    501  720   671    943     586
    2    410   67   207    945     284
    3    473  481   527    415     852
    4    157  809   484    592     894
    5    294   38   458     62     945
...      ...  ...   ...    ...     ...
Dec 27   305  354   347      0     726
    28   764  987   564    260      72
    29   730  151   846    137     118
    30   999  399   634    674      81
    31   347  980   441    600     676

[365 rows x 5 columns]
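
To illustrate why the ordered categorical matters, here is a minimal sketch on a toy frame. The frame `toy` and the variable names are made up for this illustration, and the category list drops the empty first entry of `calendar.month_abbr`. With plain string labels, `sort_index` orders the months alphabetically. With the ordered categorical, it follows the calendar.

```python
import calendar

import pandas as pd

# Hypothetical toy frame: three months, deliberately out of calendar order
idx = pd.MultiIndex.from_arrays([[3, 1, 2], [1, 1, 1]], names=['month', 'day'])
toy = pd.DataFrame({'Close': [30, 10, 20]}, index=idx)

# Plain string labels: sort_index orders the months alphabetically
names = toy.rename(index=dict(enumerate(calendar.month_abbr)), level=0)
alphabetical = list(names.sort_index().index.get_level_values(0))

# Ordered categorical labels: sort_index follows the category (calendar) order
cal_dtype = pd.CategoricalDtype(list(calendar.month_abbr)[1:], ordered=True)
cat = names.copy()
cat.index = cat.index.set_levels(cat.index.levels[0].astype(cal_dtype), level=0)
by_calendar = list(cat.sort_index().index.get_level_values(0))

print(alphabetical)  # ['Feb', 'Jan', 'Mar']
print(by_calendar)   # ['Jan', 'Feb', 'Mar']
```

The month names come from `calendar.month_abbr`, so they depend on the active locale. The outputs shown assume the default English names.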

answered Oct 18 '22 22:10

Scott Boston

I see that you're having a hard time implementing my answer in your code, so I've made this update to show how it fits into the code snippet you added to your question. Here is an example:

from datetime import datetime
import pandas as pd


start = '1/4/2020'
end = '3/5/2020'

data = pd.DataFrame()
full_dates = pd.date_range(start, end)
data = data.reindex(full_dates)
data['year'] = data.index.year
data['month'] = data.index.month
# DatetimeIndex.week was removed in pandas 2.0; isocalendar().week (pandas >= 1.1) replaces it
data['week'] = data.index.isocalendar().week
data['day'] = data.index.day
data.set_index('month', append=True, inplace=True)
data.set_index('week', append=True, inplace=True)
data.set_index('day', append=True, inplace=True)
df = data.groupby(['month', 'day']).mean()
# Iterating over the MultiIndex yields (month, day) tuples;
# '%b' formats each month number as its abbreviated name
df = df.set_index(pd.MultiIndex.from_tuples([
    ('{:%b}'.format(datetime.strptime(str(month), '%m')), day)
    for month, day in df.index
], names=['month', 'day']))
print(df)

Output:

           year
month day      
Jan   4    2020
      5    2020
      6    2020
      7    2020
      8    2020
...         ...
Mar   1    2020
      2    2020
      3    2020
      4    2020
      5    2020

[62 rows x 1 columns]
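
For comparison, here is a shorter sketch of the same renaming. It is a different technique from the code above: it swaps the labels of the `month` level with `MultiIndex.set_levels` and looks the names up in `calendar.month_abbr`, instead of rebuilding the index from tuples. The frame is a hypothetical stand-in for the grouped result, built directly with a `(month, day)` index.

```python
import calendar

import pandas as pd

# Hypothetical stand-in for the grouped frame: a (month, day) MultiIndex
dates = pd.date_range('2020-01-04', '2020-03-05')
df = pd.DataFrame(
    {'year': dates.year},
    index=pd.MultiIndex.from_arrays([dates.month, dates.day],
                                    names=['month', 'day']),
)

# Replace only the labels of the 'month' level; row order and 'day' are untouched
month_names = [calendar.month_abbr[int(m)] for m in df.index.levels[0]]
df.index = df.index.set_levels(month_names, level='month')

print(df.head())
```

Because only the level labels are replaced, this does not loop over every row. Like `'%b'`, the names in `calendar.month_abbr` depend on the active locale.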

answered Oct 18 '22 21:10

Chiheb Nexus
