Categorical dtype changes after using melt

Tags:

python

pandas

In answering this question, I found that after using melt on a pandas dataframe, a column that was previously an ordered Categorical dtype becomes an object. Is this intended behaviour?

Note: not looking for a solution, just wondering if there is any reason for this behaviour or if it's not intended behavior.

Example:

Using the following dataframe df:

  Cat  L_1  L_2  L_3
0   A    1    2    3
1   B    4    5    6
2   C    7    8    9

df['Cat'] = pd.Categorical(df['Cat'], categories = ['C','A','B'], ordered=True)

# As you can see `Cat` is a category
>>> df.dtypes
Cat    category
L_1       int64
L_2       int64
L_3       int64
dtype: object

melted = df.melt('Cat')

>>> melted
  Cat variable  value
0   A      L_1      1
1   B      L_1      4
2   C      L_1      7
3   A      L_2      2
4   B      L_2      5
5   C      L_2      8
6   A      L_3      3
7   B      L_3      6
8   C      L_3      9

Now, if I look at Cat, it's become an object:

>>> melted.dtypes
Cat         object
variable    object
value        int64
dtype: object

Is this intended?

746

asked Oct 28 '22 03:10

sacuL

1 Answers

In source code . 0.22.0(My old version)

 for col in id_vars:
        mdata[col] = np.tile(frame.pop(col).values, K)
     mcolumns = id_vars + var_name + [value_name]

Which will return the datatype object with np.tile.

It has been fixed in 0.23.4(After I update my pandas)

df.melt('Cat')
Out[6]: 
  Cat variable  value
0   A      L_1      1
1   B      L_1      4
2   C      L_1      7
3   A      L_2      2
4   B      L_2      5
5   C      L_2      8
6   A      L_3      3
7   B      L_3      6
8   C      L_3      9
df.melt('Cat').dtypes
Out[7]: 
Cat         category
variable      object
value          int64
dtype: object

More info how it fixed :

for col in id_vars:
    id_data = frame.pop(col)
    if is_extension_type(id_data): # here will return True , then become concat not np.tile
        id_data = concat([id_data] * K, ignore_index=True)
    else:
        id_data = np.tile(id_data.values, K)
    mdata[col] = id_data

192

answered Nov 15 '22 07:11

BENY

Related questions
                            
                                How to change the [[source]] for the Pipfile for better usage of pipenv?
                            
                                connecting multiple strings to path in python with slashes
                            
                                Django Multi value on ModelMultipleChoiceField
                            
                                Normalize Phone Numbers Using Python
                            
                                How can i extract the links from the site that contains pagination?(using selenium)
                            
                                `pandas.DataFrame.to_html()` without `table border` and `tr style`
                            
                                Pandas: Dynamically replace NaN values with the average of previous and next non-missing values
                            
                                dialogflow: 403 IAM permission 'dialogflow.sessions.detectIntent'
                            
                                Discord code is running multiple times for no reason
                            
                                How can I create a dead weakref in python?
                            
                                Setting environment variables for integrated terminal
                            
                                How to use multi threading in keras/tensorflow when fitting a model?
                            
                                ValueError: The columns in the computed data do not match the columns in the provided metadata
                            
                                A better way to make pytorch code agnostic to running on a CPU or GPU?
                            
                                How to get back a list from bytes in Python? [duplicate]
                            
                                SQLAlchemy convert SELECT query result to a list of dicts

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With