Convert one-hot encoded data-frame columns into one column

Tags:

In the pandas data frame, the one-hot encoded vectors are present as columns, i.e:

Rows   A  B  C  D  E

0      0  0  0  1  0
1      0  0  1  0  0
2      0  1  0  0  0
3      0  0  0  1  0
4      1  0  0  0  0
4      0  0  0  0  1

How to convert these columns into one data frame column by label encoding them in python? i.e:

Also need suggestion on this that some rows have multiple 1s, how to handle those rows because we can have only one category at a time.

246

asked Jul 31 '20 17:07

3 Answers

Try with argmax

#df=df.set_index('Rows')

df['New']=df.values.argmax(1)+1
df
Out[231]: 
      A  B  C  D  E  New
Rows                    
0     0  0  0  1  0    4
1     0  0  1  0  0    3
2     0  1  0  0  0    2
3     0  0  0  1  0    4
4     1  0  0  0  0    1
4     0  0  0  0  1    5

196

answered Sep 22 '22 14:09

anky

Also need suggestion on this that some rows have multiple 1s, how to handle those rows because we can have only one category at a time.

In this case you dot your DataFrame of dummies with an array of all the powers of 2 (based on the number of columns). This ensures that the presence of any unique combination of dummies (A, A+B, A+B+C, B+C, ...) will have a unique category label. (Added a few rows at the bottom to illustrate the unique counting)

df['Category'] = df.dot(2**np.arange(df.shape[1]))

      A  B  C  D  E  Category
Rows                         
0     0  0  0  1  0         8
1     0  0  1  0  0         4
2     0  1  0  0  0         2
3     0  0  0  1  0         8
4     1  0  0  0  0         1
5     0  0  0  0  1        16
6     1  0  0  0  1        17
7     0  1  0  0  1        18
8     1  1  0  0  1        19

answered Sep 22 '22 14:09

ALollz

Related questions
                            
                                Sort Dict by Values in Python 3.6+
                            
                                Unknown format code 'f' for object of type 'str'- Folium
                            
                                What is the C++ equivalent of python collections.Counter?
                            
                                "Insert Into" statement causing errors due to "Parameter 7 (""): The supplied value is not a valid instance of data type float."
                            
                                "ValueError: could not convert string to float" when converting input
                            
                                python tuple and enum
                            
                                How to match pairs of values contained in two numpy arrays
                            
                                Running a Tkinter window and PysTray Icon together
                            
                                PSYCHOPY Error: AttributeError: module 'logging' has no attribute 'getLogger'
                            
                                pca.inverse_transform in sklearn
                            
                                How do I solve error "no module found named pyside2"?
                            
                                pgAdmin4 Query Error "not enough values to unpack (expected 5, got 4)"
                            
                                How to avoid overlapping error bars in matplotlib?
                            
                                kafka-python raise UnrecognizedBrokerVersion Error
                            
                                Unpack list of dictionaries in Python
                            
                                How to filter s3 objects by last modified date with Boto3
                            
                                Create blob container in azure storage if it does not exists
                            
                                "exec: "python": executable file not found in $PATH
                            
                                python, Windows 10: launching an application on a specific virtual desktop environment (work-spaces)
                            
                                Download attachment from mail using python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Convert one-hot encoded data-frame columns into one column

Tags:

python

pandas

dataframe

numpy

Eisha Tir Raazia

People also ask

3 Answers

BENY

anky

ALollz

Recent Activity

Donate For Us