How to give column names after one-hot encoding with sklearn?

Tags: python, encoding, one-hot-encoding, scikit-learn

Here is my question; I hope someone can help me figure it out.

To explain: my data set has more than 10 categorical columns, and each of them has 200-300 categories. I want to convert them into binary values. To do that, I first used LabelEncoder to convert the string categories into numbers. The LabelEncoder code and its output are shown below.

After LabelEncoder, I used OneHotEncoder, also from scikit-learn, and it worked. The problem is that I need column names after one-hot encoding. For example, take column A with categorical values before encoding: A = [1,2,3,4,..]

After encoding, the columns should look like this:

A-1, A-2, A-3

Does anyone know how to assign column names of the form (old column name)-(value name or number) after one-hot encoding? Here is my one-hot encoding code and its output:

I need named columns because I trained an ANN, and every time new data arrives I cannot convert all the past data again and again. So I want to encode and add just the new data each time. Thanks anyway.


asked May 28 '19 09:05

Aditya Pratama

1 Answer

You can get the column names by calling the encoder's .get_feature_names() method.

>>> ohenc.get_feature_names()
>>> x_cat_df.columns = ohenc.get_feature_names()

Detailed example is here.

Update

From scikit-learn version 1.0, use get_feature_names_out() instead; get_feature_names() was deprecated in 1.0 and removed in 1.2.

answered Sep 28 '22 09:09

Venkatachalam

