I am training a neural network which has 10 or so categorical inputs. After one-hot encoding these categorical inputs I end up feeding around 500 inputs into the network.
I would love to be able to ascertain the importance of each of my categorical inputs. Scikit-learn has numerous feature importance algorithms; however, can any of these be applied to categorical inputs? All of the examples use numerical inputs.
I could apply these methods to the one-hot encoded inputs, but how would I extract the meaning after applying them to the binarised inputs? How does one go about judging feature importance for categorical inputs?
Common classification algorithms used in this setting include Logistic Regression, K-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Decision Trees.
ANOVA F-test feature selection: importantly, ANOVA is used when one variable is numerical and one is categorical, such as numerical input variables and a categorical target variable in a classification task.
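As a rough illustration, scikit-learn's SelectKBest can be combined with f_classif to run ANOVA F-test selection; the toy data and the choice of k below are made up purely for the example:

```python
# A minimal sketch of ANOVA F-test feature selection with scikit-learn.
# The data and k=10 are illustrative, not taken from the original post.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif

# Toy data standing in for numerical inputs and a classification target.
X, y = make_classification(n_samples=200, n_features=20, random_state=0)

selector = SelectKBest(score_func=f_classif, k=10)
X_selected = selector.fit_transform(X, y)

print("F-scores:", np.round(selector.scores_, 2))
print("Selected feature indices:", selector.get_support(indices=True))
```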
XGBoost does not support categorical variables natively, so it is necessary to encode them prior to training.
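A minimal sketch of that workflow, assuming xgboost is installed and using made-up column names, could look like this:

```python
# A minimal sketch of encoding categorical columns before training XGBoost.
# Column names, data, and the use of OneHotEncoder are illustrative assumptions.
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder
from xgboost import XGBClassifier  # assumes the xgboost package is installed

df = pd.DataFrame({
    "colour": ["red", "blue", "green", "blue"],
    "size":   ["S", "M", "L", "M"],
})
y = [0, 1, 1, 0]

# One-hot encode the categorical columns, then feed the result to XGBoost.
encode = ColumnTransformer(
    [("onehot", OneHotEncoder(handle_unknown="ignore"), ["colour", "size"])]
)
model = Pipeline([("encode", encode), ("clf", XGBClassifier(n_estimators=50))])
model.fit(df, y)
```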
Logistic Regression is a classification algorithm, so it is suited to problems with a categorical target variable.
Using feature selection algorithms on one-hot encoded inputs can be misleading because of the relations between the encoded features. For example, if you encode a feature with n values into n binary features and n-1 of them are selected, the last feature is redundant, since it is fully determined by the others.
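If you still want per-feature importances from a one-hot representation, one option (sketched below with made-up data and a random forest rather than your network) is to sum the importances of all encoded columns that came from the same original feature:

```python
# A minimal sketch of mapping one-hot-encoded columns back to their original
# categorical feature and summing importances per feature. The data, column
# names, and RandomForestClassifier are illustrative assumptions.
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import OneHotEncoder

df = pd.DataFrame({
    "colour": ["red", "blue", "green", "blue", "red", "green"],
    "size":   ["S", "M", "L", "M", "S", "L"],
})
y = [0, 1, 1, 0, 0, 1]

enc = OneHotEncoder()
X = enc.fit_transform(df)
names = enc.get_feature_names_out(df.columns)   # e.g. "colour_red", "size_M"

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# Sum the per-column importances for all columns derived from the same feature.
importance = pd.Series(clf.feature_importances_, index=names)
per_feature = importance.groupby(lambda n: n.rsplit("_", 1)[0]).sum()
print(per_feature)
```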
Since the number of your features is quite low (~10), feature selection will not help you much, since you will probably only be able to drop a few of them without losing too much information.
You wrote that one-hot encoding turns the 10 features into about 500, meaning that each feature has roughly 50 values. In this case you might be more interested in discretisation algorithms that operate on the values themselves. If there is an implied order on the values, you can use algorithms for continuous features. Another option is simply to omit rare values, or values without a strong correlation to the target.
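For example, merging rare values into a single bucket takes only a few lines of pandas; the threshold and data below are made up for illustration:

```python
# A minimal sketch of collapsing rare category values into a single "other"
# bucket before encoding. The count threshold and values are illustrative.
import pandas as pd

s = pd.Series(["a", "a", "b", "c", "c", "c", "d", "e", "a", "c"])

counts = s.value_counts()
rare = counts[counts < 2].index            # values seen fewer than 2 times
s_clean = s.where(~s.isin(rare), "other")  # replace rare values with "other"

print(s_clean.value_counts())
```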
If you do use feature selection, most algorithms will work on categorical data, but you should beware of corner cases. For example, mutual information, suggested by @Igor Raush, is an excellent measure. However, features with many values tend to have higher entropy than features with fewer values. That in turn can lead to higher mutual information and a bias towards features with many values. A way to cope with this problem is to normalise by dividing the mutual information by the feature's entropy.
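A small sketch of that normalisation, using scikit-learn's mutual_info_score and scipy's entropy on made-up data:

```python
# A minimal sketch of mutual information normalised by the feature's entropy,
# to reduce the bias towards high-cardinality features. The data is illustrative.
import numpy as np
from scipy.stats import entropy
from sklearn.metrics import mutual_info_score

feature = np.array(["red", "blue", "green", "blue", "red", "green", "red"])
target  = np.array([0, 1, 1, 1, 0, 1, 0])

mi = mutual_info_score(feature, target)        # mutual information, in nats
_, counts = np.unique(feature, return_counts=True)
h_feature = entropy(counts)                    # feature entropy, in nats

normalised_mi = mi / h_feature if h_feature > 0 else 0.0
print(f"MI = {mi:.3f}, H(feature) = {h_feature:.3f}, MI/H = {normalised_mi:.3f}")
```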
Another set of feature selection algorithms that might help you are the wrappers. They delegate the learning to the classification algorithm itself, and are therefore indifferent to the representation as long as the classification algorithm can cope with it.
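scikit-learn's SequentialFeatureSelector is one such wrapper; the sketch below uses a KNN classifier and arbitrary parameters purely for illustration:

```python
# A minimal sketch of a wrapper approach using SequentialFeatureSelector,
# which scores candidate feature subsets with the classifier itself.
# The classifier, data, and n_features_to_select are illustrative assumptions.
from sklearn.datasets import make_classification
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=300, n_features=15, random_state=0)

selector = SequentialFeatureSelector(
    KNeighborsClassifier(), n_features_to_select=5, direction="forward", cv=3
)
selector.fit(X, y)
print("Selected feature indices:", selector.get_support(indices=True))
```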