I'm trying to solve some machine-learning problems using neural networks, mostly with NEAT (NeuroEvolution of Augmenting Topologies).
Some of my input variables are continuous, but some of them are of a categorical nature, like the species of an animal: Lion, Leopard, Tiger, or Jaguar.
At first I wanted to model such a variable by mapping the categories to discrete numbers, like:
{Lion:1, Leopard:2, Tiger:3, Jaguar:4}
But I'm afraid this imposes some kind of arbitrary topology on the variable: a Tiger is not the sum of a Lion and a Leopard.
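A quick illustration of the problem (the mapping above, in Python):

```python
# The integer codes assert an ordering and arithmetic that have no semantic meaning.
codes = {"Lion": 1, "Leopard": 2, "Tiger": 3, "Jaguar": 4}

print(codes["Lion"] + codes["Leopard"] == codes["Tiger"])  # True, i.e. 1 + 2 == 3
print(codes["Leopard"] < codes["Jaguar"])                  # True, but the comparison means nothing
```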
What approaches to this problem are usually employed?
Machine learning algorithms and deep learning neural networks require input and output variables to be numbers. This means that categorical data must be encoded to numbers before we can use it to fit and evaluate a model.
One option is to let the network learn the representation itself: we can use neural networks to represent categorical variables as learned embeddings.
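For instance, here is a minimal sketch using PyTorch's nn.Embedding (the layer sizes and variable names are illustrative, not from the original post):

```python
import torch
import torch.nn as nn

# Each category gets a dense, learnable vector instead of a one-hot column.
species = ["Lion", "Leopard", "Tiger", "Jaguar"]
index = {name: i for i, name in enumerate(species)}  # category -> integer id

embedding = nn.Embedding(num_embeddings=len(species), embedding_dim=2)  # dim chosen arbitrarily

ids = torch.tensor([index["Tiger"], index["Lion"]])  # a mini-batch of category ids
vectors = embedding(ids)  # shape (2, 2); trained jointly with the rest of the model
print(vectors.shape)
```

Because the embedding weights are updated by backpropagation, categories that behave similarly can end up with similar vectors.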
The most common alternative is one-hot encoding: for each level of a categorical feature, we create a new variable. Each category is mapped to a binary variable containing either 0 or 1, where 0 represents the absence and 1 the presence of that category. These newly created binary features are known as dummy variables.
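A minimal sketch with pandas (the prefix and column names are illustrative):

```python
import pandas as pd

# One binary "dummy" column per category level; exactly one is set per row.
species = pd.Series(["Lion", "Tiger", "Leopard", "Jaguar"], name="species")
dummies = pd.get_dummies(species, prefix="is")
print(dummies)  # columns: is_Jaguar, is_Leopard, is_Lion, is_Tiger
```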
Unfortunately there is no perfect solution; each approach leads to some kind of problem:

1. Map each category to an integer, as you proposed.
2. Create a binary feature is_categorical_feature_i_equal_j for every value j of every categorical feature i. This won't induce any additional topology, but it significantly grows the number of features. So instead of "species" you get features "is_lion", "is_leopard", etc., and only one of them equals 1 at a time (a hand-rolled sketch follows this list).

These first two approaches are two "extreme" cases: the first is very computationally cheap but can lead to high bias, while the second introduces much complexity but should not influence the classification process itself. The last one is rarely usable (due to its assumption of a small number of categorical values), yet it is quite reasonable in terms of machine learning.
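For approach 2, a hand-rolled version in plain Python (assuming a fixed category list; the names are illustrative), e.g. to build the input vector for a NEAT genome:

```python
CATEGORIES = ["Lion", "Leopard", "Tiger", "Jaguar"]

def one_hot(value, categories=CATEGORIES):
    """Return 0/1 features is_lion, is_leopard, ...: exactly one equals 1."""
    return [1 if value == c else 0 for c in categories]

print(one_hot("Tiger"))  # [0, 0, 1, 0]
```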
So many things have changed in 8 years. Solution 2 is definitely the most popular one, and with the growth of compute, the wide adoption of neural networks, and support for sparse inputs, its cost is now negligible.
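As one example of the sparse-input point, scikit-learn's OneHotEncoder returns a SciPy sparse matrix by default, so even very wide one-hot features stay cheap to store:

```python
from sklearn.preprocessing import OneHotEncoder

X = [["Lion"], ["Tiger"], ["Leopard"], ["Tiger"]]
encoder = OneHotEncoder()            # sparse output by default
X_sparse = encoder.fit_transform(X)  # SciPy sparse matrix, shape (4, 3)
print(type(X_sparse), X_sparse.shape)
```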