Consider the following training data set:
+-------+-------+----------+-------------+
| Size  | Color | Shape    | Class/Label |
+=======+=======+==========+=============+
| big   | red   | circle   | No          |
| small | red   | triangle | No          |
| small | red   | circle   | Yes         |
| big   | blue  | circle   | No          |
| small | blue  | circle   | Yes         |
+-------+-------+----------+-------------+
I would like to understand how the algorithm proceeds when it starts with a negative example and when two negative examples come one after the other.
This is not an assignment question by the way.
Examples with other data sets are also welcome! My goal is to understand how the algorithm handles negative examples.
The candidate elimination algorithm (CEA) is a supervised technique for learning a concept from labeled examples. It incrementally builds the version space given a hypothesis space H and a set E of examples: the examples are processed one by one, and each example may shrink the version space by removing the hypotheses that are inconsistent with it.

The algorithm is sometimes said to be an unbiased learner because it imposes no bias beyond the language bias involved in choosing H. It converges to the correct hypothesis provided that (1) the training examples are error free and (2) the target concept can be represented as a conjunction of attribute constraints.
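Before walking through the trace, it may help to fix a concrete representation. Here is a minimal Python sketch (the function names matches and consistent are mine, not from any library) of a conjunctive hypothesis over the three attributes and of the consistency check the algorithm applies to each example:

```python
# A hypothesis is a tuple of constraints, one per attribute
# (Size, Color, Shape). '?' accepts any value; None accepts no value
# (the maximally specific "empty" constraint written <0, 0, 0> below).

def matches(hypothesis, example):
    """True if the hypothesis covers the example."""
    return all(c == '?' or c == v for c, v in zip(hypothesis, example))

def consistent(hypothesis, example, is_positive):
    """A hypothesis is consistent if it covers positives and rejects negatives."""
    return matches(hypothesis, example) == is_positive

print(matches(('?', '?', '?'), ('big', 'red', 'circle')))            # True
print(matches(('small', '?', 'circle'), ('big', 'red', 'circle')))   # False
```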
For your hypothesis space (H), you start with your sets of maximally general (G) and maximally specific (S) hypotheses:
G0 = {<?, ?, ?>}
S0 = {<0, 0, 0>}
When you are presented with a negative example, you need to remove from S any hypothesis inconsistent with the current observation and replace any inconsistent hypothesis in G with its minimal specializations that are consistent with the observation but still more general than some member of S.
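Concretely, for this data set a minimal specialization just replaces one "?" in the general hypothesis with an attribute value that differs from the negative example. A possible sketch (the attribute domains are read off the table above; the function name is hypothetical):

```python
# Assumed attribute domains, taken from the training table.
DOMAINS = [('big', 'small'), ('red', 'blue'), ('circle', 'triangle')]

def min_specializations(g, negative):
    """Minimal specializations of g that no longer cover the negative example."""
    specs = []
    for i, constraint in enumerate(g):
        if constraint == '?':
            for value in DOMAINS[i]:
                if value != negative[i]:
                    specs.append(g[:i] + (value,) + g[i + 1:])
    return specs

print(min_specializations(('?', '?', '?'), ('big', 'red', 'circle')))
# [('small', '?', '?'), ('?', 'blue', '?'), ('?', '?', 'triangle')]
```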
So for your first (negative) example, (big, red, circle), the minimal specializations give the new boundary sets
G1 = {<small, ?, ?>, <?, blue, ?>, <?, ?, triangle>}
S1 = S0 = {<0, 0, 0>}
Note that S did not change. For your next example, (small, red, triangle), which is also negative, you will need to specialize G further. The second hypothesis in G1 does not match the new observation, so only the first and third hypotheses in G1 need to be specialized. That would yield
G2 = {<small, blue, ?>, <small, ?, circle>, <?, blue, ?>, <big, ?, triangle>, <?, blue, triangle>}
However, since the first and last hypotheses in G2 above are specializations of the middle hypothesis <?, blue, ?>, we drop those two, giving
G2 = {<small, ?, circle>, <?, blue, ?>, <big, ?, triangle>}
S2 = S1 = S0 = {<0, 0, 0>}
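That pruning step, dropping members of G that are less general than another member, can be sketched as follows (helper names are my own):

```python
def more_general(h1, h2):
    """True if h1 is at least as general as h2."""
    return all(a == '?' or a == b for a, b in zip(h1, h2))

def prune(G):
    """Keep only the maximally general hypotheses of G."""
    return [g for g in G
            if not any(more_general(other, g) and other != g for other in G)]

G2 = [('small', 'blue', '?'), ('small', '?', 'circle'), ('?', 'blue', '?'),
      ('big', '?', 'triangle'), ('?', 'blue', 'triangle')]
print(prune(G2))
# [('small', '?', 'circle'), ('?', 'blue', '?'), ('big', '?', 'triangle')]
```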
For the positive observation (small, red, circle), you must generalize S and remove anything in G that is inconsistent with it, which gives
G3 = {<small, ?, circle>}
S3 = {<small, red, circle>}
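The update on a positive example is the dual operation: each member of S is replaced by its minimal generalization that covers the example. A sketch, again with hypothetical names:

```python
EMPTY = (None, None, None)  # the maximally specific hypothesis <0, 0, 0>

def min_generalization(s, positive):
    """Smallest generalization of s that covers the positive example."""
    return tuple(
        v if c is None    # an empty constraint takes the example's value
        else c if c == v  # constraint already covers this value
        else '?'          # disagreement -> relax to a wildcard
        for c, v in zip(s, positive)
    )

print(min_generalization(EMPTY, ('small', 'red', 'circle')))
# ('small', 'red', 'circle')
print(min_generalization(('small', 'red', 'circle'), ('small', 'blue', 'circle')))
# ('small', '?', 'circle')
```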
The next example, (big, blue, circle), is negative. But since no hypothesis in G (or S) covers it, both boundary sets are already consistent with it and nothing changes:
G4 = G3 = {<small, ?, circle>}
S4 = S3 = {<small, red, circle>}
Lastly, you have the positive example (small, blue, circle), which requires you to generalize S to make it consistent with the example, giving
G5 = {<small, ?, circle>}
S5 = {<small, ?, circle>}
Since G and S are equal, you have learned the concept of "small circles".
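Putting the pieces together, here is a self-contained sketch (not a reference implementation; it omits the check that specializations stay more general than some member of S, which never fires in this trace) that replays the whole run and converges to <small, ?, circle>:

```python
# Candidate elimination on the training set above.
# Attribute order: (Size, Color, Shape). '?' = any value, None = no value.

DOMAINS = [('big', 'small'), ('red', 'blue'), ('circle', 'triangle')]

def matches(h, x):
    return all(c == '?' or c == v for c, v in zip(h, x))

def more_general(h1, h2):
    return all(a == '?' or a == b for a, b in zip(h1, h2))

def min_specializations(g, x):
    return [g[:i] + (v,) + g[i + 1:]
            for i, c in enumerate(g) if c == '?'
            for v in DOMAINS[i] if v != x[i]]

def min_generalization(s, x):
    return tuple(v if c is None else c if c == v else '?' for c, v in zip(s, x))

def candidate_elimination(examples):
    G = [('?', '?', '?')]
    S = [(None, None, None)]
    for x, positive in examples:
        if positive:
            G = [g for g in G if matches(g, x)]        # drop inconsistent G members
            S = [min_generalization(s, x) for s in S]  # generalize S minimally
        else:
            S = [s for s in S if not matches(s, x)]    # drop S members covering x
            new_G = []
            for g in G:
                if matches(g, x):
                    new_G.extend(min_specializations(g, x))
                else:
                    new_G.append(g)
            # keep only the maximally general hypotheses
            G = [g for g in new_G
                 if not any(more_general(o, g) and o != g for o in new_G)]
        print('G =', G, '  S =', S)
    return G, S

examples = [
    (('big',   'red',  'circle'),   False),
    (('small', 'red',  'triangle'), False),
    (('small', 'red',  'circle'),   True),
    (('big',   'blue', 'circle'),   False),
    (('small', 'blue', 'circle'),   True),
]
candidate_elimination(examples)
# Last line printed: G = [('small', '?', 'circle')]   S = [('small', '?', 'circle')]
```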