KMP failure function calculation

My professor computed the KMP failure function as follows:

index  1 2 3 4 5 6 7 8 9
string a a b a a b a b b
ff     0 1 2 1 2 3 4 5 1

From other texts I checked online, I found that this might be wrong. I went back to confirm with him, and he told me he's absolutely right. Can someone please explain, in a simple step-by-step manner, why his table is right or wrong? Thanks

asked Apr 20 '13 21:04

Dennis

2 Answers

As I understand the algorithm, the failure function for your example should be the following:

index  1 2 3 4 5 6 7 8 9
string a a b a a b a b b
ff     0 1 0 1 2 3 4 0 0

f - failure function (by definition, this is the length of the longest proper prefix of the string which is also a suffix, where "proper" means shorter than the string itself)

Here is how I built it step by step:

f(a) = 0 (always = 0 for one letter)

f(aa) = 1 (one letter 'a' is both a prefix and suffix)

f(aab) = 0 (no prefix is equal to a suffix: a != b, aa != ab)

f(aaba) = 1 ('a' is the same at the beginning and the end, but if you take 2 letters, they won't be equal: aa != ba)

f(aabaa) = 2 (you can take 'aa' but no more: aab != baa)

f(aabaab) = 3 (you can take 'aab')

f(aabaaba) = 4 (you can take 'aaba')

f(aabaabab) = 0 ('a' != 'b', 'aa' != 'ab', and so on; it can't be 5 either, because 'aabaa' != 'aabab')

f(aabaababb) = 0 (the same situation)
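
The steps above can be checked mechanically. Here is a minimal sketch in Python that applies the definition directly: for every prefix of the string, it tries each proper length and keeps the longest one where prefix and suffix agree. The function name failure_by_definition is made up for this illustration, and the brute-force approach is chosen for readability, not speed.

```python
def failure_by_definition(s: str) -> list[int]:
    """For each prefix of s, return the length of the longest
    proper prefix of that prefix which is also its suffix."""
    result = []
    for end in range(1, len(s) + 1):
        prefix = s[:end]
        best = 0
        # Try every proper length k (strictly shorter than the prefix).
        for k in range(1, len(prefix)):
            if prefix[:k] == prefix[-k:]:
                best = k  # k only increases, so the last hit is the longest
        result.append(best)
    return result


print(failure_by_definition("aabaababb"))  # [0, 1, 0, 1, 2, 3, 4, 0, 0]
```

The list is 0-based, so its first entry corresponds to index 1 in the table.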

answered Oct 31 '22 12:10

user2513978

Since @user1041889 was confused (and got me confused too), I'll lay out the differences between the Z-function and the failure function here.

Failure function, π[i]:

It maps an index i to the length of the longest proper prefix of the string which is also a suffix of the part of the string ending at index i

But that's arguably hard to parse, so I'll put it in simpler terms to make sure I actually understand what I'm saying:

How long is the longest substring at the beginning of the string of interest that is equal to a substring ending at index i (not counting the whole part up to i itself)?

Or equivalently:

What is the length of the longest substring ending at index i which matches the start of the string of interest?

So in your example:

index  1 2 3 4 5 6 7 8 9
string a a b a a b a b b
ff     0 1 0 1 2 3 4 0 0

We observe that π[6] = 3, so what's the substring that ends at index 6 with length 3? aab!

Interesting how we've seen that before!

Let's check that it is indeed the biggest one: for length 4, baab != aaba, and for length 5, abaab != aabaa. Yup!
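
The table above can also be reproduced with the usual linear-time construction of the prefix function, sketched below in Python. The second helper, shifted_table, is a hypothetical name for a different convention: index 1 gets 0, and every later index j gets 1 plus the failure value of the first j-1 characters. That shifted convention happens to reproduce the table from the question exactly, so one possible reading is that the professor uses it. This is an assumption; nothing in the question confirms which definition he has in mind.

```python
def prefix_function(s: str) -> list[int]:
    """pi[i] = length of the longest proper prefix of s[:i+1]
    that is also a suffix of s[:i+1] (0-based indices)."""
    pi = [0] * len(s)
    k = 0  # length of the border currently being extended
    for i in range(1, len(s)):
        # Fall back to shorter borders until s[i] can extend one.
        while k > 0 and s[i] != s[k]:
            k = pi[k - 1]
        if s[i] == s[k]:
            k += 1
        pi[i] = k
    return pi


def shifted_table(s: str) -> list[int]:
    """Hypothetical shifted convention: entry 1 is 0, entry j is
    1 + (failure value of the first j-1 characters)."""
    if not s:
        return []
    pi = prefix_function(s)
    return [0] + [p + 1 for p in pi[:-1]]


print(prefix_function("aabaababb"))  # [0, 1, 0, 1, 2, 3, 4, 0, 0]
print(shifted_table("aabaababb"))    # [0, 1, 2, 1, 2, 3, 4, 5, 1]
```

Under that reading, both tables would describe the same information, just indexed and offset differently.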

Notice how this implies that the failure function can grow by at most 1 from one index to the next (π[i+1] ≤ π[i] + 1), although it can drop by any amount.

That isn't the case for the Z-algorithm.
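
To make the contrast concrete, here is a sketch of a common Z-function implementation in Python, run on the same string. It assumes the convention that z[0] is left at 0 (some texts set it to the string length instead), and the failure values are copied from the table above. A failure value never exceeds its predecessor by more than 1, whereas the Z values jump straight from 0 to 4.

```python
def z_function(s: str) -> list[int]:
    """z[i] = length of the longest substring starting at i that
    matches a prefix of s (z[0] is left at 0 by convention here)."""
    n = len(s)
    z = [0] * n
    left = right = 0  # [left, right) is the rightmost match window found
    for i in range(1, n):
        if i < right:
            # Reuse what is already known inside the window.
            z[i] = min(right - i, z[i - left])
        # Extend the match by direct comparison.
        while i + z[i] < n and s[z[i]] == s[i + z[i]]:
            z[i] += 1
        if i + z[i] > right:
            left, right = i, i + z[i]
    return z


pi = [0, 1, 0, 1, 2, 3, 4, 0, 0]  # failure values from the table above
z = z_function("aabaababb")

print(z)  # [0, 1, 0, 4, 1, 0, 1, 0, 0]
print(all(pi[i + 1] <= pi[i] + 1 for i in range(len(pi) - 1)))  # True
print(all(z[i + 1] <= z[i] + 1 for i in range(len(z) - 1)))     # False
```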

[SAVING DRAFT to continue later]

answered Oct 31 '22 11:10

DMeneses

Tags:

string-matching

knuth-morris-pratt

knuth
