Understanding the time complexity of the Longest Common Subsequence Algorithm

Tags: algorithm, recursion, subsequence, lcs

I do not understand the O(2^n) complexity that the recursive function for the Longest Common Subsequence algorithm has.

Usually, I can tie this notation with the number of basic operations (in this case comparisons) of the algorithm, but this time it doesn't make sense in my mind.

For example, take two strings that both have length 5. In the worst case the recursive function performs 251 comparisons, and 2^5 is not even close to that value.

Can anyone explain the algorithmic complexity of this function?

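nComp = 0  # global comparison counter; reset it to 0 before each measurement

def lcs(xstr, ystr):
    global nComp
    # Base case: an empty string shares no subsequence with anything
    if not xstr or not ystr:
        return ""
    # Split each string into its first character and the remainder
    x, xs, y, ys = xstr[0], xstr[1:], ystr[0], ystr[1:]
    nComp += 1
    #print("comparing",x,"with",y)
    if x == y:
        # Matching heads belong to the LCS; recurse on both remainders
        return x + lcs(xs, ys)
    else:
        # Otherwise drop one head from either string and keep the longer result
        return max(lcs(xstr, ys), lcs(xs, ystr), key=len)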


asked Jan 08 '16 23:01

Daniel Catita

2 Answers

To understand it properly, look at the diagram below carefully and follow the recursion from the top down.

Here, xstr = "ABCD"
      ystr = "BAEC"

                                    lcs("ABCD", "BAEC")       // Here x != y 
                                  /                     \  
                lcs("BCD", "BAEC")   <--  x==y   -->    lcs("ABCD", "AEC")  x==y
                          |                                        |
                          |                                        |
                  lcs("CD", "AEC")   <--  x!=y   -->     lcs("BCD", "EC")
                 /                \                     /                \
                /                  \                   /                  \
               /                    \                 /                    \
      lcs("D","AEC")                  lcs("CD", "EC")              lcs("BCD", "C")
    /                \              /               \              /        \       
lcs("", "AEC")        lcs("D","EC")                  lcs("CD", "C")        lcs("BCD","")
  |        \         /              \                       |             /       |
Return     lcs("", "EC")    lcs("D" ,"C")            lcs("D", "")   lcs("CD","")  Return
           /         \       /         \             /        \       /        \ 
        Return      lcs("","C")    lcs("D","") lcs("","")  Return  lcs("D","")  Return
                     /     \         /     \      /                 /      \
                  Return   lcs("","")       Return            lcs("", "")  Return
                                 |                                  |
                              Return                            Return

NOTE: Recursive calls are usually drawn as a tree, but here I used a graph to compress the tree so that the calls can be followed at a glance (and because it was easier for me to draw). Also note that in the code above a call returns immediately when either string is empty, so the nodes drawn beneath such calls are not actually executed.

In the diagram above there are some redundant pairs, such as lcs("CD", "EC"), which is reached both by deleting "A" from "AEC" in lcs("CD", "AEC") and by deleting "B" from "BCD" in lcs("BCD", "EC"). Such pairs are computed more than once during execution, which increases the running time of the program.
As you can see, every pair generates two calls at the next level until it reaches an empty string or x == y. Therefore, if xstr has length n and ystr has length m, then in the worst case the number of calls is at most on the order of 2^(n+m). (Why? Each call removes one character from one of the strings, so the recursion is at most n+m levels deep, and each level can at most double the number of calls.)
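The repeated pairs can be made visible with a small sketch. It assumes a hypothetical helper, count_calls, that mirrors the branching of the lcs function from the question but only records how often each pair of arguments is visited; it is an illustration, not part of the original code.

```python
from collections import Counter

def count_calls(xstr, ystr, calls):
    """Mirror the recursion of lcs() and record how often each pair is visited."""
    calls[(xstr, ystr)] += 1
    if not xstr or not ystr:
        return
    if xstr[0] == ystr[0]:
        # Heads match: a single recursive call on both remainders
        count_calls(xstr[1:], ystr[1:], calls)
    else:
        # Heads differ: two recursive calls, same order as in lcs()
        count_calls(xstr, ystr[1:], calls)
        count_calls(xstr[1:], ystr, calls)

calls = Counter()
count_calls("ABCD", "BAEC", calls)

# Keep only the pairs that were visited more than once
repeated = {pair: k for pair, k in calls.items() if k > 1}
for pair, k in sorted(repeated.items()):
    print(pair, k)
```

For the strings in the diagram, the pair ("CD", "EC") is visited twice, and everything below it is recomputed each time.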

Let N = n + m. The time complexity of the algorithm is then O(2^N), which is not efficient for larger values of N.
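To connect the bound with the 251 comparisons from the question, here is a minimal sketch that counts comparisons under the assumption that no two characters ever match, so every call branches twice. The function name worst_case_comparisons is made up for this illustration. Under that assumption the count satisfies T(n, m) = 1 + T(n, m-1) + T(n-1, m), which works out to C(n+m, n) - 1.

```python
from math import comb

def worst_case_comparisons(n, m):
    """Comparisons made by the recursive lcs() when no characters ever match."""
    if n == 0 or m == 0:
        return 0  # an empty string returns before any comparison
    # One comparison here, then both branches are explored
    return 1 + worst_case_comparisons(n, m - 1) + worst_case_comparisons(n - 1, m)

for n in range(1, 8):
    count = worst_case_comparisons(n, n)
    # Columns: n, measured count, C(2n, n) - 1, upper bound 2^(n+m) with m == n
    print(n, count, comb(2 * n, n) - 1, 2 ** (2 * n))
```

For n = m = 5 this gives 251, which is C(10, 5) - 1 and stays below 2^(5+5) = 1024, so the measured value is consistent with the O(2^N) bound.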

Therefore, we prefer the dynamic-programming approach over the plain recursive approach. It reduces the time complexity to O(n·m), which is O(n²) when n == m.

If you still find the logic hard to follow, I would suggest drawing a tree (not the compressed graph shown here) for xstr = "ABC" and ystr = "EF". I hope that makes it clear.

Doubts and comments are most welcome.
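One possible bottom-up version is sketched below. The name lcs_dp and the table layout are my own choices for illustration, not something taken from the question; the idea is simply to fill an (n+1) x (m+1) table once, so each pair of suffixes is solved exactly once.

```python
def lcs_dp(xstr, ystr):
    """Return one longest common subsequence of xstr and ystr in O(n*m) time."""
    n, m = len(xstr), len(ystr)
    # table[i][j] holds the LCS length of the suffixes xstr[i:] and ystr[j:]
    table = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if xstr[i] == ystr[j]:
                table[i][j] = 1 + table[i + 1][j + 1]
            else:
                table[i][j] = max(table[i][j + 1], table[i + 1][j])

    # Walk the table from the top-left corner to rebuild one LCS
    i = j = 0
    out = []
    while i < n and j < m:
        if xstr[i] == ystr[j]:
            out.append(xstr[i])
            i += 1
            j += 1
        elif table[i][j + 1] >= table[i + 1][j]:
            j += 1  # dropping the head of ystr loses nothing
        else:
            i += 1  # dropping the head of xstr is strictly better
    return "".join(out)

print(lcs_dp("ABCD", "BAEC"))
```

The two nested loops touch each table cell once, which is where the O(n·m) comes from; the price is O(n·m) extra memory for the table.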


answered Nov 01 '22 01:11

surajs1n

O(2^n) means that, for large enough n, the run time grows no faster than a constant multiple of 2^n. It doesn't mean the number is bad, high, low, or anything specific for a small n, and it doesn't give a way to calculate the absolute run time.

To get a feel for the implication, you should consider the run-times for n = 1000, 2000, 3000, or even 1 million, 2 million, etc.

In your example, assuming that for n=5 the algorithm takes a maximum of 251 iterations, the O(2^n) prediction is that for n=50 it would take in the range of 2^(50)/2^(5)*251 = 2^45*251 = ~8.8E15 iterations.
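The arithmetic can be checked with a few lines of Python. This is only the rough scaling estimate described above (scale the known count by the ratio of the two powers of two), not an exact operation count; the variable names are illustrative.

```python
known_n, known_count = 5, 251   # measured in the question
target_n = 50

# Scale the known count by 2^target_n / 2^known_n
scale = 2 ** target_n // 2 ** known_n
estimate = scale * known_count

print(f"scale factor: 2^{target_n - known_n} = {scale}")
print(f"estimate: {estimate:.2e} iterations")
```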

answered Nov 01 '22 01:11

Aganju
