While searching for answers relating to "Big O" notation, I have seen many SO answers such as this, this, or this, but still I have not clearly understood some points.
Why do we ignore the coefficients?

For example, this answer says that the final complexity of 2N + 2 is O(N); we remove the leading coefficient 2 and the final constant 2 as well.

Removing the final constant of 2 is perhaps understandable. After all, N may be very large, so "forgetting" the final 2 may only change the grand total by a small percentage.

However, I cannot clearly understand how removing the leading coefficient does not make a difference. If the leading 2 above became a 1 or a 3, the percentage change to the grand total would be large.

Similarly, apparently 2N^3 + 99N^2 + 500 is O(N^3). How do we ignore the 99N^2 along with the 500?
We may ignore any powers of n inside of the logarithms. The set O(log n) is exactly the same as O(log(n^c)) for any positive constant c. The logarithms differ only by a constant factor (since log(n^c) = c log n), and thus the big O notation ignores that. Similarly, logs with different constant bases are equivalent.
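A quick numerical illustration of the base-change point (just a throwaway sketch): the ratio between logs of two different bases is a constant, independent of n.

```python
import math

# By the change-of-base rule, log2(n) = log10(n) / log10(2), so the
# ratio log2(n) / log10(n) is the constant 1 / log10(2) ~= 3.3219
# for every n -- which is why the base is irrelevant inside O(...).
for n in [10, 1_000, 1_000_000, 10**12]:
    print(n, math.log2(n) / math.log10(n))
```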
The reason that we don't use constants with big O notation is that, theoretically, they don't matter much. What we are calculating with time complexity is the speed at which something will grow; a constant factor becomes completely irrelevant once a large enough input is used.
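To make that concrete, here is a small sketch with made-up step counts: even a huge constant on a linear term is eventually dwarfed by a quadratic term with a tiny constant.

```python
# Hypothetical step counts for two algorithms: linear with a huge
# constant factor versus quadratic with a tiny one. The quadratic
# still overtakes the linear once n is large enough.
def linear_steps(n):
    return 1000 * n

def quadratic_steps(n):
    return n * n // 100

for n in [10, 1_000, 100_000, 10_000_000]:
    print(n, linear_steps(n), quadratic_steps(n))
# The two meet at n = 100_000; past that point the quadratic
# dominates, no matter what constants we attach.
```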
If we let our function g(n) be n², we can find a constant c = 1 and an N₀ = 0 such that, as long as N > N₀, N² is always greater than N²/2 - N/2. We can prove this easily by subtracting N²/2 from both sides; then we see that N²/2 > -N/2 holds whenever N > 0.
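A quick sanity check of that inequality (not part of the proof, just a numeric spot check):

```python
# Verify N^2 > N^2/2 - N/2 for a range of positive N.
for N in range(1, 1001):
    assert N * N > N * N / 2 - N / 2, N
print("N^2 > N^2/2 - N/2 holds for N = 1..1000")
```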
In computer science, big O notation is used to classify algorithms according to how their run time or space requirements grow as the input size grows. In other words, it measures a function's time or space complexity. This means we can know in advance how well an algorithm will perform in a specific situation.
The purpose of Big-O notation is to identify the dominant factor in the asymptotic behavior of a function as its argument tends towards infinity.
As we walk through the function domain, some factors become more important than others.
Imagine f(n) = n^3 + n^2. As n goes to infinity, n^2 becomes less and less relevant when compared with n^3.
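You can watch that happen numerically (a tiny sketch):

```python
# Fraction of f(n) = n^3 + n^2 that the n^2 term contributes.
for n in [10, 100, 10_000, 1_000_000]:
    f = n**3 + n**2
    print(n, n**2 / f)  # shrinks towards 0 as n grows
```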
But that's just the intuition behind the definition. In practice we ignore some portions of the function because of the formal definition:
f(x) = O(g(x)) as x -> infinity if and only if there exist a positive real number M and a real number x_0 such that |f(x)| <= M|g(x)| for all x > x_0.
That definition is from Wikipedia. What it actually means is that there is a point (after x_0) beyond which some multiple of g(x) dominates f(x). The definition acts like a loose upper bound on the value of f(x).
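For example, applying the definition to the 2N + 2 from the question, one possible choice is M = 3 and x_0 = 2 (many other choices work just as well):

```latex
% Worked example: 2N + 2 = O(N), choosing M = 3 and x_0 = 2.
\[
  |2N + 2| = 2N + 2 \le 2N + N = 3N = 3\,|N|
  \quad \text{for all } N > 2,
\]
% so the definition is satisfied; both the leading coefficient 2
% and the trailing constant 2 get absorbed into M.
```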
From that we can derive many other properties, like f(x) + K = O(f(x)), x^n + x^(n-1) = O(x^n), etc. It's just a matter of using the definition to prove those.
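For instance, the coefficient property mentioned next has a one-line proof straight from the definition:

```latex
% K * f(x) = O(f(x)) for any nonzero constant K:
% take M = |K| and any x_0, then
\[
  |K \cdot f(x)| = |K|\,|f(x)| \le M\,|f(x)|
  \quad \text{for all } x > x_0.
\]
```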
In particular, the intuition behind removing the coefficient (K*f(x) = O(f(x))) lies in what we try to measure with computational complexity. Ultimately it's all about time (or any resource, actually). But it's hard to know how much time each operation takes. One algorithm may perform 2n operations and the other n, but the latter may have a large constant time associated with each of them. So, for this purpose, it isn't easy to reason about the difference between n and 2n.
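A sketch of that point with made-up per-operation costs (the numbers are purely illustrative):

```python
# Two hypothetical algorithms, both O(n): A performs 2n cheap
# operations, B performs n expensive ones. Which is faster depends
# entirely on per-operation cost, which big O deliberately ignores.
COST_A = 1.0   # assumed cost per operation of algorithm A
COST_B = 5.0   # assumed cost per operation of algorithm B

def time_a(n):
    return 2 * n * COST_A   # 2n operations

def time_b(n):
    return n * COST_B       # n operations, each costlier

for n in [100, 10_000, 1_000_000]:
    print(n, time_a(n), time_b(n))  # the "2n" algorithm wins here
```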