Meaning of average complexity when using Big-O notation

Tags: algorithm, complexity-theory, big-o

While I was answering this question, a debate began in the comments about the complexity of QuickSort. What I remember from my university days is that QuickSort is O(n^2) in the worst case, O(n log(n)) in the average case, and O(n log(n)) (but with a tighter bound) in the best case.

What I need is a correct mathematical explanation of the meaning of average complexity, so that I can explain clearly what it is about to someone who believes that big-O notation can only be used for the worst case.

What I remember is that to define average complexity you should consider the complexity of the algorithm for all possible inputs and count how many are degenerate cases and how many are normal cases. If the number of degenerate cases divided by n tends towards 0 as n gets large, then you can speak of the average complexity of the overall function for normal cases.

Is this definition right, or is the definition of average complexity different? And if it's correct, can someone state it more rigorously than I can?

678

asked Oct 11 '10 10:10

kriss

1 Answer

Let's refer to the Big O notation article on Wikipedia:

Let f and g be two functions defined on some subset of the real numbers. One writes f(x)=O(g(x)) as x --> infinity if ...

So the premise of the definition is that the function f takes a number as input and yields a number as output. What input number are we talking about? Presumably the number of elements in the sequence to be sorted. What output number could we be talking about? It could be the number of operations performed to sort the sequence. But stop. What is a function? From the Function article on Wikipedia:

a function is a relation between a set of inputs and a set of permissible outputs with the property that each input is related to exactly one output.

Are we producing exactly one output with our prior definition? No, we aren't. For a given sequence size, the number of operations can vary widely depending on the order of the elements. So to make the definition applicable to our case, we need to reduce the set of possible outcomes (numbers of operations) to a single value. It can be the maximum ("the worst case"), the minimum ("the best case") or an average. The average is taken over some assumed distribution of the inputs of that size, usually with every ordering considered equally likely.
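To make this reduction concrete, here is a minimal sketch in Python. It is only an illustration under stated assumptions: the cost is measured as the number of comparisons against the pivot, the pivot is always the first element, and every permutation of n distinct values is treated as equally likely. The names `quicksort_comparisons` and `cost_summary` are made up for this example.

```python
from itertools import permutations
from statistics import mean


def quicksort_comparisons(seq):
    """Count element-to-pivot comparisons made by a simple quicksort.

    Assumption: the first element is the pivot, and every other element
    is charged one comparison per partitioning step.
    """
    if len(seq) <= 1:
        return 0
    pivot, rest = seq[0], seq[1:]
    smaller = [x for x in rest if x < pivot]
    larger = [x for x in rest if x >= pivot]
    return len(rest) + quicksort_comparisons(smaller) + quicksort_comparisons(larger)


def cost_summary(n):
    """Reduce the costs of all n! inputs of size n to (best, average, worst)."""
    costs = [quicksort_comparisons(list(p)) for p in permutations(range(n))]
    return min(costs), mean(costs), max(costs)


# For each size n there are many costs, but only one best, average and worst.
for n in range(1, 8):
    best, average, worst = cost_summary(n)
    print(f"n={n}: best={best} average={average:.2f} worst={worst}")
```

Each of the three columns is a genuine function of n, which is what the big O definition requires. With this pivot rule the worst column equals n(n-1)/2, while the average stays well below it.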

The conclusion is that talking about the best, worst or average case is mathematically correct, and that using big O notation without saying which one is meant is, in the context of sorting complexity, somewhat sloppy.

On the other hand, we could be more precise and use big Theta notation instead of big O notation.
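Stated as formulas, the three reductions could be written as follows. This is a sketch with notation that is not part of the quoted definitions: I_n is assumed to be the set of inputs of size n, C(x) the number of operations performed on input x, and the average is assumed to be uniform over I_n.

```latex
T_{\text{worst}}(n) = \max_{x \in I_n} C(x), \qquad
T_{\text{best}}(n) = \min_{x \in I_n} C(x), \qquad
T_{\text{avg}}(n) = \frac{1}{|I_n|} \sum_{x \in I_n} C(x)

% Each T is an ordinary function of n, so O or \Theta can be applied to it.
% For a basic QuickSort with a fixed pivot rule:
T_{\text{worst}}(n) = \Theta(n^2), \qquad
T_{\text{avg}}(n) = \Theta(n \log n), \qquad
T_{\text{best}}(n) = \Theta(n \log n)
```

With a non-uniform input distribution, the average would instead be the expected value of C(x) under that distribution.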

147

answered Oct 15 '22 15:10

Alexey
