Complexity of partial_sort vs nth_element

Tags:

According to cppreference.com, the complexity of the C++ STL sorting algorithms is:

sort: O(N log(N))

partial_sort: "approximately" O(N log(M)), where M is distance(middle-first)

nth_element: "on average" O(N)

However, this seems to imply that, instead of doing a partial_sort, you could use nth_element and then sort the first range, to give an overall complexity of O(N + M log(M)), which is a bit better than O(N log(M)). Is this actually true? Am I better off avoiding partial_sort?

626

asked Jul 02 '15 22:07

rlbond

1 Answers

std::partial_sort would perform partial sort for the M elements you are interested in. On the other hand std::nth_element would only give you an array, such that nth element is placed such that all elements on the left are smaller and on the right are greater.

Use std::partial_sort for use cases such as, getting top 10 results out of a million in order of rank. Use std::nth_element for finding the median of an array, or to find out who stood 10th in exam results.

If you are just interested in the performance characteristics of both, for smaller values of M, std::partial_sort would perform better than std::nth_element (about 10,000) . For a detailed analysis of this, see: https://www.youtube.com/watch?v=-0tO3Eni2uo

Summary of video

std::nth_element uses modified Quickselect, which provides O(N) complexity regardless of M.

std::partial_sort uses Heapselect, which provides better performance than Quickselect for small M. As a side effect, the end state of Heapselect leaves you with a heap, which means that you get the first half of the Heapsort algorithm "for free".

std::partial_sort is optimized for the case where M is a small constant relative to N. For example, taking the top 10 items from a very large variable-length list. It is not optimized for the other cases.

In a race between std::partial_sort and std::nth_element + std::sort, std::partial_sort jumps out to an early lead (small M) but is overtaken by std::nth_element + std::sort once M is no longer small.

answered Oct 01 '22 17:10

Average Joe

Related questions
                            
                                Existence of objects created in C functions
                            
                                Is it legal to call delete on a null pointer of an incomplete type?
                            
                                Is it safe to link gcc 6, gcc 7, and gcc 8 objects?
                            
                                int numeral -> pointer conversion rules
                            
                                Dropping privileges in C++ on Windows
                            
                                Mocking non-virtual methods in C++ without editing production code?
                            
                                g++/clang ultra fast parse but not compile mode?
                            
                                calling managed c# functions from unmanaged c++
                            
                                C / C++ packages to understand code for refactoring
                            
                                Randomly permute N first elements of a singly linked list
                            
                                Is it possible to enable array bounds checking in g++?
                            
                                polymorphic iterators in C++
                            
                                Pass by reference vs pass by pointer? [duplicate]
                            
                                Necessity of forward-declaring template functions
                            
                                C++ static initialization vs __attribute__((constructor))
                            
                                Implicit generated members and noexcept
                            
                                Is `std::function` allowed to move its arguments?
                            
                                What are "terse ranged-based for loops"?
                            
                                Inlining of vararg functions
                            
                                How to check if a struct is NULL in C or C++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Complexity of partial_sort vs nth_element

Tags:

c++

algorithm

time-complexity

rlbond

People also ask

1 Answers

Summary of video

Average Joe

Recent Activity

Donate For Us