std:sort vs inserting into an std::set

Tags:

I am reading some line segments from cin. Each line segment is represented by a start and end point. 2D. X and Y.

The input is not sorted. It is in random order. (Update:But I need them sorted first by X and then by Y)

I can read in all the segments, store them in a vector and then call std::sort. On the other hand, I can create an empty std::set and insert each segment as it arrives. The set will automatically maintain sorted order. Which of the two approaches is more efficient?

Update: The total size of the input (number of segments) is known in advance.

498

asked Mar 26 '13 13:03

Agnel Kurian

3 Answers

You should measure the performance of both approaches to be sure, but it's a safe bet to assume that std::sort on an std::vector is way faster than inserting into an std::set due to locality effects and the large constants hiding in the tree insertion algorithm. Also, the subsequent lookups and iteration will be faster.

(However, std::set is better suited for supporting a mixed series of insertions and deletions/lookups/iterations. Maintaining order in vector is expensive, as each insertion will take linear time on average.)

119

answered Sep 23 '22 03:09

Fred Foo

As a good rule of thumb, the stricter guarantees are offered, the worse performance you'll get.

Inserting into a std::set guarantees that the sequence is sorted after every insertion.

Inserting into a std::vector, and calling std::sort once after all insertions have been done guarantees that the sequence is sorted once all manipulations on the vector have been done. It doesn't require the vector to be sorted during all the intermediate insertions.

A std::vector also exhibits better spatial locality, and requires fewer memory allocations. So I would assume the vector approach to be faster, but if performance matters to you, then it matters enough to be measured.

If you don't care to measure what is faster in your case for your data sets with your code in your application, then you don't care which is faster.

answered Sep 20 '22 03:09

jalf

Use the container that has the appropriate semantics for your needs. Efficiency generally follows on automatically from that choice.

If you then experience performance bottlenecks, do some benchmarking.

answered Sep 21 '22 03:09

Lightness Races in Orbit

Related questions
                            
                                Visual C++ Enable Console
                            
                                Which school of reporting function failures is better
                            
                                function pointer without typedef
                            
                                How do I check if a value is contained in a vector? C++
                            
                                Memoized, recursive factorial function?
                            
                                Do an OR in Switch-Case in C++
                            
                                "Use of plus() is ambiguous" error
                            
                                Segmentation fault on erasing the last element from the vector C++
                            
                                How do I implement QHoverEvent in Qt?
                            
                                Writing ALL program output to a txt file in C++
                            
                                What is a good scripting language to integrate into high-performance applications?
                            
                                Which C++ logical operators do you use: and, or, not and the ilk or C style operators? why? [closed]
                            
                                Exposing a std::list as read only
                            
                                C++ append one vector to another
                            
                                Try-Catch Block For C++ File-IO Errors Not Working
                            
                                Find only file name from full path of the file in vc++
                            
                                Calling a lambda expression multiple times
                            
                                C++: subtract vectors
                            
                                Basics of strtol?
                            
                                Force all && to be executed?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With