I implemented an algorithm where I make use of an priority queue. I was motivated by this question: Transform a std::multimap into std::priority_queue I am going to store up to 10 million elements with their specific priority value. I then want to iterate until the queue is empty. Every time an element is retrieved it is also deleted from the queue. After this I recalculate the elements pririty value, because of previous iterations it can change. If the value did increase I am inserting the element againg into the queue. This happens more often dependent on the progress. (at the first 25% it does not happen, in the next 50% it does happen, in the last 25% it will happen multiple times). After receiving the next element and not reinserting it, I am going to process it. This for I do not need the priority value of this element but the technical ID of this element. This was the reason I intuitively had chosen a <code>std::multimap</code> to achieve this, using <code>.begin()</code> to get the first element, <code>.insert()</code> to insert it and <code>.erase()</code> to remove it. Also, I did not intuitively choose <code>std::priority_queue</code> directly because of other questions to this topic answering that <code>std::priority_queue</code> most likely is used for only single values and not for mapped values. After reading the link above I reimplemented it using priority queue analogs to the other question from the link. My runtimes seem to be not that unequal (about an hour on 10 mio elements). Now I am wondering why <code>std::priority_queue</code> is faster at all. I actually would expect to be the <code>std::multimap</code> faster because of the many reinsertions. Maybe the problem is that there are too many reorganizations of the multimap?

I think the main difference comes form two facts: <ol> <li>Priority queue has a weaker constraint on the order of elements. It doesn't have to have sorted whole range of keys/priorities. Multimap, has to provide that. Priority queue only have to guarantee the 1st / top element to be largest.</li> </ol> So, while, the theoretical time complexities for the operations on both are the same <code>O(log(size))</code>, I would argue that <code>erase</code> from <code>multimap</code>, and rebalancing the RB-tree performs more operations, it simply has to move around more elements. (NOTE: RB-tree is not mandatory, but very often chosen as underlying container for <code>multimap</code>) <ol start="2"> <li>The underlying container of priority queue is contiguous in memory (it's a <code>vector</code> by default).</li> </ol> I suspect the rebalancing is also slower, because RB-tree relies on nodes (vs contiguous memory of vector), which makes it prone to cache misses, although one has to remember that operations on heap are not done in iterative manner, it is hopping through the vector. I guess to be really sure one would have to profile it. The above points are true for both insertions and erasues. I would say the difference is in the constant factors lost in the <code>big-O</code> notation. This is intuitive thinking.

c++ Why std::multimap is slower than std::priority_queue

Tags:

c++

std

priority-queue

multimap

I implemented an algorithm where I make use of an priority queue. I was motivated by this question: Transform a std::multimap into std::priority_queue

I am going to store up to 10 million elements with their specific priority value.

I then want to iterate until the queue is empty. Every time an element is retrieved it is also deleted from the queue.

After this I recalculate the elements pririty value, because of previous iterations it can change.

If the value did increase I am inserting the element againg into the queue. This happens more often dependent on the progress. (at the first 25% it does not happen, in the next 50% it does happen, in the last 25% it will happen multiple times).

After receiving the next element and not reinserting it, I am going to process it. This for I do not need the priority value of this element but the technical ID of this element.

This was the reason I intuitively had chosen a std::multimap to achieve this, using .begin() to get the first element, .insert() to insert it and .erase() to remove it. Also, I did not intuitively choose std::priority_queue directly because of other questions to this topic answering that std::priority_queue most likely is used for only single values and not for mapped values.

After reading the link above I reimplemented it using priority queue analogs to the other question from the link. My runtimes seem to be not that unequal (about an hour on 10 mio elements). Now I am wondering why std::priority_queue is faster at all.

I actually would expect to be the std::multimap faster because of the many reinsertions. Maybe the problem is that there are too many reorganizations of the multimap?

617

asked Jan 23 '17 13:01

Kaspatoo

2 Answers

To summarize: your runtime profile involves both removing and inserting elements from your abstract priority queue, with you trying to use both a std::priority_queue and a std::multimap as the actual implementation.

Both the insertion into a priority queue and into a multimap have roughly equivalent complexity: logarithmic.

However, there's a big difference with removing the next element from a multimap versus a priority queue. With a priority queue this is going to be a constant-complexity operation. The underlying container is a vector, and you're removing the last element from the vector, which is going to be mostly a nothing-burger.

But with a multimap you're removing the element from one of the extreme ends of the multimap.

The typical underlying implementation of a multimap is a balanced red/black tree. Repeated element removals from one of the extreme ends of a multimap has a good chance of skewing the tree, requiring frequent rebalancing of the entire tree. This is going to be an expensive operation.

This is likely to be the reason why you're seeing a noticeable performance difference.

147

answered Oct 12 '22 06:10

Sam Varshavchik

I think the main difference comes form two facts:

Priority queue has a weaker constraint on the order of elements. It doesn't have to have sorted whole range of keys/priorities. Multimap, has to provide that. Priority queue only have to guarantee the 1st / top element to be largest.

So, while, the theoretical time complexities for the operations on both are the same O(log(size)), I would argue that erase from multimap, and rebalancing the RB-tree performs more operations, it simply has to move around more elements. (NOTE: RB-tree is not mandatory, but very often chosen as underlying container for multimap)

The underlying container of priority queue is contiguous in memory (it's a vector by default).

I suspect the rebalancing is also slower, because RB-tree relies on nodes (vs contiguous memory of vector), which makes it prone to cache misses, although one has to remember that operations on heap are not done in iterative manner, it is hopping through the vector. I guess to be really sure one would have to profile it.

The above points are true for both insertions and erasues. I would say the difference is in the constant factors lost in the big-O notation. This is intuitive thinking.

answered Oct 12 '22 07:10

luk32

Related questions
                            
                                Alignment and size of C++ primitive types
                            
                                Using Likely() / Unlikely() Preprocessor Macros in if-else if chain
                            
                                Calling a static function on a template parameter in C++
                            
                                is a trivially copyable ::std::tuple-like class template possible? Does an implementation exist?
                            
                                c++ linker error LNK2005 already defined in SDL
                            
                                write_some vs write - boost asio
                            
                                valgrind: Unrecognised instruction at address 0x5111715
                            
                                Best way to read binary file c++ though input redirection
                            
                                QGridLayout: change height of a row
                            
                                How to convert a json object to a map with nlohmann::json?
                            
                                Why does my result data returned as void* gets broken?
                            
                                What does T::* signify in the declaration of a function parameter list?
                            
                                Linking against an ExternalProject_add dependency in CMAKE
                            
                                Comma operator with static_assert()
                            
                                Why can't I store my objects in an unordered_set?
                            
                                How is allocator-aware container assignment implemented?
                            
                                How to hide cursor in QML
                            
                                How can I know if I need to delete something in C++?
                            
                                Is overloading on all of the fundamental integer types is sufficient to capture all integers?
                            
                                Are shared_ptr on static objects good?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With