
Approximate sort (array/vector), predictable runtime

Background:

I need to process some hundred thousand events (producing results) given a hard time limit. The clock is literally ticking, and when the timer fires, whatever is done at that point must be flushed out.

What isn't ready by that time is either discarded (depending on an importance metric) or processed during the next time quantum (with an "importance boost", i.e. adding a constant to the importance metric).
Now ideally, the CPU is much faster than needed, and the whole set is ready long before the end of the time slice. Unluckily, the world is rarely ever ideal, and "hundreds of thousands" becomes "tens of millions" before you know it.

Events are added to the back of a queue (which is really a vector) as they come in, and are processed from the front during the respective next quantum (so the program always processes the last quantum's input).
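A minimal sketch of that double-buffered layout, assuming a hypothetical Event type carrying the small integer importance key described further down:

    #include <cstdint>
    #include <utility>
    #include <vector>

    // Hypothetical event type; "importance" is the small integer sort key
    // (typically 10..1000, plus a possible boost from a previous quantum).
    struct Event {
        std::uint32_t importance;
        // ... payload ...
    };

    // Double-buffered queue: producers push to the back of `incoming` while
    // the current quantum drains `current` from the front.
    struct EventQueue {
        std::vector<Event> incoming;  // filled during this quantum
        std::vector<Event> current;   // processed during this quantum

        // At the start of a quantum, swap buffers so that last quantum's
        // input becomes this quantum's work list.
        void begin_quantum() {
            current.clear();              // keeps its capacity
            std::swap(current, incoming);
        }
    };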

However, not all events are equally important. In case the available time is not sufficient, it would be preferable to drop unimportant events rather than important ones (this is not a strict requirement, since important events will be copied to the next time quantum's queue, but doing so further adds to the load, so it isn't a perfect solution).

The obvious thing to use would be, of course, a priority queue / heap. Unluckily, heapifying 100k elements isn't exactly a free operation either (nor is it parallel), and then I end up with objects in non-obvious and not necessarily cache-friendly memory locations, and pulling elements from a priority queue doesn't parallelize nicely.
What I would really like is something like a vector that is sorted, or at least "somewhat approximately sorted", which one can traverse sequentially afterwards. This would trivially allow me to create e.g. 12 threads (or any other number, one per CPU) that each process e.g. 1/64 of the range (or another size), slowly advancing from the front towards the end, and eventually dropping/postponing whatever is left over -- which will be events of little importance that can be discarded.
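For illustration, a sketch of that chunked front-to-back traversal, under the assumption that the vector is already (approximately) ordered by descending importance; the chunk size, thread count and the time_up flag are placeholders, and the process callback is assumed to be thread-safe:

    #include <algorithm>
    #include <atomic>
    #include <cstddef>
    #include <thread>
    #include <vector>

    // Process an approximately sorted vector in fixed-size chunks, front to
    // back. Each worker repeatedly claims the next chunk via an atomic
    // counter, so the (important) front elements are processed first and
    // whatever is left when time runs out is the unimportant tail.
    template <class T, class Fn>
    void process_chunked(std::vector<T>& items, Fn&& process,
                         std::size_t num_threads, std::size_t chunk_size,
                         const std::atomic<bool>& time_up) {
        std::atomic<std::size_t> next_chunk{0};
        std::vector<std::thread> workers;
        for (std::size_t t = 0; t < num_threads; ++t) {
            workers.emplace_back([&] {
                for (;;) {
                    std::size_t begin = next_chunk.fetch_add(chunk_size);
                    if (begin >= items.size() || time_up.load()) return;
                    std::size_t end = std::min(begin + chunk_size, items.size());
                    for (std::size_t i = begin; i < end; ++i) process(items[i]);
                }
            });
        }
        for (auto& w : workers) w.join();
    }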

Simply sorting the complete range using std::sort would be the easiest, most straightforward solution. However, the time it takes to sort items reduces the time available to actually process elements within the fixed time budget, and sorting time is for the most part single-CPU time (and parallel sort isn't that great either).
Also, doing a perfect sort (which isn't really needed) may hit worst-case complexity, whereas an approximate sort should ideally run at its optimum and have a very predictable cost.
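For reference, the full-sort baseline this refers to might look like the following (C++17 parallel execution policy, sorting the importance keys in descending order; depending on the standard library this may require linking an external backend such as TBB):

    #include <algorithm>
    #include <cstdint>
    #include <execution>
    #include <functional>
    #include <vector>

    // Baseline for comparison: a full parallel sort of the importance keys,
    // descending. Exact, but the sort time comes out of the same fixed
    // budget as the actual event processing.
    void full_sort(std::vector<std::uint32_t>& keys) {
        std::sort(std::execution::par_unseq, keys.begin(), keys.end(),
                  std::greater<std::uint32_t>());
    }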

tl;dr

So, what I'm looking for is a way to sort an array/vector only approximately, but fast, and with a predictable (or guaranteed) runtime.

The sort key would be a small integer typically between 10 and 1000. Being postponed to the next time quantum might increase ("priority boost") that value by a small amount, e.g. 100 or 200.

In a different question, where humans were supposed to do an approximate sort using "subjective compare"(?), shell sort was proposed. Judging from various sorting demo applets, it seems that at least for the "random shuffle" input typical in these, shell sort can indeed produce an "approximate sort" that doesn't look too bad after 3-4 passes over the data (and at least the read tap is strictly sequential). Unluckily, choosing gap values that work well seems to be somewhat of a black art, and runtime estimates seem to involve a lot of looking into the crystal ball as well.
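A possible reading of "shell sort with 3-4 passes" as code -- the halving gap sequence is purely illustrative, and stopping before the gap reaches 1 is what makes it approximate rather than exact:

    #include <cstddef>
    #include <utility>
    #include <vector>

    // Approximate shell sort: run only a fixed number of gapped
    // insertion-sort passes instead of shrinking the gap all the way to 1.
    // The read position advances strictly sequentially; the result is only
    // roughly ordered. Note that the insertion step within a pass is not
    // strictly bounded, so the cost is predictable in practice rather than
    // guaranteed.
    template <class T, class Cmp>
    void approximate_shell_sort(std::vector<T>& v, Cmp cmp, int passes = 4) {
        std::size_t gap = v.size() / 2;
        for (int p = 0; p < passes && gap > 0; ++p, gap /= 2) {
            for (std::size_t i = gap; i < v.size(); ++i) {
                T tmp = std::move(v[i]);
                std::size_t j = i;
                for (; j >= gap && cmp(tmp, v[j - gap]); j -= gap)
                    v[j] = std::move(v[j - gap]);
                v[j] = std::move(tmp);
            }
        }
    }

Called with a comparator that orders by descending importance (e.g. greater-than on the key), a few passes push the most important events towards the front.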

Comb sort with a relatively large shrink factor (such as 2 or 3?) seems tempting as well, since it visits memory strictly sequentially (on both taps) and is able to move badly misplaced elements a great distance quickly. Again, judging from sorting demo applets, it seems that 3-4 passes already give a rather reasonable "approximate sort".
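Sketched the same way, the comb sort variant is a fixed number of single sweeps, each with the gap divided by a (here illustrative) shrink factor of 3, so the per-pass cost is exactly one sequential sweep:

    #include <cstddef>
    #include <utility>
    #include <vector>

    // Approximate comb sort: a fixed number of strictly sequential passes,
    // starting with a large gap and dividing it by a large shrink factor
    // after each pass. Every pass is exactly one sweep of compare-and-swap
    // at distance `gap`, so the cost per pass is predictable; stopping
    // after a few passes gives an approximate ordering only.
    template <class T, class Cmp>
    void approximate_comb_sort(std::vector<T>& v, Cmp cmp,
                               int passes = 4, std::size_t shrink = 3) {
        std::size_t gap = v.size() / shrink;
        for (int p = 0; p < passes && gap > 0; ++p, gap /= shrink) {
            for (std::size_t i = 0; i + gap < v.size(); ++i)
                if (cmp(v[i + gap], v[i]))
                    std::swap(v[i], v[i + gap]);
        }
    }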

MSD radix sort comes to mind, though I am not sure how it would perform on typical 16/32-bit integers where most of the most significant bits are all zero. One would probably have to do an initial pass to find the most significant bit set anywhere in the whole set, followed by 2-3 actual sort passes?
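One way that idea could look, assuming a hypothetical Item holding the key (or an index into the real event array): a first linear pass finds the highest set bit over all keys, then a single counting/distribution pass buckets by the top 8 significant bits. Between buckets the order is exact, within a bucket it isn't -- an "approximate sort" in two predictable linear passes:

    #include <cstddef>
    #include <cstdint>
    #include <vector>

    // Hypothetical element: the importance key plus payload or an index.
    struct Item {
        std::uint32_t key;   // assumed small, e.g. 10..1000 plus boost
        // ... payload or index into the real event array ...
    };

    std::vector<Item> msd_bucket_pass(const std::vector<Item>& in) {
        // Pass 1: find the position of the highest bit set in any key,
        // so the leading all-zero bits are skipped.
        std::uint32_t all = 0;
        for (const Item& it : in) all |= it.key;
        int top_bit = 0;
        while ((all >> top_bit) > 1) ++top_bit;

        // Shift so the top 8 significant bits select one of 256 buckets.
        int shift = top_bit >= 7 ? top_bit - 7 : 0;

        // Pass 2: stable counting/distribution on those bits, with the
        // buckets laid out from highest key to lowest so the most
        // important items land at the front of the output.
        std::size_t count[256] = {};
        for (const Item& it : in) ++count[it.key >> shift];
        std::size_t offset[256];
        std::size_t sum = 0;
        for (int b = 255; b >= 0; --b) { offset[b] = sum; sum += count[b]; }

        std::vector<Item> out(in.size());
        for (const Item& it : in) out[offset[it.key >> shift]++] = it;
        return out;
    }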

Is there a better algorithm or a well-known working approach with one of the algorithms I mentioned?

asked Jan 29 '14 by Damon


1 Answer

What comes to mind is to iterate over the vector and, if some event is less important, not process it but put it aside. As soon as the entire vector has been read, have a look at the events put aside. Of course you can use several buckets with different priorities. And only store references there; you don't want to move megabytes of data. (Posted as an answer now, as requested by Damon.)
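A minimal C++ sketch of this approach, assuming a hypothetical Event type with an importance field; the threshold, the bucket count and the time_left check are placeholders:

    #include <cstddef>
    #include <cstdint>
    #include <vector>

    // Hypothetical Event with a small integer importance key.
    struct Event {
        std::uint32_t importance;
        // ... payload ...
    };

    // One pass over the events: process the important ones right away and
    // only remember *indices* of the less important ones in a handful of
    // coarse buckets (references, not copies -- nothing large is moved).
    // After the pass, drain the buckets from most to least important
    // until the time budget runs out.
    template <class Process>
    void process_with_buckets(std::vector<Event>& events, Process process,
                              std::uint32_t important_threshold,
                              bool (*time_left)()) {
        constexpr std::size_t kBuckets = 4;          // illustrative
        std::vector<std::size_t> deferred[kBuckets];

        for (std::size_t i = 0; i < events.size(); ++i) {
            if (events[i].importance >= important_threshold) {
                process(events[i]);
            } else {
                // importance < threshold, so the bucket index is < kBuckets
                std::size_t b =
                    events[i].importance * kBuckets / important_threshold;
                deferred[b].push_back(i);            // 0 = least important
            }
        }

        // Highest bucket first; whatever is left when time runs out is
        // dropped or postponed to the next quantum.
        for (std::size_t b = kBuckets; b-- > 0;)
            for (std::size_t i : deferred[b]) {
                if (!time_left()) return;
                process(events[i]);
            }
    }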

answered Sep 26 '22 by Ronald