 

Why is the actual runtime for a larger search value smaller than for a lower search value in a sorted array?

I ran a linear search on an array containing all unique elements in the range [1, 10000], sorted in increasing order, for every search value from 1 to 10000, and plotted the runtime vs. search value graph as follows:

[plot: runtime vs. search value]

Upon closely analysing a zoomed-in version of the plot:

[plot: zoomed-in view of the runtime vs. search value graph]

I found that the runtime for some larger search values is smaller than that for some lower search values, and vice versa.

My best guess is that this phenomenon is related to how the CPU processes data through primary memory and its caches, but I don't have a firm, quantifiable explanation for it.

Any hint would be greatly appreciated.

PS: The code was written in C++ and executed on a Linux virtual machine with 4 vCPUs on Google Cloud. The runtime was measured using the C++ chrono library.
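The original benchmark code is not shown in the question, but a minimal sketch of the kind of measurement described (a sorted array of 1..10000, one timed linear search per key, durations taken with std::chrono) might look like the following; the timer choice and output format are assumptions:

```cpp
#include <chrono>
#include <cstdio>
#include <vector>

// Plain linear search: returns the index of `key`, or -1 if it is not found.
static int linear_search(const std::vector<int>& a, int key) {
    for (int i = 0; i < static_cast<int>(a.size()); ++i)
        if (a[i] == key) return i;
    return -1;
}

int main() {
    const int N = 10000;
    std::vector<int> a(N);
    for (int i = 0; i < N; ++i) a[i] = i + 1;      // sorted, unique values 1..10000

    long long checksum = 0;                         // keeps the searches from being optimized away

    // Time one search per key and print "key,nanoseconds" pairs for plotting.
    for (int key = 1; key <= N; ++key) {
        auto t0 = std::chrono::steady_clock::now();
        checksum += linear_search(a, key);
        auto t1 = std::chrono::steady_clock::now();
        auto ns = std::chrono::duration_cast<std::chrono::nanoseconds>(t1 - t0).count();
        std::printf("%d,%lld\n", key, static_cast<long long>(ns));
    }
    std::fprintf(stderr, "checksum=%lld\n", checksum);
    return 0;
}
```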

asked Mar 30 '20 by Deepak Tatyaji Ahire

People also ask

Which search algorithm is best for a sorted array?

If we know nothing about the distribution of key values, then binary search is the best algorithm available for searching a sorted array.

Is binary search always faster than linear search?

Binary search is faster than linear search except for small arrays. However, the array must be sorted first to be able to apply binary search. There are specialized data structures designed for fast searching, such as hash tables, that can be searched more efficiently than binary search.

In which of the following can we search for an element in less than or equal to O(log n) time?

An efficient solution can find the required element in O(log n) time. The idea is to use binary search.

Why is binary search on an array O(log n)?

The main reason why binary search (which requires sorted data in a data structure with O(1) random-access reads) is O(log N) is that for any given data set, we start by looking at the middle-most element; each comparison then discards half of the remaining elements, so the range shrinks to a single candidate after about log2(N) steps.
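For illustration, a textbook iterative binary search on a sorted array (not code from this question) that halves the remaining range on every iteration, which is where the O(log N) bound comes from:

```cpp
#include <vector>

// Classic iterative binary search on a sorted vector.
// Returns the index of `key`, or -1 if it is not present.
int binary_search_index(const std::vector<int>& a, int key) {
    int lo = 0, hi = static_cast<int>(a.size()) - 1;
    while (lo <= hi) {
        int mid = lo + (hi - lo) / 2;      // avoids overflow of (lo + hi)
        if (a[mid] == key)
            return mid;
        else if (a[mid] < key)
            lo = mid + 1;                  // key can only be in the right half
        else
            hi = mid - 1;                  // key can only be in the left half
    }
    return -1;
}
```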


1 Answer

CPU cache size depends on the CPU model, and there are several cache levels, so your experiment should take all of those factors into account. L1 cache is usually 8 KiB, which is about 5 times smaller than your 10,000-element array (roughly 40 KB for 4-byte ints). But I don't think this is caused by cache misses: L2 latency is about 100 ns, which is much smaller than the gap between the lowest line and the second one, about 5 µs. I suppose that second line (the cloud of points) comes from context switching. The longer the task, the more likely a context switch is to occur, which is why the cloud on the right side is thicker.
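If you want to check the actual cache sizes on your VM rather than assuming them, Linux exposes them under sysfs; a small sketch that reads them for cpu0 (the sysfs paths are standard on Linux, though what a cloud VM reports may differ from the physical host):

```cpp
#include <fstream>
#include <iostream>
#include <string>

// Print the level, type and size of each CPU cache the kernel reports for cpu0.
// The entries live under /sys/devices/system/cpu/cpu0/cache/indexN/.
int main() {
    for (int i = 0; ; ++i) {
        std::string base = "/sys/devices/system/cpu/cpu0/cache/index" + std::to_string(i) + "/";
        std::ifstream level(base + "level"), type(base + "type"), size(base + "size");
        if (!level) break;                 // no more cache indices
        std::string lv, tp, sz;
        std::getline(level, lv);
        std::getline(type, tp);
        std::getline(size, sz);
        std::cout << "L" << lv << " " << tp << ": " << sz << "\n";
    }
    return 0;
}
```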

Now for the zoomed-in figure. As Linux is not a real-time OS, its time measurement is not very reliable; IIRC its minimal reporting unit is a microsecond. Now, if a certain task takes exactly 15.45 microseconds, then the reported time depends on when the task started. If the task started exactly on a clock tick, the reported time would be 15 microseconds; if it started when the internal clock was 0.1 microseconds into a tick, you would get 16 microseconds. What you see on the graph is an analogue straight line quantized onto a discrete-valued axis. So the task duration you get is not the actual duration, but the real value plus the task's start offset within the microsecond (which is uniformly distributed, ~U[0,1]), all rounded to the closest integer value.
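This rounding model is easy to reproduce with a small simulation (a sketch, not the original measurement code; the 15.45 µs figure is just the example used above): a fixed true duration plus a uniformly distributed start offset within a microsecond, rounded to the nearest whole microsecond.

```cpp
#include <cmath>
#include <cstdio>
#include <random>

// Simulates the quantization effect: a task with a fixed "true" duration of
// 15.45 us gets reported as either 15 or 16 whole microseconds depending on
// where within a microsecond it happened to start.
int main() {
    const double true_duration_us = 15.45;                      // hypothetical fixed task duration
    std::mt19937 rng(42);
    std::uniform_real_distribution<double> start_offset(0.0, 1.0);  // ~U[0,1]

    int counts[2] = {0, 0};                                     // counts[0] -> 15 us, counts[1] -> 16 us
    const int trials = 100000;
    for (int i = 0; i < trials; ++i) {
        long long reported = std::llround(true_duration_us + start_offset(rng));
        ++counts[reported - 15];
    }
    std::printf("reported 15 us: %.1f%%\n", 100.0 * counts[0] / trials);
    std::printf("reported 16 us: %.1f%%\n", 100.0 * counts[1] / trials);
    return 0;
}
```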

answered Nov 10 '22 by igrinis