Why does the classic implementation of Vector (ArrayList for Java people) double its internal array size on each expansion instead of tripling or quadrupling it?
By default, Vector doubles its capacity when it runs out of room. However, if a capacity increment is specified in its constructor, Vector grows by that fixed amount on each allocation cycle instead.
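For illustration, a minimal sketch of both growth modes using the real java.util.Vector API (the class name is mine; the printed capacities assume the stock implementation, whose default constructor starts at capacity 10):

```java
import java.util.Vector;

public class VectorGrowthDemo {
    public static void main(String[] args) {
        Vector<Integer> doubling = new Vector<>();      // default: capacity doubles when full
        Vector<Integer> stepped  = new Vector<>(10, 5); // fixed capacityIncrement of 5

        for (int i = 0; i < 100; i++) {
            doubling.add(i);
            stepped.add(i);
        }
        // Doubling: 10 -> 20 -> 40 -> 80 -> 160
        System.out.println(doubling.capacity()); // 160
        // Fixed increment: 10 -> 15 -> 20 -> ... -> 100
        System.out.println(stepped.capacity());  // 100
    }
}
```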
If you grow the array one element at a time, you end up with quadratic behaviour, because you copy all existing elements from an array of size n into one of size n+1, then n+2, and so on. Doubling reduces the total cost to linear time. This growth strategy is what gives you the so-called "amortized constant time" insertion.
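A stripped-down sketch of the doubling strategy (illustrative only, not the real ArrayList source):

```java
import java.util.Arrays;

/** Minimal growable int array illustrating the doubling strategy. */
class IntVector {
    private int[] data = new int[1];
    private int size = 0;

    void add(int value) {
        if (size == data.length) {
            // Doubling means each element is copied O(1) times on average,
            // so n appends cost O(n) total rather than O(n^2).
            data = Arrays.copyOf(data, data.length * 2);
        }
        data[size++] = value;
    }

    int get(int index) { return data[index]; }
    int size() { return size; }
}
```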
When calculating the average time to insert into a vector, you need to allow for the non-growing inserts and the growing inserts.
Call the total number of operations needed to insert n items o_total, and the average per insert o_average = o_total / n.
If you insert n items, growing by a factor of A whenever the array fills, there are

o_total = n + Σ A^i  for 0 < i < 1 + log_A(n)

operations. In the worst case you only use 1/A of the allocated storage.
Intuitively, A = 2 means the resize copies sum to at most about 2n, so at worst o_total ≈ 3n; o_average is therefore O(1), and in the worst case you use 50% of the allocated storage.
For a larger A, o_total is lower, but you waste more storage.
For a smaller A, o_total is larger, but you don't waste as much storage. As long as the capacity grows geometrically, insertion is still O(1) amortised; only the constant gets higher.
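To put numbers on the trade-off, here is a small simulation sketch (my own accounting: one operation per insert plus one per element copied at each resize; zero-filling is ignored for now):

```java
public class GrowthFactorCost {
    public static void main(String[] args) {
        int n = 400_000;
        for (double a : new double[] {1.25, 1.5, 2.0, 3.0, 4.0}) {
            long ops = 0;
            int capacity = 1;
            int size = 0;
            for (int i = 0; i < n; i++) {
                if (size == capacity) {
                    ops += size;  // copy the old array into the new one
                    capacity = Math.max(capacity + 1, (int) (capacity * a));
                }
                ops++;            // the insert itself
                size++;
            }
            System.out.printf("A=%.2f  ops/insert=%.2f  space used=%3.0f%%%n",
                    a, (double) ops / n, 100.0 * size / capacity);
        }
    }
}
```

Depending on where n lands relative to the last resize, ops-per-insert comes out between roughly 1 + 1/(A − 1) and 1 + A/(A − 1), i.e. between about 2 and 3 for A = 2, consistent with the formula above.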
For growth factors 1.25 (red), 1.5 (cyan), 2 (black), 3 (blue) and 4 (green), the graphs plot point and average size efficiency (ratio of size to allocated space; higher is better) on the left, and time efficiency (ratio of insertions to operations; higher is better) on the right, for inserting 400,000 items. All growth factors reach 100% space efficiency just prior to resizing; the A = 2 case shows time efficiency between 25% and 50% and space efficiency around 50%, which is a good balance for most cases.
For runtimes such as Java, arrays are zero-filled on allocation, so the number of operations needed to allocate an array is proportional to its size. Taking this into account reduces the difference between the time-efficiency estimates.
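One back-of-envelope way to fold this in (my assumption about the cost accounting, not a reading from the graphs): a resize to new capacity c now costs about c zeroing writes on top of the c/A element copies. With copies summing to roughly n/(A − 1) and zero-fills to roughly An/(A − 1), time efficiency for A = 2 falls from about n/2n = 50% toward n/4n = 25%, while for A = 4 it falls from about 75% to about 37%, so the gap between growth factors roughly halves.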
Any multiple is a compromise. Make it too big and you waste too much memory; make it too small and you waste too much time on reallocation and copying. I guess doubling is used because it works and is very easy to implement. I have also seen a proprietary STL-like library that uses 1.5 as the multiplier for the same job; presumably its developers considered doubling too wasteful of memory.