I am a little confused about the time complexity of the len() function.
I have read in many different posts that finding the length of an array in Python is O(1) with the len() function, and similarly for other languages.
How is this possible? Don't you have to iterate through the whole array to count how many indices it's taking up?
It's O(1) because this metadata is stored in the list object itself.
The runtime complexity of the len() function on a Python list is O(1): it takes constant time no matter how many elements are in the list. That's because len() doesn't count anything; the list object keeps its current length as a field alongside the (contiguous) array of element pointers, and len() simply reads that field back.
Do you not have to iterate through the whole array to count how many indices its taking up?
No, you do not.
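You can see the difference for yourself. The sketch below (standard library only) compares a hand-rolled O(n) count, which must walk every element, against len(), which just reads the stored size; the timing loop is illustrative rather than a rigorous benchmark:

```python
import timeit

def manual_count(seq):
    """O(n): walk the sequence and count elements one by one."""
    n = 0
    for _ in seq:
        n += 1
    return n

small = list(range(100))
big = list(range(1_000_000))

# Both agree on the answer...
assert manual_count(big) == len(big) == 1_000_000

# ...but len() takes roughly the same time regardless of size,
# while manual_count grows with the length of the list.
for name, lst in [("small", small), ("big", big)]:
    t_len = timeit.timeit(lambda: len(lst), number=10_000)
    t_cnt = timeit.timeit(lambda: manual_count(lst), number=10)
    print(f"{name}: len={t_len:.6f}s  manual_count={t_cnt:.6f}s")
```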
You can generally trade space for time when constructing algorithms.
For example, when creating a collection, allocate a separate variable to hold its size, then increment it when adding an item and decrement it when removing one.
Voilà: the size of the collection can then be obtained in O(1) time just by reading that variable.
And this appears to be what Python actually does, as per this page, which states (checking the Python source code shows that this is the action when requesting the size of a great many objects):

> `Py_SIZE(o)` — This macro is used to access the `ob_size` member of a Python object. It expands to `(((PyVarObject*)(o))->ob_size)`.
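As a purely illustrative (and CPython-specific) peek under the hood, you can mirror the `PyVarObject` header with ctypes and read `ob_size` directly. This assumes the standard 64-bit CPython object layout (one pointer-sized slot each for the refcount and type pointer) and is not something to use in real code:

```python
import ctypes

# Assumed mirror of CPython's PyVarObject header on a standard build:
# ob_refcnt and ob_type each occupy one pointer-sized slot, then ob_size.
class PyVarObject(ctypes.Structure):
    _fields_ = [
        ("ob_refcnt", ctypes.c_ssize_t),
        ("ob_type", ctypes.c_void_p),
        ("ob_size", ctypes.c_ssize_t),   # the field Py_SIZE(o) reads
    ]

lst = ["a", "b", "c", "d"]
header = PyVarObject.from_address(id(lst))  # in CPython, id() is the address
print(header.ob_size, len(lst))  # on a standard build, both report 4
```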
If you compare the two approaches (iterating vs. a length variable), the properties of each can be seen in the following table:
| Measurement | Iterate | Variable |
|---|---|---|
| Space needed | No extra space beyond the collection itself. | A tiny additional length field (4 bytes allows sizes up to about four billion). |
| Time taken | Iteration over the collection. Depends on collection size, so could be significant. | Extraction of the length is very quick. Changes to the list (addition or deletion) incur the slight extra expense of updating the length, but this is also tiny. |
In this case, the extra cost is minimal but the time saved for getting the length could be considerable, so it's probably worth it.
That's not always the case since, in some (rare) situations, the added space cost may outweigh the reduced time taken (or it may need more space than can be made available).
And, by way of example, this is what I'm talking about. Ignore the fact that it's totally unnecessary in Python; this is for a mythical Python-like language that has an O(n) cost for finding the length of a list:
```python
import random


class FastQueue:
    """ FastQueue: demonstration of length variable usage.
    """
    def __init__(self):
        """ Init: Empty list and set length zero.
        """
        self._content = []
        self._length = 0

    def push(self, item):
        """ Push: Add to end, increase length.
        """
        self._content.append(item)
        self._length += 1

    def pull(self):
        """ Pull: Remove from front, decrease length, taking
            care to handle empty queue.
        """
        item = None
        if self._length > 0:
            item = self._content[0]
            self._content = self._content[1:]
            self._length -= 1
        return item

    def length(self):
        """ Length: Just return stored length. Obviously this
            has no advantage in Python since that's
            how it already does length. This is just
            an illustration of my answer.
        """
        return self._length

    def slow_length(self):
        """ Length: A slower version for comparison.
        """
        list_len = 0
        for _ in self._content:
            list_len += 1
        return list_len


""" Test harness to ensure I haven't released buggy code :-)
"""
queue = FastQueue()
for _ in range(10):
    val = random.randint(1, 50)
    queue.push(val)
    print(f'push {val}, length = {queue.length()}')

for _ in range(11):
    print(f'pull {queue.pull()}, length = {queue.length()}')
```
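A side note on the sketch above: the slice `self._content[1:]` in `pull` makes each pull O(n). In real Python code you'd reach for `collections.deque`, which offers O(1) appends and pops at both ends, and whose `len()` is likewise O(1):

```python
from collections import deque

queue = deque()
for val in (3, 1, 4, 1, 5):
    queue.append(val)      # O(1) push to the back

print(len(queue))          # O(1) - prints 5
print(queue.popleft())     # O(1) pull from the front - prints 3
print(len(queue))          # prints 4
```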