I had a really simple problem and got some strange results when I tried to figure out which solution is faster. The original Problem: Given two lists <code>ListA</code>, <code>ListB</code> and a constant <code>k</code>, remove all entries where the two lists sum to <code>k</code>. I solved the problem in two ways: First I tried using a loop and then I used list comprehension and <code>zip()</code> to zip and unzip the two lists. The version using a loop. <pre class="prettyprint lang-py prettyprint-override"><code>def Remove_entries_simple(listA, listB, k): """ removes entries that sum to k """ new_listA = [] new_listB = [] for index in range(len(listA)): if listA[index] + listB[index] == k: pass else: new_listA.append(listA[index]) new_listB.append(listB[index]) return(new_listA, new_listB) </code></pre> The version using list comprehension and <code>zip()</code> <pre class="prettyprint lang-py prettyprint-override"><code>def Remove_entries_zip(listA, listB, k): """ removes entries that sum to k using zip""" zip_lists = [(a, b) for (a, b) in zip(listA, listB) if not (a+b) == k] # unzip the lists new_listA, new_listB = zip(*zip_lists) return(list(new_listA), list(new_listB)) </code></pre> Then I tried to determine which approach is faster. But then I got what you see in figure below (x-axis: size of the lists, y-axis: average time to run it, 10**3 repetitions). For some reason the version using <code>zip()</code> always makes the same jump at somewhat the same position -- I ran it multiple times and on different machines. Can someone explain what could cause such an odd behavior? <img src="https://i.stack.imgur.com/d2QMN.png" alt="Plot comparing running times"> Update: The code I used to generate the plot. I used a function decorator to run each problem a 1000 times. import statements: <pre class="prettyprint"><code>import random import time import matplotlib.pyplot as plt </code></pre> The function decorator: <pre class="prettyprint"><code>def Repetition_Decorator(fun, Rep=10**2): ''' returns the average over Rep repetitions''' def Return_function(*args, **kwargs): Start_time = time.clock() for _ in range(Rep): fun(*args, **kwargs) return (time.clock() - Start_time)/Rep return Return_function </code></pre> The code to create the plots: <pre class="prettyprint"><code>Zippedizip = [] Loops = [] The_Number = 10 Size_list = list(range(10, 1000, 10)) Repeated_remove_loop = Repetition_Decorator(Remove_entries_simple, Rep=10**3) Repeated_remove_zip = Repetition_Decorator(Remove_entries_zip, Rep=10**3) for size in Size_list: ListA = [random.choice(range(10)) for _ in range(size)] ListB = [random.choice(range(10)) for _ in range(size)] Loops.append(Repeated_remove_loop(ListA, ListB, The_Number)) Zippedizip.append(Repeated_remove_zip(ListA, ListB, The_Number)) plt.xlabel('Size of List') plt.ylabel('Averaged time in seconds') plt.plot(Size_list, Loops, label="Using Loop") plt.plot(Size_list, Zippedizip, label="Zip") plt.legend(loc='upper left', shadow=False, fontsize='x-large') plt.show() </code></pre> Update-Update: thanks to kaya3 for pointing out the timeit module. To be as close as possible to my original code but also use the timeit module, I created a new function decorator that uses the timeit module for timing the code. The new decorator: <pre class="prettyprint"><code>def Repetition_Decorator_timeit(fun, Rep=10**2): """returns average over Rep repetitions with timeit""" def Return_function(*args, **kwargs): partial_fun = lambda: fun(*args, **kwargs) return timeit.timeit(partial_fun, number=Rep) / Rep return Return_function </code></pre> When I use the new decorator the version using the for loop is not affected but the <code>zip</code> version no longer makes the jump. <img src="https://i.stack.imgur.com/MLTkM.png" alt="enter image description here"> So far I feel quite certain that the jump is a result of how I measure the function rather than the function itself. But the jump is so distinct -- always at the same list size across different machines -- that it cannot be a fluke. Any ideas why exactly this jump happens? Update-Update-Update: It has something to do with the garbage collector, because if I disable the garbage collector with <code>gc.disable()</code>, both ways of measuring give the same result. <img src="https://i.stack.imgur.com/bnv93.png" alt="Garbage Collector disabled"> What did I learn here: Do not just measure the time of execution yourself. Use the <code>timeit</code> module for measuring performance of code snippets.

This seems to be an artifact of the way you have measured the running times. I do not know what causes your timing code to produce this effect, but the effect disappears when I use <code>timeit</code> to measure the running times instead. I'm using Python 3.6.2. I can consistently reproduce the effect using your timing code; I get the <code>zip</code> version's running time jumping at around the same threshold, though it is still slightly faster than the other version on my machine: <img src="https://i.stack.imgur.com/YHmcM.png" alt="Timed using clock"> However, when I measure the times using <code>timeit</code>, the effect disappears completely: <img src="https://i.stack.imgur.com/ve1Fk.png" alt="enter image description here"> Here's the code using <code>timeit</code>; I changed as little as possible from your analysis code. <pre class="prettyprint lang-py prettyprint-override"><code>import timeit Zippedizip = [] Loops = [] The_Number = 10 Size_list = list(range(10, 1000, 10)) Reps = 1000 for size in Size_list: ListA = [random.choice(range(10)) for _ in range(size)] ListB = [random.choice(range(10)) for _ in range(size)] remove_loop = lambda: Remove_entries_simple(ListA, ListB, The_Number) remove_zip = lambda: Remove_entries_zip(ListA, ListB, The_Number) Loops.append(timeit.timeit(remove_loop, number=Reps) / Reps) Zippedizip.append(timeit.timeit(remove_zip, number=Reps) / Reps) # ... </code></pre> So I think it's a spurious result. That said, I don't understand what's causing it in your timing code. I tried simplifying your timing code to not use a decorator or vargs, and I replaced <code>time.clock()</code> with <code>time.perf_counter()</code> which is more accurate, but that didn't change anything.

Strange performance results -- loop vs list comprehension and zip()

Tags:

performance

python

list-comprehension

I had a really simple problem and got some strange results when I tried to figure out which solution is faster.

The original Problem: Given two lists ListA, ListB and a constant k, remove all entries where the two lists sum to k.

I solved the problem in two ways: First I tried using a loop and then I used list comprehension and zip() to zip and unzip the two lists.

The version using a loop.

def Remove_entries_simple(listA, listB, k):
    """ removes entries that sum to k """
    new_listA = []
    new_listB = []
    for index in range(len(listA)):
        if listA[index] + listB[index] == k:
            pass
        else:
            new_listA.append(listA[index])
            new_listB.append(listB[index])
    return(new_listA, new_listB)

The version using list comprehension and zip()

def Remove_entries_zip(listA, listB, k):
    """ removes entries that sum to k using zip"""
    zip_lists = [(a, b) for (a, b) in zip(listA, listB) if not (a+b) == k]

    # unzip the lists
    new_listA, new_listB = zip(*zip_lists)
    return(list(new_listA), list(new_listB))

Then I tried to determine which approach is faster. But then I got what you see in figure below (x-axis: size of the lists, y-axis: average time to run it, 10**3 repetitions). For some reason the version using zip() always makes the same jump at somewhat the same position -- I ran it multiple times and on different machines. Can someone explain what could cause such an odd behavior?

Plot comparing running times

Update: The code I used to generate the plot. I used a function decorator to run each problem a 1000 times.

import statements:

import random
import time
import matplotlib.pyplot as plt

The function decorator:

def Repetition_Decorator(fun, Rep=10**2):
    ''' returns the average over Rep repetitions'''
    def Return_function(*args, **kwargs):
        Start_time = time.clock()
        for _ in range(Rep):
            fun(*args, **kwargs)
        return (time.clock() - Start_time)/Rep

return Return_function

The code to create the plots:

Zippedizip = []
Loops = []
The_Number = 10
Size_list = list(range(10, 1000, 10))

Repeated_remove_loop = Repetition_Decorator(Remove_entries_simple, Rep=10**3)
Repeated_remove_zip = Repetition_Decorator(Remove_entries_zip, Rep=10**3)

for size in Size_list:
    ListA = [random.choice(range(10)) for _ in range(size)]
    ListB = [random.choice(range(10)) for _ in range(size)]

    Loops.append(Repeated_remove_loop(ListA, ListB, The_Number))
    Zippedizip.append(Repeated_remove_zip(ListA, ListB, The_Number))

plt.xlabel('Size of List')
plt.ylabel('Averaged time in seconds')
plt.plot(Size_list, Loops, label="Using Loop")
plt.plot(Size_list, Zippedizip, label="Zip")
plt.legend(loc='upper left', shadow=False, fontsize='x-large')
plt.show()

Update-Update: thanks to kaya3 for pointing out the timeit module.

To be as close as possible to my original code but also use the timeit module, I created a new function decorator that uses the timeit module for timing the code.

The new decorator:

def Repetition_Decorator_timeit(fun, Rep=10**2):                                                                                   
"""returns average over Rep repetitions with timeit"""                                                                         
    def Return_function(*args, **kwargs):                                                                                          
        partial_fun = lambda: fun(*args, **kwargs)                                                                                 
        return timeit.timeit(partial_fun, number=Rep) / Rep                                                                        
return Return_function

When I use the new decorator the version using the for loop is not affected but the zip version no longer makes the jump.

enter image description here

So far I feel quite certain that the jump is a result of how I measure the function rather than the function itself. But the jump is so distinct -- always at the same list size across different machines -- that it cannot be a fluke. Any ideas why exactly this jump happens?

Update-Update-Update:

It has something to do with the garbage collector, because if I disable the garbage collector with gc.disable(), both ways of measuring give the same result.

Garbage Collector disabled

What did I learn here: Do not just measure the time of execution yourself. Use the timeit module for measuring performance of code snippets.

994

asked Nov 14 '19 23:11

oldmansaur

1 Answers

This seems to be an artifact of the way you have measured the running times. I do not know what causes your timing code to produce this effect, but the effect disappears when I use timeit to measure the running times instead. I'm using Python 3.6.2.

I can consistently reproduce the effect using your timing code; I get the zip version's running time jumping at around the same threshold, though it is still slightly faster than the other version on my machine:

Timed using clock

However, when I measure the times using timeit, the effect disappears completely:

enter image description here

Here's the code using timeit; I changed as little as possible from your analysis code.

import timeit

Zippedizip = []
Loops = []
The_Number = 10
Size_list = list(range(10, 1000, 10))
Reps = 1000

for size in Size_list:
    ListA = [random.choice(range(10)) for _ in range(size)]
    ListB = [random.choice(range(10)) for _ in range(size)]

    remove_loop = lambda: Remove_entries_simple(ListA, ListB, The_Number)
    remove_zip = lambda: Remove_entries_zip(ListA, ListB, The_Number)

    Loops.append(timeit.timeit(remove_loop, number=Reps) / Reps)
    Zippedizip.append(timeit.timeit(remove_zip, number=Reps) / Reps)

# ...

So I think it's a spurious result. That said, I don't understand what's causing it in your timing code. I tried simplifying your timing code to not use a decorator or vargs, and I replaced time.clock() with time.perf_counter() which is more accurate, but that didn't change anything.

126

answered Oct 24 '22 09:10

kaya3

Related questions
                            
                                Pandas dataframe to PostgreSQL table using psycopg2 without SQLAlchemy?
                            
                                Looking for a sequential pattern with condition
                            
                                How to specify the prior for scikit-learn's Gaussian process regression?
                            
                                Set Colorbar range with "contourf" in matplotlib
                            
                                Installing data_files in setup.py with pip install -e
                            
                                Memory leak with tf.data
                            
                                In pycharm ImportError: DLL load failed: The specified module could not be found. while importing facerecognition
                            
                                understand sklearn QuantileTransformer
                            
                                Image in Jupyter Notebook ipynb doesn't show up in GitHub private repo but the same code works with public repo
                            
                                How to fix "module 'tensorflow' has no attribute 'estimator' " error
                            
                                Connection was closed in the middle of operation when accesing database using Python
                            
                                Tensorflow: Modern way to load large data
                            
                                tqdm and numpy vectorize
                            
                                Since latest python version retains insertion order of dict,will the meaning of equality (==) change?
                            
                                Absolute paths after freezing with cx_freeze (Qt5 / PySide2 App)
                            
                                How to establish TLS session in python using PKCS11
                            
                                How to plot predicted values vs the true value?
                            
                                Stop/fail docker build if tests fail
                            
                                GradienTape convergence much slower than Keras.model.fit
                            
                                Is there an equivalent of `sum()` builtin which uses augmented assignment?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With