
Why does separating my module into multiple files make it slower?

I made a Python module (swood) that, up until recently, was one large file with many classes. After refactoring related classes into separate files, everything still works, albeit around 50% slower. I assumed that, if anything, it would get a little faster because Python could more efficiently cache the bytecode for each file, improving the startup time.

I am running this code with CPython (haven't tested with PyPy and its ilk). I've run line_profiler on the old and refactored versions and the percentage of processing time spent on each line looks roughly the same before and after the refactor.

Here are some things about my program that might have something to do with it:

  • It creates many small classes like Note, and instantiating them might be expensive, though this wasn't a problem before the refactor.
  • When creating these classes, it gets them from a separate file imported at the beginning.
  • A lot of numpy-based array manipulation happens in the part that takes longest (scaling and mixing audio).
  • I have a cache that stores the scaled notes if they are used more than three times in 7.5 seconds. (code)

What is causing my code to get slower after doing nothing but separating it into multiple files?
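The note cache in the last bullet could look roughly like the sketch below. This is a hypothetical reconstruction of a "cache if used 3+ times within 7.5 seconds" policy, not the actual swood code; all names (`NoteCache`, `scale_func`) are made up for illustration:

```python
import time
from collections import defaultdict


class NoteCache:
    """Hypothetical sketch: cache a computed value once it has been
    requested `threshold` times within a `window`-second span."""

    def __init__(self, threshold=3, window=7.5):
        self.threshold = threshold
        self.window = window
        self.requests = defaultdict(list)  # key -> recent request timestamps
        self.cache = {}                    # key -> cached scaled note

    def get(self, key, scale_func):
        if key in self.cache:
            return self.cache[key]
        now = time.monotonic()
        # Record this request and drop requests older than the window.
        times = [t for t in self.requests[key] if now - t <= self.window]
        times.append(now)
        self.requests[key] = times
        result = scale_func(key)
        if len(times) >= self.threshold:
            self.cache[key] = result
        return result
```

A cache like this trades memory for repeated scaling work, which matters here because the scaling/mixing loop is the hot path.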

MilkeyMouse Avatar asked May 13 '16 06:05




1 Answer

After some more benchmarking, it turned out to be one of the things I suspected: accessing functions and classes from another module means an extra lookup for the Python interpreter, and that slight slowdown adds up in tight loops. The Python wiki covers this, too:

Avoiding dots...

Suppose you can't use map or a list comprehension? You may be stuck with the for loop. The for loop example has another inefficiency. Both newlist.append and word.upper are function references that are reevaluated each time through the loop. The original loop can be replaced with:

upper = str.upper          # bind the method to a local name once
newlist = []
append = newlist.append    # avoids re-evaluating newlist.append every iteration
for word in oldlist:
    append(upper(word))
MilkeyMouse Avatar answered Oct 19 '22 23:10