Python multiprocess profiling

I'm struggling to figure out how to profile a simple multiprocess Python script:

    import multiprocessing
    import cProfile
    import time

    def worker(num):
        time.sleep(3)
        print('Worker:', num)

    if __name__ == '__main__':
        for i in range(5):
            p = multiprocessing.Process(target=worker, args=(i,))
            cProfile.run('p.start()', 'prof%d.prof' % i)

I'm starting 5 processes, so cProfile generates 5 different files. In each one I want to see that my function 'worker' takes approximately 3 seconds to run, but instead I only see what happens inside the 'start' method.

I would greatly appreciate it if somebody could explain this to me.

Update: Working example based on accepted answer:

    import multiprocessing
    import cProfile
    import time

    def test(num):
        time.sleep(3)
        print('Worker:', num)

    def worker(num):
        # Runs in the child process, so the profile covers test() itself.
        cProfile.runctx('test(num)', globals(), locals(), 'prof%d.prof' % num)

    if __name__ == '__main__':
        for i in range(5):
            p = multiprocessing.Process(target=worker, args=(i,))
            p.start()

371

asked Jun 14 '12 21:06

barmaley

2 Answers

You're profiling the process startup, which is why you only see what happens in p.start(), as you say. p.start() returns as soon as the subprocess is kicked off, so the profile never covers the three seconds spent in worker. You need to profile inside the worker function, which is what gets called in the subprocesses.
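One way to profile inside the worker without editing its body is to wrap it in a decorator. This is only a sketch under a few assumptions: the decorator name profiled, the PROFILE_DIR location, the delay parameter, and the PID-based file naming are all illustrative choices, not something from the question.

```python
import cProfile
import functools
import multiprocessing
import os
import tempfile
import time


def profiled(out_dir):
    """Hypothetical decorator factory: profile each call and dump stats to out_dir."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            profiler = cProfile.Profile()
            try:
                # runcall executes func under the profiler and returns its result.
                return profiler.runcall(func, *args, **kwargs)
            finally:
                # One file per process, keyed by PID so children do not overwrite each other.
                name = '%s_%d.prof' % (func.__name__, os.getpid())
                profiler.dump_stats(os.path.join(out_dir, name))
        return wrapper
    return decorator


PROFILE_DIR = tempfile.gettempdir()  # assumed output location for this sketch


@profiled(PROFILE_DIR)
def worker(num, delay=3):
    time.sleep(delay)
    print('Worker:', num)
    return num


if __name__ == '__main__':
    processes = [multiprocessing.Process(target=worker, args=(i,)) for i in range(5)]
    for p in processes:
        p.start()
    for p in processes:
        p.join()
    print('Profiles written to', PROFILE_DIR)
```

Because the wrapper runs in the child process, each dump covers the sleep inside worker rather than the cost of launching the process.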

160

answered Sep 23 '22 10:09

zigg

Having to change your source code just to profile it is not ideal. Here is what your code should look like:

    import multiprocessing
    import time

    def worker(num):
        time.sleep(3)
        print('Worker:', num)

    if __name__ == '__main__':
        processes = []
        for i in range(5):
            p = multiprocessing.Process(target=worker, args=(i,))
            p.start()
            processes.append(p)
        for p in processes:
            p.join()

I added join here so your main process will wait for your workers before quitting.

Instead of cProfile, try viztracer.

Install it with pip install viztracer, then use the multiprocess feature:

viztracer --log_multiprocess your_script.py

It will generate an HTML file showing every process on a timeline. (Use the W, A, S, and D keys to zoom and navigate.)

[Image: result of script]

Of course, this includes some information you may not be interested in (such as the internals of the multiprocessing library itself). If you are already satisfied with this, you are good to go. However, if you want a clearer graph showing only your function worker(), try the log_sparse feature.

First, decorate the function you want to log with @log_sparse:

    from viztracer import log_sparse

    @log_sparse
    def worker(num):
        time.sleep(3)
        print('Worker:', num)

Then run viztracer --log_multiprocess --log_sparse your_script.py

[Image: sparse log]

Only your worker function, taking 3s, will be displayed on the timeline.

answered Sep 24 '22 10:09

minker

Tags: python, multiprocessing, cprofile