
Is there a statistical profiler for python? If not, how could I go about writing one?

I would need to run a python script for some random amount of time, pause it, get a stack traceback, and unpause it. I've googled around for a way to do this, but I see no obvious solution.

Asked Apr 11 '11 by shino

People also ask

How do you do profiling in python?

Python includes a built-in module called cProfile which is used to measure the execution time of a program. The cProfile module reports how long the program spends executing and how many times each function is called.
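
For reference, a minimal cProfile run might look something like this (slow_sum is just a made-up workload, not part of any library):

import cProfile

def slow_sum(n):
    total = 0
    for i in range(n):
        total += i * i
    return total

# Run the statement under the profiler and print per-function timings,
# sorted by cumulative time.
cProfile.run("slow_sum(1_000_000)", sort="cumulative")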

Which tool will Analyse the data collected by the python profiler?

Python profilers such as cProfile help find which parts of the program or code take the most time to run. This article walks through using the cProfile module to extract profiling data, the pstats module to report it, and snakeviz to visualize it.
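
A rough sketch of that workflow; my_workload and the file name profile.out are placeholders of your choosing:

import cProfile
import pstats

def my_workload():
    # stand-in for the code you actually want to profile
    return sum(i * i for i in range(1_000_000))

# Write raw profiling data to a file instead of printing it.
cProfile.run("my_workload()", "profile.out")

# Load the data with pstats and report the ten most expensive entries.
stats = pstats.Stats("profile.out")
stats.strip_dirs().sort_stats("cumulative").print_stats(10)

# For a visual view, snakeviz can read the same file:
#   pip install snakeviz
#   snakeviz profile.out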

How do you use line profiler in python?

The line_profiler test cases (found on GitHub) include an example of generating profile data from within a Python script. You wrap the function you want to profile and then call the wrapper, passing any desired arguments. Additional functions can be registered for profiling as well.
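
A minimal sketch of that wrapping pattern, assuming line_profiler is installed and hot_loop stands in for your own function:

from line_profiler import LineProfiler

def hot_loop(n):
    total = 0
    for i in range(n):
        total += i * i   # line-by-line timings show where the time really goes
    return total

profiler = LineProfiler()
wrapped = profiler(hot_loop)   # wrap the function you want to profile
wrapped(1_000_000)             # call the wrapper with the usual arguments
profiler.print_stats()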

How do you use Pyinstrument?

Call Pyinstrument directly from the command line. Instead of writing python script.py, type pyinstrument script.py. Your script will run as normal, and at the end (or when you press ^C), Pyinstrument will output a colored summary showing where most of the time was spent.
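
Pyinstrument can also be driven from inside a script; a small sketch, assuming a reasonably recent pyinstrument (the summed loop is just a placeholder workload):

from pyinstrument import Profiler

profiler = Profiler()
profiler.start()

total = sum(i * i for i in range(1_000_000))   # code being profiled

profiler.stop()
print(profiler.output_text(unicode=True, color=True))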


2 Answers

There's the statprof module

pip install statprof (or easy_install statprof), then to use:

import statprof

statprof.start()
try:
    my_questionable_function()
finally:
    statprof.stop()
    statprof.display()

There's a bit of background on the module from this blog post:

Why would this matter, though? Python already has two built-in profilers: lsprof and the long-deprecated hotshot. The trouble with lsprof is that it only tracks function calls. If you have a few hot loops within a function, lsprof is nearly worthless for figuring out which ones are actually important.

A few days ago, I found myself in exactly the situation in which lsprof fails: it was telling me that I had a hot function, but the function was unfamiliar to me, and long enough that it wasn’t immediately obvious where the problem was.

After a bit of begging on Twitter and Google+, someone pointed me at statprof. But there was a problem: although it was doing statistical sampling (yay!), it was only tracking the first line of a function when sampling (wtf!?). So I fixed that, spiffed up the documentation, and now it’s both usable and not misleading. Here’s an example of its output, locating the offending line in that hot function more accurately:

  %   cumulative      self          
 time    seconds   seconds  name    
 68.75      0.14      0.14  scmutil.py:546:revrange
  6.25      0.01      0.01  cmdutil.py:1006:walkchangerevs
  6.25      0.01      0.01  revlog.py:241:__init__
  [...blah blah blah...]
  0.00      0.01      0.00  util.py:237:__get__
---
Sample count: 16
Total time: 0.200000 seconds

I have uploaded statprof to the Python package index, so it’s almost trivial to install: "easy_install statprof" and you’re up and running.

Since the code is up on github, please feel welcome to contribute bug reports and improvements. Enjoy!

Answered Sep 27 '22 by dbr


I can think of a few ways to do this:

  • Rather than trying to get a stack trace while the program is running, just fire an interrupt at it, and parse the output (a rough sketch of this sampling idea appears after this list). You could do this with a shell script or with another python script that invokes your app as a subprocess. The basic idea is explained and rather thoroughly defended in this answer to a C++-specific question.

    • Actually, rather than having to parse the output, you could register a postmortem routine (using sys.excepthook) that logs the stack trace. Unfortunately, Python doesn't have any way to continue from the point at which an exception occurred, so you can't resume execution after logging.
  • In order to actually get a stack trace from a running program, you may have to hack the implementation. So if you really want to do that, it may be worth your time to check out pypy, a Python implementation written mostly in Python. I've no idea how convenient it would be to do this in pypy. I'm guessing that it wouldn't be particularly convenient, since it would involve introducing a hook into basically every instruction, which I think would be prohibitively inefficient. Also, I don't think there will be much advantage over the first option, unless it takes a very long time to reach the state where you want to start doing stack traces.

  • There exists a set of macros for the gdb debugger intended to facilitate debugging Python itself. gdb can attach to an external process (in this case the instance of python which is executing your application) and do, well, pretty much anything with it. It seems that the macro pystack will get you a backtrace of the Python stack at the current point of execution. I think it would be pretty easy to automate this procedure, since you can (at worst) just feed text into gdb using expect or whatever.
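
To make the first bullet concrete, here is a rough, Unix-only sketch of in-process statistical sampling with a timer signal. The names SAMPLES and run_workload are illustrative, not from any library, and a Python-level signal handler only runs between bytecodes, so this is only an approximation of what a tool like statprof does:

import collections
import signal
import traceback

SAMPLES = collections.Counter()

def _sample(signum, frame):
    # At each timer tick, record the innermost application frame.
    top = traceback.extract_stack(frame)[-1]
    SAMPLES[f"{top.filename}:{top.lineno}:{top.name}"] += 1

signal.signal(signal.SIGVTALRM, _sample)
signal.setitimer(signal.ITIMER_VIRTUAL, 0.01, 0.01)   # sample every 10 ms of CPU time

def run_workload():
    # stand-in for the program you want to profile
    total = 0
    for i in range(2_000_000):
        total += i * i
    return total

run_workload()

signal.setitimer(signal.ITIMER_VIRTUAL, 0)            # stop sampling
for location, count in SAMPLES.most_common(5):
    print(f"{count:5d}  {location}")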

Answered Sep 27 '22 by intuited