I want to trace function/class executive order in scrapy framework. There are multiple *.py files across the default project, and I want to know which py file and class has been executed in order. It sound silly to put logger line in every class and function. How to visualize this order? cprofile is mainly used for measuring total time. I could also visualize the execution order inside one module, which is common question, but visualizing multiple modules are difficult. In terms of trace package, I did not find appropriate examples to work with large project like scrapy or django. Trace usage tutorial is about a single python file. I want to trace multiple *.py files in multiple modules in a large project, eg scrapy, instead of just one module. I am aware of debug tools like pdb, but I find it cumbersome to put break point across the whole project. More importantly, it is not easy to summarize the execution order. Finally I solved by using Hunter, which is better than build-in trace module. Trace module did not offer include_dir attribute. For those who are curiosity about how to trace all lines of scrapy. <pre class="prettyprint"><code>$PYTHONHUNTER='Q(module_startswith=["scrapy", "your_project"])' scrapy list </code></pre> In terms of django, tracing execution codes of rest_framework and save to test.log, for example: <pre class="prettyprint"><code>$PYTHONHUNTER='Q(module_startswith=["rest_framework", "your_project"]), action=CallPrinter(stream=open("test.log", "w"))' python manage.py runserver --noreload --nothreading </code></pre>

Well the best tool to trace function execution order is definitely viztracer. I would have to say that visualization is a huge factor when it comes to understanding a larger project. <img src="https://i.stack.imgur.com/D9XHQ.png" alt="enter image description here"> An interactive image like this makes it much easier to understand what's going on in your program, compared to cold terminal ascii. Also, it's a non-intrusive tool, which means you don't need to write a single line of code. Just install it and run your program with it. <pre class="prettyprint"><code>pip install viztracer viztracer your_script.py </code></pre> Another important factor here is that viztracer supports multi-thread and multi-process, and can visualize them in separate signals, on the same timeline, which you'll never achieve with terminal display.

<h3>trace</h3> <blockquote> The trace module allows you to trace program execution, generate annotated statement coverage listings, print caller/callee relationships and list functions executed during a program run. It can be used in another program or from the command line. </blockquote> <pre class="prettyprint"><code>python -m trace --count -C . somefile.py ... </code></pre> The above will execute <code>somefile.py</code> and generate annotated listings of all Python modules imported during the execution into the current directory. <h3>PDB</h3> <blockquote> The module pdb defines an interactive source code debugger for Python programs. It supports setting (conditional) breakpoints and single stepping at the source line level, inspection of stack frames, source code listing, and evaluation of arbitrary Python code in the context of any stack frame. It also supports post-mortem debugging and can be called under program control. </blockquote> Most Common Used Command: w(here) <ul> <li>Print a stack trace, with the most recent frame at the bottom. An arrow indicates the current frame, which determines the context of most commands.</li> </ul> d(own) <ul> <li>Move the current frame one level down in the stack trace (to a newer frame).</li> </ul> u(p) <ul> <li>Move the current frame one level up in the stack trace (to an older frame).</li> </ul> You can also check this question Python debugging tips <h3>Coverage</h3> <blockquote> Coverage.py measures code coverage, typically during test execution. It uses the code analysis tools and tracing hooks provided in the Python standard library to determine which lines are executable, and which have been executed. </blockquote> <h3>Hunter</h3> <blockquote> Hunter is a flexible code tracing toolkit, not for measuring coverage, but for debugging, logging, inspection and other nefarious purposes. </blockquote> The default action is to just print the code being executed. Example: <pre class="prettyprint"><code>import hunter hunter.trace(module='posixpath') import os os.path.join('a', 'b') </code></pre> Result in terminal: <img src="https://i.stack.imgur.com/axWto.png" alt="Hunter Result in terminal">

python: How to trace function execution order in large project

Tags:

python

visualization

profile

execution

trace

I want to trace function/class executive order in scrapy framework. There are multiple *.py files across the default project, and I want to know which py file and class has been executed in order. It sound silly to put logger line in every class and function. How to visualize this order?

cprofile is mainly used for measuring total time. I could also visualize the execution order inside one module, which is common question, but visualizing multiple modules are difficult.

In terms of trace package, I did not find appropriate examples to work with large project like scrapy or django. Trace usage tutorial is about a single python file.

I want to trace multiple *.py files in multiple modules in a large project, eg scrapy, instead of just one module.

I am aware of debug tools like pdb, but I find it cumbersome to put break point across the whole project. More importantly, it is not easy to summarize the execution order.

Finally I solved by using Hunter, which is better than build-in trace module. Trace module did not offer include_dir attribute.

For those who are curiosity about how to trace all lines of scrapy.

$PYTHONHUNTER='Q(module_startswith=["scrapy", "your_project"])' scrapy list

In terms of django, tracing execution codes of rest_framework and save to test.log, for example:

$PYTHONHUNTER='Q(module_startswith=["rest_framework", "your_project"]), action=CallPrinter(stream=open("test.log", "w"))' python manage.py runserver --noreload --nothreading

534

asked May 28 '18 03:05

anonymous

2 Answers

Well the best tool to trace function execution order is definitely viztracer. I would have to say that visualization is a huge factor when it comes to understanding a larger project.

enter image description here

An interactive image like this makes it much easier to understand what's going on in your program, compared to cold terminal ascii.

Also, it's a non-intrusive tool, which means you don't need to write a single line of code. Just install it and run your program with it.

pip install viztracer
viztracer your_script.py

Another important factor here is that viztracer supports multi-thread and multi-process, and can visualize them in separate signals, on the same timeline, which you'll never achieve with terminal display.

163

answered Nov 09 '22 23:11

minker

trace

The trace module allows you to trace program execution, generate annotated statement coverage listings, print caller/callee relationships and list functions executed during a program run. It can be used in another program or from the command line.

python -m trace --count -C . somefile.py ...

The above will execute somefile.py and generate annotated listings of all Python modules imported during the execution into the current directory.

PDB

The module pdb defines an interactive source code debugger for Python programs. It supports setting (conditional) breakpoints and single stepping at the source line level, inspection of stack frames, source code listing, and evaluation of arbitrary Python code in the context of any stack frame. It also supports post-mortem debugging and can be called under program control.

Most Common Used Command:

w(here)

Print a stack trace, with the most recent frame at the bottom. An arrow indicates the current frame, which determines the context of most commands.

d(own)

Move the current frame one level down in the stack trace (to a newer frame).

u(p)

Move the current frame one level up in the stack trace (to an older frame).

You can also check this question Python debugging tips

Coverage

Coverage.py measures code coverage, typically during test execution. It uses the code analysis tools and tracing hooks provided in the Python standard library to determine which lines are executable, and which have been executed.

Hunter

Hunter is a flexible code tracing toolkit, not for measuring coverage, but for debugging, logging, inspection and other nefarious purposes.

The default action is to just print the code being executed. Example:

import hunter
hunter.trace(module='posixpath')

import os
os.path.join('a', 'b')

Result in terminal: Hunter Result in terminal

answered Nov 09 '22 21:11

H.Tibat

Related questions
                            
                                How is pandas groupby method actually working?
                            
                                How to count objects in Tensorflow Object Detection API
                            
                                Best practices for writing argparse parsers
                            
                                What does "splitter" attribute in sklearn's DecisionTreeClassifier do?
                            
                                Python PostgreSQL COPY command used to INSERT or UPDATE (not just INSERT)
                            
                                Matplotlib mathtext: Glyph errors in tick labels
                            
                                File association not found for extension .py
                            
                                matplotlib toolbar in a pyqt5 application
                            
                                Running collectstatic on server : AttributeError: 'PosixPath' object has no attribute 'startswith'
                            
                                Can you stop PyCharm from automatically closing script files when you click out of the program?
                            
                                Pearson correlation and nan values
                            
                                Django max similarity (TrigramSimilarity) from ManyToManyField
                            
                                pandas plotting - x axis gets transformed to floats
                            
                                How does await give back control to the event loop during coroutine chaining?
                            
                                Python pandas: concat vertical and horizontal
                            
                                Manager / Container class, how to?
                            
                                Selenium with chromedriver doesn't start via cron
                            
                                Difference between setRootPath and setRootIndex in QFileSystemModel
                            
                                How can I attach documentation to members of a python enum?
                            
                                Shopify API Python Multiple Pictures upload with Python API

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python: How to trace function execution order in large project

Tags:

python

visualization

profile

execution

trace

anonymous

People also ask

2 Answers

minker

trace

PDB

Coverage

Hunter

H.Tibat

Recent Activity

Donate For Us