Open Source Profiling Frameworks?

Have you ever wanted to test and quantitatively show whether your application performs better as a static or shared build, stripped or non-stripped, with or without UPX, gcc -O2 or gcc -O3, hash or btree, and so on? If so, this is the thread for you. There are hundreds of ways to tune an application, but how do we collect, organize, process, and visualize the consequences of each experiment?

I have been looking for several months for an open source application performance engineering/profiling framework, similar in concept to Mozilla's Perftastic, in which I can develop, build, test, and profile hundreds of incarnations of different tuning experiments.

Some requirements:

Platform

SUSE32 and SUSE64

Data Format

Very flexible, compact, simple, and hierarchical. There are several possibilities, including:

  • Custom CSV
  • RRD
  • Protocol Buffers
  • JSON
  • No XML. There is lots of data, and XML is too verbose
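As a point of comparison, here is a minimal sketch of what one compact, hierarchical experiment record could look like as a single JSON line. Every field name here is an assumption for illustration, not part of any existing schema:

```python
import json

# Hypothetical record for one profiling run; all field names are
# illustrative assumptions, not a real framework's schema.
record = {
    "experiment": "gcc-O3-static",
    "build": {"tag": "r1042", "gcc": "4.3.2", "cflags": "-O3 -static"},
    "metrics": {
        "wall_time_s": 12.7,
        "sys_time_s": 1.3,
        "max_rss_kb": 48212,
    },
}

# Compact separators keep one record per line and the file small.
line = json.dumps(record, separators=(",", ":"))
print(line)
```

One-record-per-line JSON keeps the format append-friendly for a long-running CI pipeline while staying trivially parseable.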

Data Acquisition

Flexible and customizable plugins. There is lots of data to collect from the application, including performance data from /proc, sys time, wall time, CPU utilization, memory profile, leaks, Valgrind logs, arena fragmentation, I/O, localhost sockets, binary size, open fds, etc., and some from the host system. My language of choice for this is Python, and I would develop these plugins to monitor and/or parse data in all the different formats and store them in the framework's data format.
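A collection plugin along these lines can be very small. The following is a sketch, assuming a Linux host (so /proc exists), of pulling a few memory fields from /proc/&lt;pid&gt;/status; the function name and the choice of fields are my own, not part of any framework:

```python
import os

def read_proc_status(pid):
    """Parse /proc/<pid>/status and return selected memory fields in kB.

    Linux-only sketch; the set of fields collected here is illustrative.
    """
    metrics = {}
    with open("/proc/%d/status" % pid) as f:
        for line in f:
            key, _, rest = line.partition(":")
            if key in ("VmRSS", "VmSize", "VmPeak"):
                metrics[key.lower()] = int(rest.split()[0])  # value is in kB
    return metrics

# Sample the current process as a stand-in for the application under test.
result = read_proc_status(os.getpid())
print(result)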

Tagging

All experiments would be tagged with data such as GCC version and compile options, platform, host, app options, experiment, and build tag.
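A tag set like that can be assembled mostly from the standard library. This is a hedged sketch; the function name, field names, and the idea of passing the GCC version in as a string (rather than shelling out to gcc) are all my assumptions:

```python
import platform
import socket
import time

def experiment_tags(experiment, build_tag, gcc_version, app_options):
    """Assemble the tag set attached to every experiment record.

    Field names are illustrative assumptions, not an existing schema.
    """
    return {
        "experiment": experiment,
        "build_tag": build_tag,
        "gcc": gcc_version,
        "platform": "%s-%s" % (platform.system(), platform.machine()),
        "host": socket.gethostname(),
        "app_options": app_options,
        "timestamp": int(time.time()),
    }

tags = experiment_tags("static-vs-shared", "r1042", "4.3.2", "-O3 -static")
print(sorted(tags))
```

Attaching the same tag dict to every record is what later makes comparative graphing by tag (build vs. build, platform vs. platform) possible.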

Graphing

History, Comparative, Hierarchical, Dynamic and Static.

  • The application builds are done by a custom CI server that has released a new app version several times per day for the last 3 years straight. This is why we need continuous trend analysis: when we add new features, fix bugs, or change build options, we want to automatically gather profiling data and see the trend. This is where generating the various static builds comes in.
  • For analysis, Mozilla's dynamic graphs are great for comparative graphing. It would be great to have comparative graphing between different tags: for example, compare N build versions, compare platforms, compare build options, etc.
  • We have a test suite of 3K tests; data will be gathered per test and rolled up at every level, from intra-test data to per test, to per tagged group, to the complete regression suite.
  • Possibilities include RRDtool, Orca, and Graphite
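If Graphite were the choice, feeding it is just a matter of emitting its plaintext protocol, one "path value timestamp" line per sample (normally sent to port 2003). A small sketch; the metric path layout here is a made-up example of encoding build tag and test name into the hierarchy:

```python
import time

def graphite_line(metric_path, value, timestamp=None):
    """Format one sample in Graphite's plaintext protocol:
    '<dotted.path> <value> <unix-timestamp>\n'.
    """
    if timestamp is None:
        timestamp = int(time.time())
    return "%s %s %d\n" % (metric_path, value, timestamp)

# Hypothetical path: suite prefix, build tag, test name, metric name.
line = graphite_line("perf.r1042.test_0042.wall_time_s", 12.7, 1224662400)
print(line.strip())
```

Because the path is hierarchical, grouping by build, by test, or by metric falls out of the naming scheme for free.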

Analysis on a grouping basis

  • Min
  • Max
  • Median
  • Avg
  • Standard Deviation
  • etc
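Those per-group summaries are cheap to compute with the standard library once records carry a group tag. A minimal sketch (the `(tag, value)` input shape is my assumption):

```python
import statistics
from collections import defaultdict

def group_stats(samples):
    """samples: iterable of (group_tag, value) pairs.

    Returns {tag: {min, max, median, avg, stdev}} per group.
    """
    groups = defaultdict(list)
    for tag, value in samples:
        groups[tag].append(value)
    return {
        tag: {
            "min": min(vals),
            "max": max(vals),
            "median": statistics.median(vals),
            "avg": statistics.mean(vals),
            # Sample stdev needs at least two points; report 0.0 otherwise.
            "stdev": statistics.stdev(vals) if len(vals) > 1 else 0.0,
        }
        for tag, vals in groups.items()
    }

stats = group_stats([("gcc-O2", 12.1), ("gcc-O2", 12.5), ("gcc-O3", 11.0)])
print(stats["gcc-O2"])
```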

Presentation

All of this would be presented and controlled through a web app server; Django or TurboGears would be best.

Inspiration

  • Centreon
  • Cacti
Gregory asked Oct 22 '08 07:10



1 Answer

There was a talk at PyCon this week discussing the various profiling methods available for Python today. I don't think anything is as complete as what you're looking for, but it may be worth a look: http://us.pycon.org/2009/conference/schedule/event/15/

You should be able to find the actual talk later this week on blip.tv: http://blip.tv/search?q=pycon&x=0&y=0

PKKid answered Oct 13 '22 03:10