When I use <code>perf record</code> on my code, I find three choices for the <code>--call-graph</code> option: <code>lbr</code> (last branch record), <code>dwarf</code> and <code>fp</code>. What is the difference between these?

What do the perf record choices of LBR vs DWARF vs fp do?

1 Answer

The <code>--call-graph</code> option controls how call graphs (call chains), i.e. the function stack at each sample, are collected.

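The default, <code>fp</code>, uses frame pointers. This is very efficient but can be unreliable, particularly for optimized code, because compilers often omit the frame pointer when optimizing. Compiling with <code>-fno-omit-frame-pointer</code> ensures that frame pointers are available in your own code. Results for libraries may still vary, since they may have been built without frame pointers.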
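A minimal sketch of the frame-pointer workflow described above. The names <code>myapp.c</code> and <code>./myapp</code> are hypothetical placeholders, and the <code>-O2 -g</code> flags are only an illustrative choice; the commands run only if the files and tools are actually present.

```shell
# Hypothetical names: myapp.c and ./myapp stand in for your own program.
SRC=myapp.c
APP=./myapp

# Keep frame pointers even with optimization enabled; -g adds debug info for symbols.
FP_CFLAGS="-O2 -g -fno-omit-frame-pointer"

# Build only if the (hypothetical) source file and a C compiler exist.
if [ -f "$SRC" ] && command -v cc >/dev/null 2>&1; then
    # $FP_CFLAGS is left unquoted on purpose so it splits into separate flags.
    cc $FP_CFLAGS -o "$APP" "$SRC"
fi

# Record and report only if the binary and perf are available.
if [ -x "$APP" ] && command -v perf >/dev/null 2>&1; then
    # fp is the default method; naming it explicitly documents the intent.
    perf record --call-graph fp -- "$APP" &&
        perf report --stdio
fi
```

Libraries that were built without frame pointers may still show truncated chains in the report, even when your own code is compiled this way.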

With <code>dwarf</code>, <code>perf</code> collects and stores a chunk of the stack memory with each sample and unwinds it in post-processing. This can be very resource-intensive, and the stack depth is limited by the size of the saved chunk. The default chunk size is 8 KiB, but it can be configured.
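The chunk size can be passed as a second value after <code>dwarf</code>. The sketch below assumes a hypothetical binary <code>./myapp</code> and picks 16384 bytes purely for illustration; larger values capture deeper stacks at the cost of bigger <code>perf.data</code> files.

```shell
APP=./myapp              # hypothetical binary
STACK_DUMP_BYTES=16384   # illustrative value; twice the 8 KiB (8192 byte) default

# The dump size follows the method name, separated by a comma.
DWARF_MODE="dwarf,${STACK_DUMP_BYTES}"

# Record and report only if the binary and perf are available.
if [ -x "$APP" ] && command -v perf >/dev/null 2>&1; then
    perf record --call-graph "$DWARF_MODE" -- "$APP" &&
        perf report --stdio
fi
```

The binary should be built with debug information (for example <code>-g</code>) so that the unwinder can find the DWARF data it needs.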

<code>lbr</code> stands for last branch record. This is a hardware mechanism supported by Intel CPUs. It will probably offer the best performance, at the cost of portability. <code>lbr</code> is also limited to user-space functions.
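Because <code>lbr</code> depends on the hardware, a script may want to probe for it before relying on it. The following is a sketch under one assumption: that <code>perf record</code> exits with a non-zero status when it cannot set up LBR call stacks. The helper names <code>lbr_usable</code> and <code>choose_call_graph</code> and the binary <code>./myapp</code> are hypothetical.

```shell
APP=./myapp   # hypothetical binary

# Returns 0 if a trivial lbr recording succeeds on this machine.
# Assumption: perf exits non-zero when LBR call stacks are unavailable.
lbr_usable() {
    probe_dir=$(mktemp -d) || return 1
    perf record --call-graph lbr -o "$probe_dir/perf.data" -- true >/dev/null 2>&1
    rc=$?
    rm -rf "$probe_dir"
    return "$rc"
}

# Choose lbr when the probe works, otherwise fall back to the default fp.
choose_call_graph() {
    if command -v perf >/dev/null 2>&1 && lbr_usable; then
        echo lbr
    else
        echo fp
    fi
}

MODE=$(choose_call_graph)
echo "call-graph method: $MODE"

# Record and report only if the binary and perf are available.
if [ -x "$APP" ] && command -v perf >/dev/null 2>&1; then
    perf record --call-graph "$MODE" -- "$APP" &&
        perf report --stdio
fi
```

Falling back to <code>fp</code> keeps the script usable on machines without the hardware feature, although the resulting call chains are then subject to the frame-pointer caveats above.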

answered Sep 20 '22 06:09

Zulan

Tags:

linux

perf
