How to determine maximum stack usage in embedded system with gcc?

I'm writing the startup code for an embedded system -- the code that loads the initial stack pointer before jumping to the main() function -- and I need to tell it how many bytes of stack my application will use (or some larger, conservative estimate).

I've been told that gcc now has a -fstack-usage option and a -fcallgraph-info option that can somehow be used to statically calculate the exact "Maximum Stack Usage" for me ("Compile-time stack requirements analysis with GCC" by Botcazou, Comar, and Hainque).

Nigel Jones says that recursion is a really bad idea in embedded systems ("Computing your stack size" 2009), so I've been careful not to make any mutually recursive functions in this code.

Also, I make sure that none of my interrupt handlers ever re-enable interrupts until their final return-from-interrupt instruction, so I don't need to worry about re-entrant interrupt handlers.

Without recursion or re-entrant interrupt handlers, it should be possible to statically determine the maximum stack usage. (And so most of the answers to "How to determine maximum stack usage?" do not apply.) My understanding is that I (or preferably, some bit of code on my PC that is automatically run every time I rebuild the executable) first find the maximum stack depth for each interrupt handler when it's not interrupted by a higher-priority interrupt, and the maximum stack depth of the main() function when it is not interrupted. Then I add them all up to find the total (worst-case) maximum stack depth. That occurs (in my embedded system) when the main() background task is at its maximum depth when it is interrupted by the lowest-priority interrupt, and that interrupt is at its maximum depth when it is interrupted by the next-lowest-priority interrupt, and so on.
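The summation described above could be sketched as a small PC-side helper. This is only an illustration: the function name, the sample byte counts, and the `exception_frame` parameter are assumptions, not anything taken from a real build. The Cortex-M3 hardware stacks eight 32-bit registers on exception entry, so 32 bytes is used as the default per-interrupt overhead; check that figure against your core's configuration. The sketch also assumes one handler per priority level, as in the scenario described.

```python
# Hypothetical helper; all names and numbers here are illustrative assumptions.

def worst_case_stack(main_depth, isr_depths, exception_frame=32):
    """Sum main()'s depth with every ISR's depth plus a per-interrupt entry frame.

    main_depth      -- max bytes used by main() and its callees, uninterrupted
    isr_depths      -- max bytes used by each handler (one per priority level)
    exception_frame -- bytes the hardware pushes on interrupt entry (assumed)
    """
    return main_depth + sum(depth + exception_frame for depth in isr_depths)


if __name__ == "__main__":
    # Made-up figures: main() needs 200 bytes, three nested handlers
    # need 48, 64 and 24 bytes respectively.
    print(worst_case_stack(200, [48, 64, 24]))
```

If several handlers share one priority level they cannot preempt each other, so only the deepest of them would need to be passed in for that level.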

I'm using YAGARTO with gcc 4.6.0 to compile code for the LM3S1968 ARM Cortex-M3.

So how do I use the -fstack-usage option and -fcallgraph-info option with gcc to calculate the maximum stack depth? Or is there some better approach to determine maximum stack usage?

(See "How to determine maximum stack usage in embedded system?" for almost the same question targeted to the Keil compiler.)

asked Jun 17 '11 14:06

David Cary

2 Answers

GCC docs:

-fstack-usage

Makes the compiler output stack usage information for the program, on a per-function basis. The filename for the dump is made by appending .su to the auxname. auxname is generated from the name of the output file, if explicitly specified and it is not an executable, otherwise it is the basename of the source file. An entry is made up of three fields:

The name of the function.

A number of bytes.

One or more qualifiers: static, dynamic, bounded.

The qualifier static means that the function manipulates the stack statically: a fixed number of bytes are allocated for the frame on function entry and released on function exit; no stack adjustments are otherwise made in the function. The second field is this fixed number of bytes.

The qualifier dynamic means that the function manipulates the stack dynamically: in addition to the static allocation described above, stack adjustments are made in the body of the function, for example to push/pop arguments around function calls. If the qualifier bounded is also present, the amount of these adjustments is bounded at compile-time and the second field is an upper bound of the total amount of stack used by the function. If it is not present, the amount of these adjustments is not bounded at compile-time and the second field only represents the bounded part.
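As a rough illustration of the three fields described above, here is a minimal parser sketch. It assumes each line of the .su file is tab-separated, with the first field shaped like `file:line:col:function`; that layout, the `StackEntry` type, and the `parse_su` name are assumptions for this example, so compare against a .su file from your own toolchain before relying on it.

```python
from typing import Dict, NamedTuple, Set


class StackEntry(NamedTuple):
    function: str
    size: int             # bytes, second field of the .su line
    qualifiers: Set[str]  # subset of {"static", "dynamic", "bounded"}

    @property
    def is_upper_bound(self) -> bool:
        # Per the quoted docs, only "dynamic" without "bounded" means the
        # number covers just the bounded part of the frame.
        return "dynamic" not in self.qualifiers or "bounded" in self.qualifiers


def parse_su(text: str) -> Dict[str, StackEntry]:
    """Map function name -> StackEntry for the contents of one .su file."""
    entries = {}
    for line in text.splitlines():
        fields = line.strip().split("\t")
        if len(fields) != 3:
            continue  # skip blank or unexpected lines
        location, size, quals = fields
        # Assumed first-field layout: "file:line:col:function".
        name = location.rsplit(":", 1)[-1]
        entries[name] = StackEntry(name, int(size), set(quals.split(",")))
    return entries


if __name__ == "__main__":
    sample = (
        "main.c:12:5:main\t24\tstatic\n"
        "main.c:3:6:fill\t136\tdynamic,bounded\n"
        "main.c:8:6:vla\t16\tdynamic\n"
    )
    for entry in parse_su(sample).values():
        print(entry.function, entry.size, entry.is_upper_bound)
```

Splitting on the last colon is good enough for C; C++ names containing `::` would need a more careful split.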

I can't find any references to -fcallgraph-info.

You could potentially create the information you need from -fstack-usage and -fdump-tree-optimized.

For each leaf function in the -fdump-tree-optimized output, walk up through its callers and sum their stack sizes from -fstack-usage (keeping in mind that the reported number understates any function marked "dynamic" but not "bounded"). The maximum of these sums should be your maximum stack usage.
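A sketch of that summation, under the assumption that the call graph has already been extracted into a plain dict (the extraction step itself is not shown, and the `calls` and `usage` inputs are hypothetical). It walks from the root down rather than from each leaf up, which yields the same maximum, and it refuses to give an answer if it meets recursion.

```python
def max_stack_depth(root, calls, usage):
    """Return the largest sum of frame sizes on any call path starting at root.

    calls -- dict: function name -> list of callee names (hypothetical input)
    usage -- dict: function name -> bytes, e.g. taken from the .su files
    """
    memo = {}
    in_progress = set()

    def depth(fn):
        if fn in memo:
            return memo[fn]
        if fn in in_progress:
            raise ValueError(f"recursion detected at {fn!r}; no static bound")
        in_progress.add(fn)
        deepest_callee = max((depth(c) for c in calls.get(fn, ())), default=0)
        in_progress.remove(fn)
        # usage[fn] raises KeyError for functions with no stack-usage entry
        # (assembly, prebuilt libraries), so gaps fail loudly.
        memo[fn] = usage[fn] + deepest_callee
        return memo[fn]

    return depth(root)


if __name__ == "__main__":
    # Made-up call graph and frame sizes, for illustration only.
    calls = {"main": ["init", "loop"], "loop": ["read_adc", "log"], "log": ["format"]}
    usage = {"main": 16, "init": 8, "loop": 24, "read_adc": 40, "log": 32, "format": 96}
    print(max_stack_depth("main", calls, usage))
```

Calls through function pointers will not show up in a graph built this way, so they would have to be added to `calls` by hand.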

answered Sep 22 '22 16:09

τεκ

Just in case no one comes up with a better answer, I'll post what I had in the comment to your other question, even though I have no experience using these options and tools:

GCC 4.6 adds the -fstack-usage option which gives the stack usage statistics on a function-by-function basis.

If you combine this information with a call graph produced by cflow or a similar tool you can get the kind of stack depth analysis you're looking for (a script could probably be written pretty easily to do this). Have the script read the stack-usage info and load up a map of function names with the stack used by the function. Then have the script walk the cflow graph (which can be an easy-to-parse text tree), adding up the stack usage associated with each line for each branch in the call graph.
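Here is an untested-on-real-projects sketch of such a script. It assumes the call tree is an indented text listing in which each level adds four spaces and each line starts with `name()`, which is how I understand cflow's default output to look; the regular expression, the `deepest_branch` name, and the sample data are all assumptions. The `usage` map is taken as already loaded from the stack-usage files.

```python
import re

# Assumed line shape: N*4 spaces of indent, then "name()" and optional details.
LINE_RE = re.compile(r"^(?P<indent> *)(?P<name>\w+)\(\)")


def deepest_branch(tree_text, usage, indent_width=4):
    """Return (bytes, path) for the most expensive branch of the call tree."""
    path = []    # function names from the root down to the current line
    totals = []  # running stack total at each level of path
    best = (0, [])
    for line in tree_text.splitlines():
        m = LINE_RE.match(line)
        if not m:
            continue
        level = len(m.group("indent")) // indent_width
        # Discard everything at this level or deeper from the previous branch.
        del path[level:]
        del totals[level:]
        parent_total = totals[-1] if totals else 0
        path.append(m.group("name"))
        totals.append(parent_total + usage[m.group("name")])
        if totals[-1] > best[0]:
            best = (totals[-1], list(path))
    return best


if __name__ == "__main__":
    # Made-up tree and frame sizes, for illustration only.
    tree = (
        "main() <int main (void) at main.c:20>:\n"
        "    init() <void init (void) at main.c:3>\n"
        "    loop() <void loop (void) at main.c:10>:\n"
        "        read_adc() <int read_adc (void) at adc.c:5>\n"
        "        log_msg() <void log_msg (int) at log.c:9>:\n"
        "            format() <void format (int) at log.c:2>\n"
    )
    usage = {"main": 16, "init": 8, "loop": 24,
             "read_adc": 40, "log_msg": 32, "format": 96}
    print(deepest_branch(tree, usage))
```

Because frame sizes are never negative, tracking the best running total on every line gives the same answer as comparing only the leaf totals.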

So, it looks like this can be done with GCC, but you might have to cobble together the right set of tools.

answered Sep 19 '22 16:09

Michael Burr

Tags: gcc, embedded, code-analysis, static-analysis
