Debug core file with no symbols

Tags: c, gdb, hp-ux

I have a C application that we have deployed to a customer's site. It was compiled and runs on HP-UX. The user has reported a crash and we have obtained a core dump. So far, I've been unable to duplicate the crash in house.

As you would suspect, the core file/deployed executable is completely devoid of any sort of symbols. When I load it up in gdb and do a bt, the best I get is this:

(gdb) bt
#0  0xc0199470 in ?? ()

I can do a 'strings core' on the file, but my understanding is that all I get there is all the strings in the executable, so it seems semi-impossible to track down anything there.

I do have a debug version (compiled with -g) of the executable, which is unfortunately a couple of months newer than the released version. If I try to start gdb with that build, I see this:

warning: exec file is newer than core file.
Core was generated by `program_name'.
Program terminated with signal 11, Segmentation fault.
__dld_list is not valid according to __dld_flags.

#0  0xc0199470 in ?? ()
(gdb) bt
#0  0xc0199470 in ?? ()

While it would be feasible to compile a debug version and deploy it at the customer's site and then wait for another crash, it would be relatively difficult and undesirable for a number of reasons.

I am quite familiar with the code and have a relatively good idea of where in code it is crashing based on the customer's bug report.

Is there ANY way I can glean any more information from this core dump? Via strings or another debugger or anything? Thanks.

asked Jun 26 '09 18:06

Morinar

1 Answer

This type of response from gdb:

(gdb) bt
#0  0xc0199470 in ?? ()

can also appear when the stack has been smashed by a buffer overrun: the return address saved on the stack is overwritten, so when the function returns, the program counter jumps to a seemingly random address.

This is one way that even a build with a matching symbol table can still fail symbol lookup or produce strange-looking backtraces. If you still see this output after loading matching symbols, the likely cause is that your customer's data is triggering a memory corruption bug, such as an overrun, in your code.

answered Sep 29 '22 19:09

Sufian
