How do you compare two files containing C code based on code structure, not merely textual differences?

I have two files containing C code which I wish to compare. I'm looking for a utility which will construct a syntax tree for each file, and compare the syntax trees, instead of merely comparing the text of the files. This way minor differences in formatting and style will be ignored. It would be nice to even be able to tell the comparison tool to ignore differences such as variable names, etc.

Correct me if I'm wrong, but diff doesn't have this capability. I'm an Ubuntu user. Thanks!

250

asked Nov 07 '10 05:11

Corey Jeffco

1 Answer

Our SD Smart Differencer does exactly what you want. It uses compiler-quality parsers to read source code and build ASTs for the two files you select. It then compares the trees guided by the syntax, so it doesn't get confused by whitespace, layout or comments. Because it normalizes the values of constants, it doesn't get confused by a change of radix or by how you expressed escape sequences!
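To make the idea concrete, here is a minimal Python sketch of a much simpler, token-level approximation. It is not how the SD Smart Differencer works internally (that tool builds real ASTs); it only shows why dropping comments and whitespace and normalizing integer constants by value makes a change of radix or layout disappear from the comparison. The name `tokens` and the simplified lexer are assumptions made for illustration; floating-point literals, integer suffixes, escape sequences and the preprocessor are not handled.

```python
import re

# Simplified C lexer (an assumption for this sketch, not a full C grammar).
TOKEN_RE = re.compile(r"""
    (?P<comment>/\*.*?\*/|//[^\n]*)
  | (?P<string>"(?:\\.|[^"\\])*"|'(?:\\.|[^'\\])*')
  | (?P<int>0[xX][0-9a-fA-F]+|\d+)
  | (?P<ident>[A-Za-z_]\w*)
  | (?P<op>->|\+\+|--|<<=|>>=|<<|>>|[-+*/%&|^=!<>]=|&&|\|\||[^\s\w])
  | (?P<ws>\s+)
""", re.VERBOSE | re.DOTALL)


def tokens(src):
    """Return (kind, text) pairs with comments and whitespace dropped
    and integer constants rewritten in decimal."""
    out = []
    for m in TOKEN_RE.finditer(src):
        kind, text = m.lastgroup, m.group()
        if kind in ("comment", "ws"):
            continue  # layout and comments carry no structure
        if kind == "int":
            if text[:2].lower() == "0x":
                value = int(text, 16)      # hexadecimal
            elif text.startswith("0") and len(text) > 1:
                value = int(text, 8)       # C octal literal
            else:
                value = int(text)          # decimal
            text = str(value)
        out.append((kind, text))
    return out


a = "int x = 0x10; /* sixteen */\nreturn x+1;"
b = "int   x=16;\n// note\nreturn x + 1;"
print(tokens(a) == tokens(b))  # True: only layout, comments and radix differ
```

Comparing the two token lists with an ordinary sequence diff would then report only real changes; a tree-based tool goes further by reporting them in terms of language constructs.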

The deltas are reported at the level of the language constructs (variable, expression, statement, declaration, function, ...) in terms of programmer intent (delete, insert, copy, move), complete with determining that an identifier has been renamed consistently throughout a changed block.
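As a rough illustration of what "renamed consistently" means, the following Python sketch checks whether two snippets differ only by a one-to-one renaming of identifiers. The function name `find_renames`, the abbreviated keyword set and the crude word-level tokenizer are all assumptions for this example; comments and string literals are not handled, and a real tool would do this on the syntax tree rather than on a flat token list.

```python
import re

# Abbreviated keyword list; a real tool would use the full set for its C dialect.
C_KEYWORDS = {
    "int", "char", "long", "short", "float", "double", "void", "unsigned",
    "signed", "return", "if", "else", "for", "while", "do", "switch", "case",
    "break", "continue", "struct", "union", "enum", "typedef", "static",
    "const", "sizeof",
}
WORD = re.compile(r"[A-Za-z_]\w*|\d+|[^\s\w]")


def is_identifier(tok):
    return (tok[0].isalpha() or tok[0] == "_") and tok not in C_KEYWORDS


def find_renames(old_src, new_src):
    """Return {old_name: new_name} if the snippets differ only by a
    consistent one-to-one identifier renaming, otherwise None."""
    old, new = WORD.findall(old_src), WORD.findall(new_src)
    if len(old) != len(new):
        return None
    forward, backward = {}, {}
    for a, b in zip(old, new):
        if not (is_identifier(a) and is_identifier(b)):
            if a != b:
                return None  # a non-identifier token changed
            continue
        # Each old name must map to exactly one new name, and vice versa.
        if forward.setdefault(a, b) != b or backward.setdefault(b, a) != a:
            return None
    return {a: b for a, b in forward.items() if a != b}


old = "int total = 0; for (i = 0; i < n; i++) total += a[i];"
new = "int sum = 0; for (k = 0; k < n; k++) sum += a[k];"
print(find_renames(old, new))
```

If one occurrence of `i` had been changed to a different name than the others, the mapping would stop being one-to-one and the function would return `None`, which is the case a plain textual diff cannot distinguish from a consistent rename.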

The SmartDifferencer has versions available for C (in a number of dialects; if you want compiler-accurate parsing, the language dialect matters) as well as for C++, Java, C#, JavaScript, COBOL, Python and many other languages.

If you want to understand how a set of files is related to one another, our SD CloneDR will accept a very large set of files and tell you what they have in common. It finds code that has been copy-paste-edited across the entire set. You don't have to tell it what to look for; it finds it automatically. Using ASTs (as above), it isn't fooled by whitespace changes or renames of identifiers. There are a bunch of sample clone detection reports for various languages at the web site.
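The general idea of finding copy-paste-edited code across a set of files can be sketched in Python as follows. This is a token-window approximation, not the AST-based method the answer describes: every identifier is replaced by a placeholder so renames do not matter, and identical runs of normalized tokens that occur in more than one file are reported. The names `normalize` and `find_clones`, the window size of 8 and the abbreviated keyword set are assumptions for this sketch.

```python
import re
from collections import defaultdict

# Abbreviated keyword list, assumed for illustration.
C_KEYWORDS = {
    "int", "char", "long", "short", "float", "double", "void", "unsigned",
    "return", "if", "else", "for", "while", "do", "switch", "case", "break",
    "continue", "struct", "union", "enum", "typedef", "static", "const",
}
WORD = re.compile(r"[A-Za-z_]\w*|\d+|[^\s\w]")
COMMENT = re.compile(r"/\*.*?\*/|//[^\n]*", re.DOTALL)


def normalize(src):
    """Strip comments and replace every identifier with the placeholder 'ID'."""
    out = []
    for tok in WORD.findall(COMMENT.sub(" ", src)):
        is_ident = (tok[0].isalpha() or tok[0] == "_") and tok not in C_KEYWORDS
        out.append("ID" if is_ident else tok)
    return out


def find_clones(files, window=8):
    """files maps a name to its source text. Returns lists of
    (name, token_index) locations whose normalized token windows are
    identical and occur in at least two different files."""
    seen = defaultdict(list)
    for name, src in files.items():
        toks = normalize(src)
        for i in range(len(toks) - window + 1):
            seen[tuple(toks[i:i + window])].append((name, i))
    return [locs for locs in seen.values()
            if len({name for name, _ in locs}) > 1]


files = {
    "a.c": "int f(int x) { return x * 2 + 1; }",
    "b.c": "/* copy */ int g(int y) { return y * 2 + 1; }",
    "c.c": 'void h(void) { puts("hi"); }',
}
for group in find_clones(files):
    print(group)
```

Fixed-size windows are the simplest possible choice; they miss clones with inserted or deleted statements, which is one reason tree-based detection is more robust.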

122

answered Oct 22 '22 16:10

Ira Baxter

Tags: c, linux, comparison, diff, ubuntu
