Searching for a nice way to define rules for decompiler, need advice

Tags:

I am working on a very simple decompiler for MIPS architecture and as I progress I have to define lots of rules for code analysis, for example "if this opcode is lui and next opcode is addiu then return var = value" or "if this opcode is bne and it's referring to address before current - create loop definition in parsing tree". The problem - there are tons of such rules and I can't find a good way to define them. I've tried writing separated functions for every rule, defining nice OOP base logic classes and extending them to create rules, even tried regular expressions on disasmed code(to my surprise this works better than expected) but no matter what I've tried, my code soon became to big and to hard to read no matter how well I am trying to document and structure it.

This brings me to conclusion, that I am trying to solve this task by using wrong tools(not to mention being too stupid for such complex task :) ), but I have no real idea what should I try. Currently I have two untested ideas, one is using some kind of DSL(I have absolutely no experience in this, so I can be totally wrong), and another is writing some kind of binary regexp-like tools for opcode matching.

I hope someone can point me in correct direction, thx.

348

asked Jul 25 '10 23:07

Riz

1 Answers

I would guess that some of your rules are too low-level, and that's why they're becoming unmanageable.

Recognising lui followed by addiu as a 32-bit constant load certainly seems very reasonable; but trying to derive control flow from branch instructions at the individual opcode level seems rather more suspect - I think you want to be working with basic blocks there.

Cifuentes' Reverse Compilation Techniques is a reference which keeps cropping up in discussions of decompilation that I've seen; from a fairly brief skim, it seems like it would be well worth spending some time reading in detail for your project.

Some of the x86-specific stuff won't be relevant - in particular, the step which translates x86 to a low-level intermediate representation is probably not necessary for MIPS (MIPS is essentially just one basic operation per opcode already) - but otherwise much of the content looks like it should be very useful.

answered Sep 21 '22 01:09

Matthew Slattery

Related questions
                            
                                MIPS to C Translation
                            
                                How to find the minimum value of an array in MIPS
                            
                                MIPS Assembly Alignment Align n
                            
                                MIPS assembly - random integer range
                            
                                How to convert UTF-16 to ASCII
                            
                                How to convert from 4-bit hexadecimal to 7-bit ASCII?
                            
                                MIPS: Reading a string from command line argument
                            
                                How do I correctly use the mod operator in MIPS?
                            
                                How to distinguish between mips cpu types on linux when dpkg-architecture is absent?
                            
                                Loop through an array MIPS Assembly
                            
                                MIPS offsets with variables
                            
                                Is "muli" a MIPS instruction? Where is it defined?
                            
                                How to load memory address without using pseudo-instructions?
                            
                                Why are there (load byte unsigned) and (load byte) instructions in MIPS but only (store byte)?
                            
                                Use the floating point instructions to get results in decimal
                            
                                Instruction references undefined error in MIPS/QTSPIM
                            
                                MIPS linked list
                            
                                Attempt to execute non instruction

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Searching for a nice way to define rules for decompiler, need advice

Tags:

decompiling

control-flow

opcode

mips

disassembly

Riz

People also ask

1 Answers

Matthew Slattery

Recent Activity

Donate For Us