Compiling high-level language to machine code

After reading some answers on this site and looking at some other sources, I thought that a compiler converts a high-level language (C++, for example) directly to machine code, since the computer itself has no need for assembly. I assumed assembly output is produced only so that the user can inspect the code and have more control over it if needed.

But one of my lecture sheets says otherwise (see the slide below), so I would appreciate it if someone could explain further and tell me whether I am wrong or the slide is.

Slide

asked Jul 25 '14 21:07

Karim K.

3 Answers

Your slide is mostly wrong...

There is a 1-to-1 mapping between assembly and machine code. Assembly is a textual representation of the information, and machine code is a binary representation.

Some processors, however, support additional instructions. Which instructions appear in the generated assembly is still decided at compile time, not at run time. Generally speaking, the available instructions are determined by the processor in the system (Intel, AMD, TI, NVIDIA, etc.), not by the manufacturer you buy the whole system from.

answered Oct 16 '22 12:10

Bill Lynch

This slide is confusing bytecode with textual assembly. Assembly is a human-readable version of either bytecode or machine code. Machine code is what the hardware can run directly. Bytecode is low-level but generic, and is further compiled to machine code.

Some languages use bytecode, which is translated at run time into even lower-level machine code. One example is Java, where class files will sometimes be compiled to machine code as a run-time optimization. Another is CUDA, where each NVIDIA GPU has a different instruction set, but the CUDA compiler generates bytecode that the CUDA driver for each GPU can then translate.

Another possibility is that he is talking about how Intel processors translate machine code at run time into internal microcode and then run it. This is completely invisible to software, though, including the OS.

answered Oct 16 '22 14:10

tohava

The slide is badly wrong in many ways.

A greatly simplified version of what actually happens in the slide's example (compiling C++) would explain that there are four phases of compilation to produce an executable from a source code file:

Preprocessing
Compilation “proper”
Assembly
Linking

In the preprocessing phase, preprocessor directives such as #include and #define are fully expanded and comments are stripped by the preprocessor, creating “postprocessed” C++. The slide omits this entirely.

In the compilation “proper” phase, the postprocessed text from the previous phase is converted into assembly language by the compiler. It's unfortunate that we use the same term — compilation — for both the whole four-step procedure and this one step, but that's the way it is.

Contrary to the slide, assembly language statements are not “readable by the OS” nor are they converted to machine code at run-time. Rather, they are readable by the assembler, which does its job (next paragraph) at compile-time.

In the assembly phase, the assembly language statements from the previous phase are converted into object code (binary machine code instructions that the CPU understands, combined with metadata that the OS and the linker understand) by the assembler.

In the linking phase, the object code from the previous phase is linked with other object code files and common/system libraries to form an executable.

At runtime, the OS — in particular the loader — reads the executable into memory and performs run-time linking, where references to common/system libraries are resolved and those libraries are loaded into memory (if they're not already) so that your executable is able to use them.

A further error is that different brands of machine do not have their “own machine codes”. What determines which machine codes a machine understands is its CPU. If two machines have the same CPU (e.g. a Dell laptop and a Toshiba laptop with the same Intel i7-3610QM CPU), then they understand the same machine codes. Moreover, two CPUs with the same ISA (instruction set architecture) understand the same machine codes. Also, newer CPUs are generally backward-compatible with older CPUs in the same series. For example, a newer Intel i7 CPU understands all of the instructions that an older Intel Pentium 4 understands, but not vice versa.

Hopefully, I've struck a somewhat better balance between simplicity and correctness than the slide, above, which fails miserably.

answered Oct 16 '22 13:10

Emmet

Tags:

c++

assembly

compiler-construction

machine-code