I have the following code which compiles fine with the gcc command <code>gcc ./example.c</code>. The program itself calls the function "add_two" which simply adds two integers. To use the intel syntax within the extended assembly instructions I need to switch at first to intel and than back to AT&T. According to the gcc documentation it is possible to switch to intel syntax entirely by using <code>gcc -masm=intel ./exmaple</code>. Whenever I try to compile it with the switch <code>-masm=intel</code> it won't compile and I don't understand why? I already tried to delete the instruction <code>.intel_syntax</code> but it still don't compile. <pre class="prettyprint"><code>#include <stdio.h> int add_two(int, int); int main(){ int src = 3; int dst = 5; printf("summe = %d \n", add_two(src, dst)); return 0; } int add_two(int src, int dst){ int sum; asm ( ".intel_syntax;" //switch to intel syntax "mov %0, %1;" "add %0, %2;" ".att_syntax;" //switch to at&t syntax : "=r" (sum) //output : "r" (src), "r" (dst) //input ); return sum; } </code></pre> The error message by compiling the above mentioned program with <code>gcc -masm=intel ./example.c</code> is: <pre class="prettyprint"><code>tmp/ccEQGI4U.s: Assembler messages: /tmp/ccEQGI4U.s:55: Error: junk `PTR [rbp-4]' after expression /tmp/ccEQGI4U.s:55: Error: too many memory references for `mov' /tmp/ccEQGI4U.s:56: Error: too many memory references for `mov' </code></pre>

Use <code>-masm=intel</code> and don't use any <code>.att_syntax</code> directives in your inline asm. This works with GCC and I think ICC, and with any constraints you use. Other methods don't. (See Can I use Intel syntax of x86 assembly with GCC? for a simple answer saying that; this answer explores exactly what goes wrong, including with clang 13 and earlier.) That also works in clang 14 and later. (Which isn't released yet but the patch is part of current trunk; see https://reviews.llvm.org/D113707). Clang 13 and earlier would always use AT&T syntax for inline asm, both in substituting operands and in assembling as <code>op src, dst</code>. But even worse, <code>clang -masm=intel</code> would do that even when taking the Intel side of an asm template using dialect-alternatives like <code>asm ("add {att | intel}</code>" : ... )`! <code>clang -masm=intel</code> did still control how it printed asm after its built-in assembler turned an <code>asm()</code> statement into some internal representation of the instruction. e.g. Godbolt showing clang13 <code>-masm=intel</code> turning <code>add %0, 1</code> as <code>add dword ptr [1], eax</code>, but clang trunk producing <code>add eax, 1</code>. Some of the rest of this answer talking about clang hasn't been updated for this new clang patch. Clang does support Intel-syntax inside MSVC-style asm-blocks, but that's terrible (no constraints so inputs / outputs have to go through memory. If you were hard-coding register names with clang, <code>-masm=intel</code> would be usable (or the equivalent <code>-mllvm --x86-asm-syntax=intel</code>). But it chokes on <code>mov %eax, 5</code> in Intel-syntax mode so you can't let <code>%0</code> expand to an AT&T-syntax register name. <hr> <code>-masm=intel</code> makes the compiler use <code>.intel_syntax noprefix</code> at the top of its asm output file, and use Intel-syntax when generating asm from C outside your inline-asm statement. Using <code>.att_syntax</code> at the bottom of your asm template breaks the compiler's asm, hence the error messages like <code>PTR [rbp-4]</code> looking like junk to the assembler (which is expecting AT&T syntax). The "too many operands for mov" is because in AT&T syntax, <code>mov eax, ebx</code> is a <code>mov</code> from a memory operand (with symbol name <code>eax</code>) to a memory operand (with symbol name <code>ebx</code>) <hr> Some people suggest using <code>.intel_syntax noprefix</code> and <code>.att_syntax prefix</code> around your asm template. That can sometimes work but it's problematic. And incompatible with the preferred method of <code>-masm=intel</code>. <h3>Problems with the "sandwich" method:</h3> When the compiler substitutes operands into your asm template, it will do so according to <code>-masm=</code>. This will always break for memory operands (the addressing-mode syntax is completely different). It will also break with clang even for registers. Clang's built-in assembler does not accept <code>%eax</code> as a register name in Intel-syntax mode, and doesn't accept <code>.intel_syntax prefix</code> (as opposed to the <code>noprefix</code> that's usually used with Intel-syntax). Consider this function: <pre class="prettyprint"><code>int foo(int x) { asm(".intel_syntax noprefix \n\t" "add %0, 1 \n\t" ".att_syntax" : "+r"(x) ); return x; } </code></pre> It assembles as follows with GCC (Godbolt): <pre class="prettyprint"><code> movl %edi, %eax .intel_syntax noprefix add %eax, 1 # AT&T register name in Intel syntax .att_syntax </code></pre> The sandwich method depends on GAS accepting <code>%eax</code> as a register name even in Intel-syntax mode. GAS from GNU Binutils does, but clang's built-in assembler doesn't. On a Mac, even using real GCC the asm output has to assemble with an <code>as</code> that's based on clang, not GNU Binutils. Using clang on that source code complains: <pre class="prettyprint"><code><source>:2:35: error: unknown token in expression asm(".intel_syntax noprefix \n\t" ^ <inline asm>:2:6: note: instantiated into assembly here add %eax, 1 ^ </code></pre> (The first line of the error message didn't handle the multi-line string literal very well. If you use <code>;</code> instead of <code>\n\t</code> and put everything on one line the clang error message works better but the source is a mess.) <hr> I didn't check what happens with <code>"ri"</code> constraints when the compiler picks an immediate; it will still decorate it with <code>$</code> but IDK if GAS silently ignores that, too, in Intel syntax mode. <hr> PS: your asm statement has a bug: you forgot an early-clobber on your output operand so nothing is stopping the compiler from picking the same register for the <code>%0</code> output and the <code>%2</code> input that you don't read until the 2nd instruction. Then <code>mov</code> will destroy an input. But using <code>mov</code> as the first or last instruction of an asm-template is usually also a missed-optimization bug. In this case you can and should just use <code>lea %0, [%1 + %2]</code> to let the compiler add with the result written to a 3rd register, non-destructively. Or just wrap the <code>add</code> instruction (using a <code>"+r"</code> operand and an <code>"r"</code>, and let the compiler worry about data movement.) If it had to load the value from memory anyway, it can put it in the right register so no <code>mov</code> is needed. <hr> PS: it's possible to write inline asm that works with <code>-masm=intel</code> or <code>att</code>, using GNU C inline asm dialect alternatives. e.g. <pre class="prettyprint"><code>void atomic_inc(int *p) { asm( "lock add{l $1, %0 | %0, 1}" : "+m" (*p) :: "memory" ); } </code></pre> compiles with <code>gcc -O2</code> (<code>-masm=att</code> is the default) to <pre class="prettyprint"><code>atomic_inc(int*): lock addl $1, (%rdi) ret </code></pre> Or with <code>-masm=intel</code> to: <pre class="prettyprint"><code>atomic_inc(int*): lock add DWORD PTR [rdi], 1 ret </code></pre> Notice that the <code>l</code> suffix is required for AT&T, and the <code>dword ptr</code> is required for intel, because memory, immediate doesn't imply an operand-size. And that the compiler filled in valid addressing-mode syntax for both cases. This works with clang, but only the AT&T version ever gets used.

Note that <code>-masm=</code> also affects the default inline assembler syntax: <blockquote> Output assembly instructions using selected dialect. Also affects which dialect is used for basic "asm" and extended "asm". Supported choices (in dialect order) are att or intel. The default is att. Darwin does not support intel. </blockquote> That means that your first <code>.intel_syntax</code> directive is superfluous and the final <code>.att_syntax</code> is wrong because your GCC call compiles C to Intel assembler code. IOW, either stick to <code>-masm=intel</code> or sandwich your inline Intel assembler code sections between <code>.intel_syntax noprefix</code> and <code>.att_syntax prefix</code> directives - but don't do both. Note that the sandwich method isn't compatible with all inline assembler constraints - e.g. a constraint that involves <code>m</code> (i.e. memory operand) would insert an operand in ATT syntax which would yield an error like 'Error: junk (%rbp) after expression'. In those cases you have to use <code>-masm=intel</code>.

How to set gcc to use intel syntax permanently?

Tags:

x86

gcc

assembly

intel-syntax

inline-assembly

I have the following code which compiles fine with the gcc command gcc ./example.c. The program itself calls the function "add_two" which simply adds two integers. To use the intel syntax within the extended assembly instructions I need to switch at first to intel and than back to AT&T. According to the gcc documentation it is possible to switch to intel syntax entirely by using gcc -masm=intel ./exmaple.

Whenever I try to compile it with the switch -masm=intel it won't compile and I don't understand why? I already tried to delete the instruction .intel_syntax but it still don't compile.

#include <stdio.h>

int add_two(int, int);

int main(){
     int src = 3;
     int dst = 5;
     printf("summe = %d \n", add_two(src, dst));
     return 0;
}

int add_two(int src, int dst){

    int sum;

    asm (
        ".intel_syntax;"  //switch to intel syntax
        "mov %0, %1;"
        "add %0, %2;"

        ".att_syntax;"  //switch to at&t syntax
        : "=r" (sum) //output
        : "r" (src), "r" (dst) //input
    );

    return sum;
}

The error message by compiling the above mentioned program with gcc -masm=intel ./example.c is:

tmp/ccEQGI4U.s: Assembler messages:
/tmp/ccEQGI4U.s:55: Error: junk `PTR [rbp-4]' after expression
/tmp/ccEQGI4U.s:55: Error: too many memory references for `mov'
/tmp/ccEQGI4U.s:56: Error: too many memory references for `mov'

647

asked Aug 15 '16 11:08

Dennis

2 Answers

Use -masm=intel and don't use any .att_syntax directives in your inline asm. This works with GCC and I think ICC, and with any constraints you use. Other methods don't. (See Can I use Intel syntax of x86 assembly with GCC? for a simple answer saying that; this answer explores exactly what goes wrong, including with clang 13 and earlier.)

That also works in clang 14 and later. (Which isn't released yet but the patch is part of current trunk; see https://reviews.llvm.org/D113707).

Clang 13 and earlier would always use AT&T syntax for inline asm, both in substituting operands and in assembling as op src, dst. But even worse, clang -masm=intel would do that even when taking the Intel side of an asm template using dialect-alternatives like asm ("add {att | intel}" : ... )`!

clang -masm=intel did still control how it printed asm after its built-in assembler turned an asm() statement into some internal representation of the instruction. e.g. Godbolt showing clang13 -masm=intel turning add %0, 1 as add dword ptr [1], eax, but clang trunk producing add eax, 1.

Some of the rest of this answer talking about clang hasn't been updated for this new clang patch.

Clang does support Intel-syntax inside MSVC-style asm-blocks, but that's terrible (no constraints so inputs / outputs have to go through memory.

If you were hard-coding register names with clang, -masm=intel would be usable (or the equivalent -mllvm --x86-asm-syntax=intel). But it chokes on mov %eax, 5 in Intel-syntax mode so you can't let %0 expand to an AT&T-syntax register name.

-masm=intel makes the compiler use .intel_syntax noprefix at the top of its asm output file, and use Intel-syntax when generating asm from C outside your inline-asm statement. Using .att_syntax at the bottom of your asm template breaks the compiler's asm, hence the error messages like PTR [rbp-4] looking like junk to the assembler (which is expecting AT&T syntax).

The "too many operands for mov" is because in AT&T syntax, mov eax, ebx is a mov from a memory operand (with symbol name eax) to a memory operand (with symbol name ebx)

Some people suggest using .intel_syntax noprefix and .att_syntax prefix around your asm template. That can sometimes work but it's problematic. And incompatible with the preferred method of -masm=intel.

Problems with the "sandwich" method:

When the compiler substitutes operands into your asm template, it will do so according to -masm=. This will always break for memory operands (the addressing-mode syntax is completely different).

It will also break with clang even for registers. Clang's built-in assembler does not accept %eax as a register name in Intel-syntax mode, and doesn't accept .intel_syntax prefix (as opposed to the noprefix that's usually used with Intel-syntax).

Consider this function:

int foo(int x) {
    asm(".intel_syntax noprefix \n\t"
        "add  %0, 1  \n\t"
        ".att_syntax"
         : "+r"(x)
        );
    return x;
}

It assembles as follows with GCC (Godbolt):

        movl    %edi, %eax
        .intel_syntax noprefix 
         add %eax, 1                    # AT&T register name in Intel syntax
        .att_syntax

The sandwich method depends on GAS accepting %eax as a register name even in Intel-syntax mode. GAS from GNU Binutils does, but clang's built-in assembler doesn't.

On a Mac, even using real GCC the asm output has to assemble with an as that's based on clang, not GNU Binutils.

Using clang on that source code complains:

<source>:2:35: error: unknown token in expression
    asm(".intel_syntax noprefix \n\t"
                                  ^
<inline asm>:2:6: note: instantiated into assembly here
        add %eax, 1
            ^

(The first line of the error message didn't handle the multi-line string literal very well. If you use ; instead of \n\t and put everything on one line the clang error message works better but the source is a mess.)

I didn't check what happens with "ri" constraints when the compiler picks an immediate; it will still decorate it with $ but IDK if GAS silently ignores that, too, in Intel syntax mode.

PS: your asm statement has a bug: you forgot an early-clobber on your output operand so nothing is stopping the compiler from picking the same register for the %0 output and the %2 input that you don't read until the 2nd instruction. Then mov will destroy an input.

But using mov as the first or last instruction of an asm-template is usually also a missed-optimization bug. In this case you can and should just use lea %0, [%1 + %2] to let the compiler add with the result written to a 3rd register, non-destructively. Or just wrap the add instruction (using a "+r" operand and an "r", and let the compiler worry about data movement.) If it had to load the value from memory anyway, it can put it in the right register so no mov is needed.

PS: it's possible to write inline asm that works with -masm=intel or att, using GNU C inline asm dialect alternatives. e.g.

void atomic_inc(int *p) {
    asm( "lock add{l $1, %0 | %0, 1}"
       : "+m" (*p)
       :: "memory"
    );
}

compiles with gcc -O2 (-masm=att is the default) to

atomic_inc(int*):
    lock addl $1, (%rdi) 
    ret

Or with -masm=intel to:

atomic_inc(int*):
    lock add DWORD PTR [rdi], 1
    ret

Notice that the l suffix is required for AT&T, and the dword ptr is required for intel, because memory, immediate doesn't imply an operand-size. And that the compiler filled in valid addressing-mode syntax for both cases.

This works with clang, but only the AT&T version ever gets used.

answered Sep 19 '22 09:09

Peter Cordes

Note that -masm= also affects the default inline assembler syntax:

Output assembly instructions using selected dialect. Also affects which dialect is used for basic "asm" and extended "asm". Supported choices (in dialect order) are att or intel. The default is att. Darwin does not support intel.

That means that your first .intel_syntax directive is superfluous and the final .att_syntax is wrong because your GCC call compiles C to Intel assembler code.

IOW, either stick to -masm=intel or sandwich your inline Intel assembler code sections between .intel_syntax noprefix and .att_syntax prefix directives - but don't do both.

Note that the sandwich method isn't compatible with all inline assembler constraints - e.g. a constraint that involves m (i.e. memory operand) would insert an operand in ATT syntax which would yield an error like 'Error: junk (%rbp) after expression'. In those cases you have to use -masm=intel.

answered Sep 18 '22 09:09

maxschlepzig

Related questions
                            
                                Remove the comments generated by cpp
                            
                                Finding path of static system libraries in Linux
                            
                                Will passing std::string via copying be optimized?
                            
                                Linking Homebrew-compiled openmpi (or mpich2) to Homebrew's gcc
                            
                                libnl 3 (netlink library) undefined reference to nl* and genl*
                            
                                How to let cmake use "-pthread" instead of -lpthread"?
                            
                                Is there any way to increase the stack size/recursion limit?
                            
                                arm-none-eabi-ld: cannot find -lc
                            
                                __isr_vectors variable not found when placed inside a static library
                            
                                How to find and avoid uninitialised primitive members in C++?
                            
                                Compile-time counter in template class
                            
                                What does __ATOMIC_RELAXED mean?
                            
                                #including <alsa/asoundlib.h> and <sys/time.h> results in multiple definition conflict
                            
                                GCC Inline Assembly 'Nd' constraint
                            
                                How can I change the debug path included in the DWARF info of a binary by the compiler
                            
                                Minimum floating point number (closest to zero)
                            
                                GCC linker complains about undefined reference to existing global variable
                            
                                Why does the compiler allocate more than needed in the stack?
                            
                                g++ 5.4.0 - unable to use C++14 standard
                            
                                Assembler Error: expression too complex

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With