Is sse2 enabled by default in g++?

Tags:

When I run g++ -Q --help=target, I get

-msse2 [disabled].

However, if I create the assembly code of with default options as

g++ -g mycode.cpp -o mycode.o; objdump -S mycode.o > default,

and a sse2 version with

g++ -g -msse2 mycode.cpp -o mycode.sse2.o; objdump -S mycode.sse2.o > sse2,

and finally a non-sse2 version with

g++ -g -mno-sse2 mycode.cpp -o mycode.nosse2.o; objdump -S mycode.nosse2.o > nosse2

I see basically no difference between default and sse2, but a big difference between default and nosse2, so this tells me that, by default, g++ is using sse2 instructions, even though I am being told it is disabled ... what is going on here?

I am compiling on a Xeon E5-2680 under Linux with gcc-4.4.7 if it matters.

367

asked Jun 08 '15 19:06

drjrm3

1 Answers

If you are compiling for 64bit, then this is totally fine and documented behavior.

As stated in the gcc docs the SSE instruction set is enabled by default when using an x86-64 compiler:

-mfpmath=unit

Generate floating point arithmetics for selected unit unit. The choices for unit are:

`387'

Use the standard 387 floating point coprocessor present majority of chips and emulated otherwise. Code compiled with this option will run almost everywhere. The temporary results are computed in 80bit precision instead of precision specified by the type resulting in slightly different results compared to most of other chips. See -ffloat-store for more detailed description.

This is the default choice for i386 compiler.

`sse'

Use scalar floating point instructions present in the SSE instruction set. This instruction set is supported by Pentium3 and newer chips, in the AMD line by Athlon-4, Athlon-xp and Athlon-mp chips. The earlier version of SSE instruction set supports only single precision arithmetics, thus the double and extended precision arithmetics is still done using 387. Later version, present only in Pentium4 and the future AMD x86-64 chips supports double precision arithmetics too.

For the i386 compiler, you need to use -march=cpu-type, -msse or -msse2 switches to enable SSE extensions and make this option effective. For the x86-64 compiler, these extensions are enabled by default.

The resulting code should be considerably faster in the majority of cases and avoid the numerical instability problems of 387 code, but may break some existing code that expects temporaries to be 80bit.

This is the default choice for the x86-64 compiler.

answered Oct 02 '22 00:10

Uroc327

Related questions
                            
                                YouCompleteMe can't autocomplete
                            
                                How to use swig with compiled dll and header file only
                            
                                GoogleTest CMake and Make tests not running
                            
                                empty curly bracket {} as end of range
                            
                                C++ Templates with pointer to member function by signature and type
                            
                                Explicit constructor and initialization with std::initializer_list
                            
                                Linking libcurl while cross compiling with mingw32 under Linux for Windows
                            
                                How do I switch between local and global settings for the initial state of a C++11 RNG?
                            
                                Segmentation fault on one Linux machine but not another with C++ code
                            
                                What else do I need to use variadic template inheritance to create lambda overloads?
                            
                                gcc error trying to exec 'cc1': execvp: No such file or directory when running with non-root user
                            
                                Handle removed variable from boost serialize
                            
                                If I created a process, does it mean that I will always be able to terminate it?
                            
                                OpenCV - Zero padding of Mat
                            
                                Raw character literal
                            
                                Switch Statement: Is the logic different in C v/s. other languages like Java?
                            
                                How to find out what type to use for OpenCV .at function in C++?
                            
                                incomprehensible performance improvement with openmp even when num_threads(1)
                            
                                Linear congruential generator in C++
                            
                                Debugging GCC Compile Times [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is sse2 enabled by default in g++?

Tags:

c++

linux

gcc

drjrm3

People also ask

1 Answers

Uroc327

Recent Activity

Donate For Us