Is the return type of a function part of the mangled name?

Tags: c++, name-mangling

Suppose I have two functions with the same parameter types and name (not in the same program):

std::string foo(int x) {
  return "hello"; 
}

int foo(int x) {
  return x;
}

Will they have the same mangled name once compiled?

Is the return type part of the mangled name in C++?

asked Nov 24 '16 16:11

sdgfsdh

2 Answers

As mangling schemes aren't standardised, there's no single answer to this question; the closest thing to an actual answer is to look at the names generated by the most common mangling schemes. To my knowledge, those are the GCC and MSVC schemes (in alphabetical order), so...

GCC:

To test this, we can use a simple program.

#include <string>
#include <cstdlib>

std::string foo(int x) { return "hello"; }
//int         foo(int x) { return x; }

int main() {
    // Assuming executable file named "a.out".
    system("nm a.out");
}

Compile and run with GCC or Clang, and it'll list the symbols it contains. Depending on which of the functions is uncommented, the results will be:

// GCC:
// ----

std::string foo(int x) { return "hello"; } // _Z3fooB5cxx11i
                                             // foo[abi:cxx11](int)
int         foo(int x) { return x; }       // _Z3fooi
                                             // foo(int)

// Clang:
// ------

std::string foo(int x) { return "hello"; } // _Z3fooi
                                             // foo(int)
int         foo(int x) { return x; }       // _Z3fooi
                                             // foo(int)
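
The demangled forms in the comments above can also be produced from inside a program. The following is a minimal sketch, assuming a GCC or Clang toolchain that ships `<cxxabi.h>`; the wrapper name `demangle` is made up for illustration, and none of this applies to MSVC.

```cpp
#include <cxxabi.h>  // abi::__cxa_demangle (GCC/Clang runtime only)
#include <cstdlib>   // std::free
#include <memory>
#include <string>

// Hypothetical helper: returns the demangled form of `mangled`, or the input
// unchanged if the runtime does not recognise it as an Itanium-style name.
std::string demangle(const char* mangled) {
    int status = 0;
    // __cxa_demangle returns a malloc'd buffer (or nullptr on failure),
    // so the result is released with std::free.
    std::unique_ptr<char, void (*)(void*)> text(
        abi::__cxa_demangle(mangled, nullptr, nullptr, &status), std::free);
    return (status == 0 && text) ? std::string(text.get())
                                 : std::string(mangled);
}
```

Calling `demangle("_Z3fooi")` gives `foo(int)`. An MSVC-style name such as `?foo@@YAHH@Z` is not understood by this runtime and comes back unchanged.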

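The GCC scheme (the Itanium C++ ABI) encodes relatively little for an ordinary, non-template function, and the return type isn't part of it. (Return types are encoded for function template specialisations, but that doesn't apply here.)

Prefix: _Z, which marks the symbol as a mangled C++ name.
Name: 3foo for ::foo (the length of the identifier, then the identifier).
Parameters: i for int.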

Despite this, the two names do differ when compiled with GCC (but not with Clang, in this test). That isn't the return type being mangled as such: B5cxx11 is an ABI tag, which GCC adds because the std::string version returns a type from libstdc++'s cxx11 ABI.

Note that the compiler does still keep track of the return type, and makes sure signatures match within a translation unit; it just doesn't use the function's mangled name to do so.
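
To make the point concrete, here is a toy encoder for the simplest case only: a non-template, non-member function at global scope whose parameters are all builtin types. It is a sketch rather than a real mangler, and the name `mangle_itanium_simple` and its small type table are assumptions made for illustration. Note that the return type is not even an input.

```cpp
#include <map>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper covering a tiny subset of the GCC (Itanium ABI) scheme.
// Real manglers also handle namespaces, classes, templates, substitutions,
// ABI tags and much more.
std::string mangle_itanium_simple(const std::string& name,
                                  const std::vector<std::string>& params) {
    static const std::map<std::string, std::string> codes = {
        {"void", "v"}, {"bool", "b"},  {"char", "c"},   {"int", "i"},
        {"long", "l"}, {"float", "f"}, {"double", "d"},
    };
    // Prefix, then the identifier preceded by its length.
    std::string out = "_Z" + std::to_string(name.size()) + name;
    // An empty parameter list is encoded as a single void.
    if (params.empty()) return out + "v";
    for (const auto& p : params) {
        const auto it = codes.find(p);
        if (it == codes.end())
            throw std::invalid_argument("unsupported type: " + p);
        out += it->second;
    }
    return out;
}
```

With this sketch, `mangle_itanium_simple("foo", {"int"})` yields `_Z3fooi` whether the function returns `int` or anything else, which is exactly why two such functions collide at link time.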

MSVC:

To test this, we can use a simple program, as above.

#include <string>
#include <cstdlib>
    
std::string foo(int x) { return "hello"; }
//int         foo(int x) { return x; }
    
int main() {
    // Assuming object file named "a.obj".
    // Redirect to a file, because there are a lot of symbols when <string> is included.
    system("dumpbin /symbols a.obj > a.txt");
}

Compile and run with Visual Studio, and a.txt will list the symbols it contains. Depending on which of the functions is uncommented, the results will be:

std::string foo(int x) { return "hello"; }
  // ?foo@@YA?AV?$basic_string@DU?$char_traits@D@std@@V?$allocator@D@2@@std@@H@Z
  // class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char> > __cdecl foo(int)
int         foo(int x) { return x; }
  // ?foo@@YAHH@Z
  // int __cdecl foo(int)

The MSVC scheme contains the entire declaration, including things that weren't explicitly specified:

Name: foo@ for ::foo, followed by @ to terminate (the leading ? marks the symbol as a mangled C++ name).
Symbol type: Everything after the name-terminating @.
Type and member status: Y for "non-member function".
Calling convention: A for __cdecl.
Return type:
- H for int.
- ?AV?$basic_string@DU?$char_traits@D@std@@V?$allocator@D@2@@std@ (followed by @ to terminate) for std::basic_string<char, std::char_traits<char>, std::allocator<char>> (std::string for short).
Parameter list: H for int (followed by @ to terminate).
Exception specifier: Z for throw(...); this one is omitted from demangled names unless it's something else, probably because MSVC just ignores it anyway.

This allows the linker to whine at you if declarations aren't identical across every compilation unit.
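
The breakdown above can be turned into a toy encoder for the same simple case: a `__cdecl` non-member function at global scope with builtin types only. This is a sketch, not MSVC's actual implementation; the name `mangle_msvc_simple` and its small type table are assumptions made for illustration. Unlike the GCC scheme, the return type is a required input here.

```cpp
#include <map>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper covering a tiny subset of the MSVC scheme.
std::string mangle_msvc_simple(const std::string& name,
                               const std::string& ret,
                               const std::vector<std::string>& params) {
    static const std::map<std::string, std::string> codes = {
        {"void", "X"}, {"char", "D"}, {"int", "H"}, {"double", "N"},
    };
    const auto code = [](const std::string& type) {
        const auto it = codes.find(type);
        if (it == codes.end())
            throw std::invalid_argument("unsupported type: " + type);
        return it->second;
    };
    // '?' prefix, name, "@@" to close the (global) scope, 'Y' for a
    // non-member function, 'A' for __cdecl, then the return type.
    std::string out = "?" + name + "@@YA" + code(ret);
    // An empty parameter list is written as X (void) with no '@' terminator.
    if (params.empty()) return out + "XZ";
    for (const auto& p : params) out += code(p);
    return out + "@Z";
}
```

Here `mangle_msvc_simple("foo", "int", {"int"})` yields `?foo@@YAHH@Z`, matching the dumpbin output above, while changing only the return type to `double` yields `?foo@@YANH@Z`, a different symbol.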

Generally, most compilers will use one of those schemes (or sometimes a variation thereof) when targeting *nix or Windows, respectively, but this isn't guaranteed. For example...

Clang, to my knowledge, will use the GCC scheme for *nix, or the MSVC scheme for Windows.
Intel C++ uses the GCC scheme for Linux and Mac, and the MSVC scheme (with a few minor variations) for Windows.
The Borland and Watcom compilers have their own schemes.
The Symantec and Digital Mars compilers generally use the MSVC scheme, with a few small changes.
Older versions of GCC, and a lot of UNIX tools, use a modified version of cfront's mangling scheme.
And so on...

(Schemes used by other compilers are thanks to Agner Fog's PDF.)
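
If code needs a rough idea of which family applies, the compiler's predefined macros give a best-effort guess. The sketch below assumes that `_MSC_VER` is defined by MSVC and by compilers operating in MSVC-compatible mode on Windows, and that `__GNUC__` is defined by GCC and by Clang elsewhere; the function name `likely_mangling_scheme` is made up for illustration, and compilers with their own schemes simply fall through to "unknown".

```cpp
#include <string>

// Hypothetical helper: a best-effort guess based on predefined macros only.
// It says nothing about the exact variant or version of the scheme.
std::string likely_mangling_scheme() {
#if defined(_MSC_VER)
    return "MSVC";
#elif defined(__GNUC__)
    return "GCC (Itanium C++ ABI)";
#else
    return "unknown";
#endif
}
```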

Note:

Examining the generated symbols, it becomes apparent that GCC's mangling scheme doesn't provide the same level of protection against Machiavelli as MSVC's. Consider the following:

// foo.cpp
#include <string>

// Simple wrapper class, to avoid encoding `cxx11 ABI` into the GCC name.
class MyString {
    std::string data;

  public:
    MyString(const char* const d) : data(d) {}

    operator std::string() { return data; }
};

// Evil.
MyString foo(int i) { return "hello"; }

// -----

// main.cpp
#include <iostream>

// Evil.
int foo(int);

int main() {
    std::cout << foo(3) << '\n';
}

If we compile each source file separately, then attempt to link the object files together...

GCC: MyString carries no cxx11 ABI tag, so MyString foo(int) is mangled as _Z3fooi, just like int foo(int). This allows the object files to be linked, and an executable is produced. Attempting to run it causes a segfault (formally, the behaviour is undefined).
MSVC: The linker will look for ?foo@@YAHH@Z; as we instead supplied ?foo@@YA?AVMyString@@H@Z, linking will fail.

Considering this, a mangling scheme that includes the return type is safer, even though functions can't be overloaded solely on differences in return type.
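
Whatever the scheme, the usual defence is to stop relying on the linker and give every source file the same declaration through a shared header, so that a mismatch is caught by the compiler instead. The following single-file sketch shows the idea; the file names in the comments and the helper `greeting` are hypothetical, and `MyString` is the wrapper from the example above with a const conversion operator.

```cpp
#include <string>

// ---- foo.h (hypothetical shared header, included by both source files) ----
class MyString {
    std::string data;

  public:
    MyString(const char* d) : data(d) {}

    operator std::string() const { return data; }
};

MyString foo(int i);  // the one declaration every translation unit sees

// ---- foo.cpp ----
MyString foo(int) { return "hello"; }

// ---- main.cpp ----
// Adding `int foo(int);` here would now be rejected at compile time,
// because it differs from the declaration in foo.h only in return type.
std::string greeting() { return foo(3); }
```

With the header in place, the evil redeclaration never reaches the linker, so it no longer matters whether the mangled name carries the return type.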

answered Oct 12 '22 11:10

Justin Time - Reinstate Monica

No, and I expect that their mangled name will be the same with all modern compilers. More importantly, using them in the same program results in undefined behavior. Functions in C++ cannot differ only in their return type.

answered Oct 12 '22 10:10

Sam Varshavchik
