Why isn't there an endianness modifier in C++ like there is for signedness?

Tags:

(I guess this question could apply to many typed languages, but I chose to use C++ as an example.)

Why is there no way to just write:

struct foo {     little int x;   // little-endian     big long int y; // big-endian     short z;        // native endianness };

to specify the endianness for specific members, variables and parameters?

Comparison to signedness

I understand that the type of a variable not only determines how many bytes are used to store a value but also how those bytes are interpreted when performing computations.

For example, these two declarations each allocate one byte, and for both bytes, every possible 8-bit sequence is a valid value:

signed char s; unsigned char u;

but the same binary sequence might be interpreted differently, e.g. 11111111 would mean -1 when assigned to s but 255 when assigned to u. When signed and unsigned variables are involved in the same computation, the compiler (mostly) takes care of proper conversions.

In my understanding, endianness is just a variation of the same principle: a different interpretation of a binary pattern based on compile-time information about the memory in which it will be stored.

It seems obvious to have that feature in a typed language that allows low-level programming. However, this is not a part of C, C++ or any other language I know, and I did not find any discussion about this online.

Update

I'll try to summarize some takeaways from the many comments that I got in the first hour after asking:

signedness is strictly binary (either signed or unsigned) and will always be, in contrast to endianness, which also has two well-known variants (big and little), but also lesser-known variants such as mixed/middle endian. New variants might be invented in the future.
endianness matters when accessing multiple-byte values byte-wise. There are many aspects beyond just endianness that affect the memory layout of multi-byte structures, so this kind of access is mostly discouraged.
C++ aims to target an abstract machine and minimize the number of assumptions about the implementation. This abstract machine does not have any endianness.

Also, now I realize that signedness and endianness are not a perfect analogy, because:

endianness only defines how something is represented as a binary sequence, but now what can be represented. Both big int and little int would have the exact same value range.
signedness defines how bits and actual values map to each other, but also affects what can be represented, e.g. -3 can't be represented by an unsigned char and (assuming that char has 8 bits) 130 can't be represented by a signed char.

So that changing the endianness of some variables would never change the behavior of the program (except for byte-wise access), whereas a change of signedness usually would.

441

asked Nov 28 '17 12:11

Lena Schimmel

1 Answers

What the standard says

[intro.abstract]/1:

The semantic descriptions in this document define a parameterized nondeterministic abstract machine. This document places no requirement on the structure of conforming implementations. In particular, they need not copy or emulate the structure of the abstract machine. Rather, conforming implementations are required to emulate (only) the observable behavior of the abstract machine as explained below.

C++ could not define an endianness qualifier since it has no concept of endianness.

Discussion

About the difference between signness and endianness, OP wrote

In my understanding, endianness is just a variation of the same principle [(signness)]: a different interpretation of a binary pattern based on compile-time information about the memory in which it will be stored.

I'd argue signness both have a semantic and a representative aspect¹. What [intro.abstract]/1 implies is that C++ only care about semantic, and never addresses the way a signed number should be represented in memory². Actually, "sign bit" only appears once in the C++ specs and refer to an implementation-defined value.
On the other hand, endianness only have a representative aspect: endianness conveys no meaning.

With C++20, std::endian appears. It is still implementation-defined, but let us test the endian of the host without depending on old tricks based on undefined behaviour.

¹⁾ Semantic aspect: an signed integer can represent values below zero; representative aspect: one need to, for example, reserve a bit to convey the positive/negative sign.
²⁾ In the same vein, C++ never describe how a floating point number should be represented, IEEE-754 is often used, but this is a choice made by the implementation, in any case enforced by the standard: [basic.fundamental]/8 "The value representation of floating-point types is implementation-defined".

129

answered Oct 07 '22 11:10

YSC

Related questions
                            
                                How can I decode the boost library naming?
                            
                                Correct way to initialize vector member variable
                            
                                Numerically stable way to compute sqrt((b²*c²) / (1-c²)) for c in [-1, 1]
                            
                                GNU compiler warning "class has virtual functions but non-virtual destructor"
                            
                                What is the meaning of "generic programming" in c++?
                            
                                Vectors in Arduino
                            
                                What does "WINAPI" in main function mean?
                            
                                C++11 does not deduce type when std::function or lambda functions are involved
                            
                                C++ STL map::erase a non-existing key
                            
                                how does malloc understand alignment?
                            
                                Reading a password from std::cin
                            
                                GCC 7, -Wimplicit-fallthrough warnings, and portable way to clear them?
                            
                                Overriding public virtual functions with private functions in C++
                            
                                How to parse date/time from string?
                            
                                Deprecated conversion from string literal to 'char*'
                            
                                Using a C++ class member function as a C callback function
                            
                                How to check whether operator== exists?
                            
                                Do the &= and |= operators for bool short-circuit?
                            
                                c++ boost split string
                            
                                Do C++ enums Start at 0?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why isn't there an endianness modifier in C++ like there is for signedness?

Tags:

c++

language-features

static-typing

endianness

Comparison to signedness

Update

Lena Schimmel

People also ask

1 Answers

What the standard says

`[intro.abstract]/1`:

Discussion

YSC

Recent Activity

Donate For Us

Why isn't there an endianness modifier in C++ like there is for signedness?

Tags:

c++

language-features

static-typing

endianness

Comparison to signedness

Update

Lena Schimmel

People also ask

1 Answers

What the standard says

[intro.abstract]/1:

Discussion

YSC

Related questions

Recent Activity

Donate For Us

`[intro.abstract]/1`: