Endianness, from what I understand, is the order in which the bytes that compose a multibyte word are stored, at least in the most typical case. So a 16-bit integer may be stored as either 0xHHLL or 0xLLHH.
Assuming I don't have that wrong, what I would like to know is when endianness becomes a major factor when sending information between two computers whose endianness may differ.
If I transmit a short integer with a value of 1, in the form of a char array and with no correction, is it received and interpreted as 256?
If I decompose and recompose the short integer using the following code, will endianness no longer be a factor?
// Sender: extract each bit of the value arithmetically, LSB first.
for (unsigned n = 0; n < sizeof(uint16_t) * 8; ++n) {
    stl_bitset[n] = (value >> n) & 1;
}
// Receiver: rebuild the value arithmetically, LSB first (value starts at 0).
for (unsigned n = 0; n < sizeof(uint16_t) * 8; ++n) {
    value |= uint16_t(stl_bitset[n] & 1) << n;
}
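For reference, here is a minimal, self-contained version of that round trip (the names value and stl_bitset are from the snippet above; std::bitset<16> is assumed as the container):

#include <bitset>
#include <cstdint>
#include <iostream>

int main() {
    uint16_t value = 1;

    // Sender: decompose the value into bits, least significant bit first.
    std::bitset<16> stl_bitset;
    for (unsigned n = 0; n < sizeof(uint16_t) * 8; ++n) {
        stl_bitset[n] = (value >> n) & 1;
    }

    // ...imagine the bit string travels over the wire here...

    // Receiver: recompose the value, least significant bit first.
    uint16_t received = 0;
    for (unsigned n = 0; n < sizeof(uint16_t) * 8; ++n) {
        received |= uint16_t(stl_bitset[n] & 1) << n;
    }

    std::cout << received << '\n'; // prints 1 on any host, regardless of endianness
}

Because the shifts operate on values rather than on the in-memory representation, host endianness never enters the picture; the two sides only need to agree on the bit order of the serialized stream.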
Thanks in advance!
So knowledge of endianness is important when you are reading and writing data across the network from one system to another. If the sender and the receiver have different endianness, the receiver will decode a different value than the one the sender transmitted, unless the byte order is agreed upon and corrected.
Broadly speaking, the endianness in use is determined by the CPU. Because there are a number of options, it is unsurprising that different semiconductor vendors have chosen different endianness for their CPUs.
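If you ever need to check at run time which convention the host uses, a minimal sketch (the helper name host_is_little_endian is just illustrative; it inspects the representation of a known value with memcpy):

#include <cstdint>
#include <cstring>

// Returns true if the host stores the least significant byte first.
bool host_is_little_endian() {
    uint16_t probe = 1;
    unsigned char first_byte;
    std::memcpy(&first_byte, &probe, 1); // look at the lowest-addressed byte
    return first_byte == 1;
}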
Very abstractly speaking, endianness is a property of the reinterpretation of a variable as a char-array.
Practically, this matters precisely when you read() from and write() to an external byte stream (like a file or a socket). Or, speaking abstractly again, endianness matters when you serialize data (essentially because serialized data has no type system and just consists of dumb bytes); and endianness does not matter within your programming language, because the language only operates on values, not on representations. Going from one to the other is where you need to dig into the details.
To wit - writing:
uint32_t n = get_number();
unsigned char bytesLE[4] = { (unsigned char)n, (unsigned char)(n >> 8), (unsigned char)(n >> 16), (unsigned char)(n >> 24) }; // little-endian order
unsigned char bytesBE[4] = { (unsigned char)(n >> 24), (unsigned char)(n >> 16), (unsigned char)(n >> 8), (unsigned char)n }; // big-endian order
write(bytesLE, 4); // or bytesBE, whichever order the protocol specifies
Here we could just have said reinterpret_cast<unsigned char *>(&n), and the result would have depended on the endianness of the system.
And reading:
unsigned char buf[4];
read_data(buf, 4); // fill buf from the stream
uint32_t n_LE = uint32_t(buf[0]) | (uint32_t(buf[1]) << 8) | (uint32_t(buf[2]) << 16) | (uint32_t(buf[3]) << 24); // little-endian
uint32_t n_BE = uint32_t(buf[3]) | (uint32_t(buf[2]) << 8) | (uint32_t(buf[1]) << 16) | (uint32_t(buf[0]) << 24); // big-endian
Again, here we could have said uint32_t n = *reinterpret_cast<uint32_t*>(buf), and the result would have depended on the machine endianness.
As you can see, with integral types you never have to know the endianness of your own system, only of the data stream, if you use algebraic input and output operations. With other data types such as double, the issue is more complicated.
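For floating-point values, one common approach (assuming both ends use 64-bit IEEE-754 doubles, which is typical but not guaranteed by the C++ standard) is to copy the bit pattern into a uint64_t and then serialize that integer exactly as above; a sketch:

#include <cstdint>
#include <cstring>

// Copies the bit pattern of a double into a fixed-width integer, so the
// byte-by-byte integer techniques above can be reused.
uint64_t double_to_bits(double d) {
    static_assert(sizeof(uint64_t) == sizeof(double), "double must be 64 bits");
    uint64_t bits;
    std::memcpy(&bits, &d, sizeof(bits));
    return bits;
}

// Reverses the conversion on the receiving side.
double bits_to_double(uint64_t bits) {
    double d;
    std::memcpy(&d, &bits, sizeof(d));
    return d;
}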
For the record, if you're transferring data between devices you should pretty much always use network byte order, via ntohl, htonl, ntohs, and htons. These convert between your host's byte order and the network byte order standard (big-endian), regardless of what your system and the destination system use natively. Of course, both systems should be programmed like this - but they usually are in networking scenarios.
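For example, a minimal sketch for a 16-bit field (the to_wire/from_wire helper names are just illustrative; on POSIX systems htons and ntohs live in <arpa/inet.h>):

#include <arpa/inet.h> // htons, ntohs (POSIX)
#include <cstdint>

// Sender side: convert the host-order value to network (big-endian) order
// before writing it to the socket.
uint16_t to_wire(uint16_t host_value) {
    return htons(host_value);
}

// Receiver side: convert the network-order value back to host order
// after reading it from the socket.
uint16_t from_wire(uint16_t wire_value) {
    return ntohs(wire_value);
}

Because both calls are relative to the host's own byte order, the same code works correctly whether it runs on a little-endian or a big-endian machine.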