In this answer, zwol made this claim:
The correct way to convert two bytes of data from an external source into a 16-bit signed integer is with helper functions like this:
#include <stdint.h>

int16_t be16_to_cpu_signed(const uint8_t data[static 2]) {
    uint32_t val = (((uint32_t)data[0]) << 8) |
                   (((uint32_t)data[1]) << 0);
    return ((int32_t) val) - 0x10000u;
}

int16_t le16_to_cpu_signed(const uint8_t data[static 2]) {
    uint32_t val = (((uint32_t)data[0]) << 0) |
                   (((uint32_t)data[1]) << 8);
    return ((int32_t) val) - 0x10000u;
}
Which of the above functions is appropriate depends on whether the array contains a little-endian or a big-endian representation. Endianness is not the issue in question here; I am wondering why zwol subtracts 0x10000u from the uint32_t value converted to int32_t.
Why is this the correct way?
How does it avoid the implementation defined behavior when converting to the return type?
Since you can assume 2's complement representation, how would this simpler cast fail: return (uint16_t)val;
What is wrong with this naive solution:
int16_t le16_to_cpu_signed(const uint8_t data[static 2]) {
    return (uint16_t)data[0] | ((uint16_t)data[1] << 8);
}
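For concreteness, here is a small test harness of my own (not part of the question) that feeds a few byte pairs through the naive little-endian version above. Note that for the first two inputs it relies on exactly the implementation-defined conversion being asked about; on common 2's complement platforms it prints -1 -32768 4660.

#include <stdint.h>
#include <stdio.h>

/* The naive version quoted above: the implicit conversion of the return
   expression to int16_t is implementation-defined once the value exceeds 32767. */
static int16_t le16_to_cpu_signed(const uint8_t data[static 2]) {
    return (uint16_t)data[0] | ((uint16_t)data[1] << 8);
}

int main(void) {
    const uint8_t a[2] = { 0xFF, 0xFF };  /* pattern 0xFFFF, expected -1      */
    const uint8_t b[2] = { 0x00, 0x80 };  /* pattern 0x8000, expected -32768  */
    const uint8_t c[2] = { 0x34, 0x12 };  /* pattern 0x1234, expected  4660   */
    printf("%d %d %d\n", le16_to_cpu_signed(a), le16_to_cpu_signed(b), le16_to_cpu_signed(c));
    return 0;
}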
As background: a 16-bit integer can store 2^16 (or 65,536) distinct values. In an unsigned representation these are the integers 0 through 65,535, so for an unsigned short all 16 bits contribute to the magnitude and the largest representable number is 2^16 − 1 = 65,535. Using 2's complement, the same 65,536 bit patterns instead cover the signed range −32,768 to +32,767.
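The subtraction seen in the quoted helpers is exactly this reinterpretation written out as arithmetic. As a minimal sketch (the function name is mine), assuming the two bytes have already been assembled into a pattern in 0..0xFFFF:

#include <stdint.h>

/* Map a raw 16-bit pattern (0 .. 0xFFFF) to the value it denotes when read as a
   2's complement signed 16-bit number: patterns below 0x8000 keep their value,
   patterns with the top bit set denote pattern - 0x10000. */
int32_t twos_complement_value(uint32_t pattern) {
    return pattern < 0x8000 ? (int32_t)pattern : (int32_t)pattern - 0x10000;
}
/* e.g. 0xFFFF -> 65535 - 65536 = -1,  0x8000 -> 32768 - 65536 = -32768 */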
Computing the unsigned 16-bit value with (unsigned)data[0] | ((unsigned)data[1] << 8) (for the little-endian version) yields the 16-bit pattern as an unsigned int whose value always fits in 16 bits, and on a little-endian target it typically compiles down to a single 16-bit load.
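A minimal sketch of that composition step (the function name is mine), assuming the array holds a little-endian representation:

#include <stdint.h>

/* Assemble two bytes into the unsigned 16-bit pattern. The expression has type
   unsigned int after the integer promotions, but its value is always in 0 .. 0xFFFF. */
unsigned le16_pattern(const uint8_t data[static 2]) {
    return (unsigned)data[0] | ((unsigned)data[1] << 8);
}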
If int is 16-bit then your version relies on implementation-defined behaviour if the value of the expression in the return statement is out of range for int16_t.
However, the first version also has a similar problem; for example, if int32_t is a typedef for int and the input bytes are both 0xFF, then the result of the subtraction in the return statement is UINT_MAX, which causes implementation-defined behaviour when converted to int16_t.
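To make that failure mode visible, here is a small sketch of my own (assuming a typical platform where int32_t is a typedef for a 32-bit int) that prints the value of the quoted return expression before and after the conversion:

#include <stdint.h>
#include <stdio.h>

int main(void) {
    const uint8_t data[2] = { 0xFF, 0xFF };
    uint32_t val = ((uint32_t)data[0] << 8) | (uint32_t)data[1];   /* 0xFFFF */
    /* Same expression as in the quoted helper: the int32_t operand is converted
       to unsigned int by the usual arithmetic conversions, so the subtraction wraps. */
    unsigned before = (int32_t)val - 0x10000u;
    printf("%u\n", before);                      /* 4294967295, i.e. UINT_MAX */
    int16_t result = (int32_t)val - 0x10000u;    /* implementation-defined conversion */
    printf("%d\n", result);                      /* commonly -1, but not guaranteed */
    return 0;
}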
IMHO the answer you link to has several major issues.
This should be pedantically correct and also work on platforms that use sign-magnitude or 1's complement representations instead of the usual 2's complement. The input bytes are assumed to be in 2's complement.
int le16_to_cpu_signed(const uint8_t data[static 2])
{
    /* Assemble the unsigned 16-bit pattern; value is an unsigned int in 0 .. 0xFFFF. */
    unsigned value = data[0] | ((unsigned)data[1] << 8);
    if (value & 0x8000)
        /* Negative case: invert the low 16 bits (the mask keeps the result <= 0x7FFF,
           so the cast to int cannot go out of range), then negate and subtract 1. */
        return -(int)(~value & 0xFFFFu) - 1;
    else
        return value;
}
Because of the branch, it will be more expensive than other options.
What this accomplishes is that it avoids any assumption about how the int representation relates to the unsigned representation on the platform. The cast to int is required to preserve the arithmetic value for any number that will fit in the target type; because the inversion (masked to the low 16 bits) ensures the top bit of the 16-bit number will be zero, the value will fit. Then the unary - and the subtraction of 1 apply the usual rule for 2's complement negation. Depending on the platform, INT16_MIN could still overflow if it doesn't fit in the int type on the target, in which case long should be used.
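To make the mechanics concrete, here is a small worked check of my own (assuming it is compiled together with the function above):

#include <assert.h>
#include <stdint.h>
#include <stdio.h>

int le16_to_cpu_signed(const uint8_t data[static 2]);  /* defined above */

int main(void) {
    const uint8_t a[2] = { 0x01, 0x80 };  /* pattern 0x8001 */
    const uint8_t b[2] = { 0xFF, 0xFF };  /* pattern 0xFFFF */
    /* Trace for a: value = 0x8001, bit 15 is set, ~value & 0xFFFF = 0x7FFE = 32766,
       so the result is -(32766) - 1 = -32767, the 2's complement reading of 0x8001. */
    assert(le16_to_cpu_signed(a) == -32767);
    assert(le16_to_cpu_signed(b) == -1);
    puts("ok");
    return 0;
}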
The difference from the original version in the question comes at return time. While the original just always subtracted 0x10000 and let 2's complement wrapping fold the out-of-range result into the int16_t range, this version has an explicit if that avoids relying on that wrapping (which the standard leaves implementation-defined rather than guaranteeing).
Now in practice, almost all platforms in use today use 2's complement representation. In fact, if the platform has a standard-compliant stdint.h that defines int32_t, it must use 2's complement for it. Where this approach sometimes comes in handy is with scripting languages that have no integer data types at all: you can adapt the operations shown above to floats and they will give the correct result.
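As a rough illustration of that last remark (a sketch of my own, using C's double to stand in for a language that only has floating-point numbers), the same arithmetic carries over directly:

#include <stdint.h>

/* Little-endian signed 16-bit conversion done entirely in floating point:
   assemble the pattern 0 .. 65535, then subtract 65536 when bit 15 is set.
   Every intermediate value is a small integer, so a double represents it exactly. */
double le16_to_signed_double(const uint8_t data[static 2]) {
    double value = (double)data[0] + (double)data[1] * 256.0;
    return value >= 32768.0 ? value - 65536.0 : value;
}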