 

Why is size_t unsigned?

Bjarne Stroustrup wrote in The C++ Programming Language:

The unsigned integer types are ideal for uses that treat storage as a bit array. Using an unsigned instead of an int to gain one more bit to represent positive integers is almost never a good idea. Attempts to ensure that some values are positive by declaring variables unsigned will typically be defeated by the implicit conversion rules.
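As a minimal sketch of the "bit array" use that Stroustrup endorses (the flag names here are made up for illustration):

    #include <cstdint>
    #include <iostream>

    // Treating an unsigned value as a bag of bits: shifts and masks are
    // well-defined, and there is no sign bit to worry about.
    constexpr std::uint8_t FLAG_READ    = 1u << 0;
    constexpr std::uint8_t FLAG_WRITE   = 1u << 1;
    constexpr std::uint8_t FLAG_EXECUTE = 1u << 2;

    int main() {
        std::uint8_t perms = FLAG_READ | FLAG_WRITE;
        std::cout << ((perms & FLAG_WRITE) != 0) << '\n';   // 1: write bit set
    }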

size_t seems to be unsigned "to gain one more bit to represent positive integers". So was this a mistake (or trade-off), and if so, should we minimize use of it in our own code?

Scott Meyers has also written a relevant article on this subject. To summarize, he recommends not using unsigned in interfaces, regardless of whether the value is always positive. In other words, even when negative values make no sense, you shouldn't necessarily use unsigned.
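For example, an interface that takes an unsigned parameter will silently accept a negative argument (a minimal sketch of the kind of bug Meyers describes; reserve_rows is a hypothetical function):

    #include <iostream>

    // Hypothetical interface that uses unsigned "because the count can
    // never be negative" – exactly what Meyers advises against.
    void reserve_rows(unsigned count) {
        std::cout << "reserving " << count << " rows\n";
    }

    int main() {
        int requested = -1;        // e.g. the result of a buggy computation
        reserve_rows(requested);   // silently converts to 4294967295
    }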

Asked by Jon on Apr 16 '12 at 02:04




1 Answer

size_t is unsigned for historical reasons.

On an architecture with 16-bit pointers, such as the "small" memory model of DOS programming, making the size type signed would have limited strings and other objects to 32 KB, which would have been impractical.

For this reason, the C standard requires (via its required minimum ranges) that ptrdiff_t, the signed counterpart of size_t and the result type of pointer subtraction, be effectively 17 bits wide.
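For concreteness, a minimal sketch of where ptrdiff_t shows up, and how to inspect its actual range on a given platform:

    #include <cstddef>   // std::ptrdiff_t
    #include <cstdint>   // PTRDIFF_MAX
    #include <iostream>

    int main() {
        int a[4] = {0, 1, 2, 3};

        // Subtracting pointers yields the signed type std::ptrdiff_t.
        std::ptrdiff_t d = &a[3] - &a[0];   // 3

        // The C standard's minimum range for ptrdiff_t is ±65535,
        // i.e. effectively 17 bits including the sign.
        std::cout << d << '\n' << PTRDIFF_MAX << '\n';
    }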

Those reasons can still apply in parts of the embedded programming world.

However, they do not apply to modern 32-bit or 64-bit programming, where a much more important consideration is that the unfortunate implicit conversion rules of C and C++ make unsigned types into bug attractors when they're used for numbers (and hence for arithmetic operations and magnitude comparisons). With 20/20 hindsight we can now see that the decision to adopt those particular conversion rules, where e.g. string( "Hi" ).length() < -3 is practically guaranteed to be true, was rather silly and impractical. However, that decision means that in modern programming, adopting unsigned types for numbers has severe disadvantages and no advantages – except for satisfying the feelings of those who find unsigned to be a self-descriptive type name, and who fail to think of typedef int MyType.
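To see that conversion rule in action (a minimal sketch; most compilers will at least warn here with -Wsign-compare):

    #include <iostream>
    #include <string>

    int main() {
        std::string s = "Hi";
        // s.length() is std::size_t, an unsigned type. In the comparison,
        // the signed -3 is converted to a huge unsigned value, so the
        // "obviously false" expression below evaluates to true.
        std::cout << std::boolalpha << (s.length() < -3) << '\n';   // prints true
    }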

Summing up, it was not a mistake. It was a decision made for what were, at the time, very rational and practical programming reasons. It had nothing to do with transferring expectations from bounds-checked languages such as Pascal to C++ (a fallacy, but a very common one, even among people who have never heard of Pascal).

Answered by Cheers and hth. - Alf on Oct 19 '22 at 22:10