The video "Gangnam Style" (I'm sure you've heard it) just exceeded 2 billion views on youtube. In fact, Google says that they never expected a video to be greater than a 32-bit integer... which alludes to the fact that Google used int
instead of unsigned
for their view counter. I think they had to re-write their code a bit to accommodate larger views.
Checking their style guide: https://google-styleguide.googlecode.com/svn/trunk/cppguide.html#Integer_Types
...they advise "don't use an unsigned integer type," and give one good reason why: unsigned could be buggy.
It's a good reason, but one that could be guarded against. My question is: is it bad coding practice in general to use unsigned int?
Unsigned integers are used when we know that the value that we are storing will always be non-negative (zero or positive).
The Google C++ style guide recommends avoiding unsigned integers except in situations that definitely require it (for example: file formats often store sizes in uint32_t or uint64_t -- no point in wasting a signedness bit that will never be used).
An int is signed by default, meaning it can represent both positive and negative values; an unsigned int is an integer that can never be negative. The type is declared as unsigned int, and the format specifier used with scanf() and printf() for an unsigned int variable is "%u".
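A minimal sketch of that usage (C's stdio functions, which are also available in C++ via <cstdio>; the variable name is just for illustration):

    #include <cstdio>

    int main() {
        unsigned int views = 0;

        // %u reads/writes an unsigned int; scanf needs a pointer to it.
        if (std::scanf("%u", &views) == 1) {
            std::printf("views = %u\n", views);
        }
        return 0;
    }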
The Google rule is widely accepted in professional circles. The problem is that the unsigned integral types are sort of broken, and have unexpected and unnatural behavior when used for numeric values; they don't work well as a cardinal type. For example, an index into an array may never be negative, but it makes perfect sense to write abs(i1 - i2) to find the distance between two indices. Which won't work if i1 and i2 have unsigned types.
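A small sketch of that failure mode (the index values are just examples):

    #include <cstdio>
    #include <cstdlib>

    int main() {
        // With signed indices, abs(i1 - i2) is the distance you expect.
        int s1 = 3, s2 = 7;
        std::printf("signed distance:   %d\n", std::abs(s1 - s2));  // prints 4

        // With unsigned indices, i1 - i2 wraps around to a huge value
        // before abs() could ever see it, so the "distance" is nonsense.
        unsigned int u1 = 3, u2 = 7;
        std::printf("unsigned distance: %u\n", u1 - u2);  // prints 4294967292 for a 32-bit unsigned int
        return 0;
    }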
As a general rule, this particular rule in the Google style guidelines corresponds more or less to what the designers of the language intended. Any time you see something other than int, you can assume there is a special reason for it. If it is because of the range, it will be long or long long, or even int_least64_t. Using unsigned types is generally a signal that you're dealing with bits rather than with the numeric value of the variable, or (at least in the case of unsigned char) that you're dealing with raw memory.
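A short illustration of that convention, where unsigned really is the natural choice (the flag bits are made up for the example):

    #include <cstdint>
    #include <cstdio>

    int main() {
        // Unsigned types signal bit manipulation: shifts and masks have
        // well-defined behavior and the "sign bit" carries no meaning.
        std::uint32_t flags = 0;
        flags |= 1u << 3;       // set bit 3
        flags &= ~(1u << 0);    // clear bit 0

        std::printf("flags = 0x%08X\n", static_cast<unsigned>(flags));  // 0x00000008
        return 0;
    }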
With regards to the "self-documentation" of using an unsigned: this doesn't hold up, since there are almost always a lot of values that the variable cannot (or should not) take, including many positive ones. C++ doesn't have sub-range types, and the way unsigned is defined means that it cannot really be used as one either.
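For instance, a hypothetical setter that tries to "document" a non-negative count through its parameter type doesn't actually enforce anything:

    #include <cstdio>

    // Hypothetical function: the unsigned parameter is meant to say
    // "this count is never negative".
    void setCount(unsigned int count) {
        std::printf("count = %u\n", count);
    }

    int main() {
        // A negative argument is not rejected; it is silently converted
        // (wrapped modulo 2^32) instead of being diagnosed.
        setCount(-1);  // prints 4294967295 for a 32-bit unsigned int
        return 0;
    }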
This guideline is extremely misleading. Blindly using int instead of unsigned int won't solve anything. That simply shifts the problems somewhere else. You absolutely must be aware of integer overflow when doing arithmetic on fixed-precision integers. If your code is written in a way that does not handle integer overflow gracefully for some given inputs, then your code is broken regardless of whether you use signed or unsigned ints. With unsigned ints you must be aware of integer underflow as well, and with doubles and floats you must be aware of many additional issues with floating-point arithmetic.
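A classic sketch of the unsigned underflow trap, using a backwards loop over an array:

    #include <cstdio>

    int main() {
        const int data[] = {1, 2, 3};

        // With an unsigned counter, decrementing past 0 wraps around to a
        // huge value, so "i >= 0" is always true and the loop never ends
        // (kept commented out so the sketch terminates):
        //
        // for (unsigned int i = 2; i >= 0; --i) { std::printf("%d\n", data[i]); }

        // A signed counter behaves the way the condition reads.
        for (int i = 2; i >= 0; --i) {
            std::printf("%d\n", data[i]);
        }
        return 0;
    }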
Just take this article about a bug in the standard Java binary search algorithm, published by none other than Google, as an example of why you must be aware of integer overflow. In fact, that very article shows C++ code casting to unsigned int in order to guarantee correct behavior. The article also starts out by presenting a bug in Java, where, guess what, they don't have unsigned int. However, they still ran into a bug with integer overflow.
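The midpoint computation at the heart of that bug, sketched in C++ with the two fixes the article discusses (the variable values here are only there to provoke the overflow):

    #include <climits>
    #include <cstdio>

    int main() {
        int low = INT_MAX - 1, high = INT_MAX;

        // Buggy midpoint: low + high overflows a signed int, which is
        // undefined behavior in C++ (and produced a negative index in Java).
        // int mid_bad = (low + high) / 2;

        // Fix 1: keep every intermediate value in range.
        int mid1 = low + (high - low) / 2;

        // Fix 2: the cast-to-unsigned trick; the sum wraps in a
        // well-defined way and the shift restores the midpoint
        // (valid because low and high are non-negative indices).
        int mid2 = static_cast<int>((static_cast<unsigned int>(low)
                                   + static_cast<unsigned int>(high)) >> 1);

        std::printf("mid1 = %d, mid2 = %d\n", mid1, mid2);
        return 0;
    }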