In C/C++, what an <code>unsigned char</code> is used for? How is it different from a regular <code>char</code>?

In C++, there are three distinct character types: <ul> <li><code>char</code></li> <li><code>signed char</code></li> <li><code>unsigned char</code></li> </ul> If you are using character types for text, use the unqualified <code>char</code>: <ul> <li>it is the type of character literals like <code>'a'</code> or <code>'0'</code> (in C++ only, in C their type is <code>int</code>)</li> <li>it is the type that makes up C strings like <code>"abcde"</code> </li> </ul> It also works out as a number value, but it is unspecified whether that value is treated as signed or unsigned. Beware character comparisons through inequalities - although if you limit yourself to ASCII (0-127) you're just about safe. If you are using character types as numbers, use: <ul> <li> <code>signed char</code>, which gives you at least the -127 to 127 range. (-128 to 127 is common)</li> <li> <code>unsigned char</code>, which gives you at least the 0 to 255 range.</li> </ul> "At least", because the C++ standard only gives the minimum range of values that each numeric type is required to cover. <code>sizeof (char)</code> is required to be 1 (i.e. one byte), but a byte could in theory be for example 32 bits. <code>sizeof</code> would still be report its size as <code>1</code> - meaning that you could have <code>sizeof (char) == sizeof (long) == 1</code>.

This is implementation dependent, as the C standard does NOT define the signed-ness of <code>char</code>. Depending on the platform, char may be <code>signed</code> or <code>unsigned</code>, so you need to explicitly ask for <code>signed char</code> or <code>unsigned char</code> if your implementation depends on it. Just use <code>char</code> if you intend to represent characters from strings, as this will match what your platform puts in the string. The difference between <code>signed char</code> and <code>unsigned char</code> is as you'd expect. On most platforms, <code>signed char</code> will be an 8-bit two's complement number ranging from <code>-128</code> to <code>127</code>, and <code>unsigned char</code> will be an 8-bit unsigned integer (<code>0</code> to <code>255</code>). Note the standard does NOT require that <code>char</code> types have 8 bits, only that <code>sizeof(char)</code> return <code>1</code>. You can get at the number of bits in a char with <code>CHAR_BIT</code> in <code>limits.h</code>. There are few if any platforms today where this will be something other than <code>8</code>, though. There is a nice summary of this issue here. As others have mentioned since I posted this, you're better off using <code>int8_t</code> and <code>uint8_t</code> if you really want to represent small integers.

What is an unsigned char?

2 Answers

In C++, there are three distinct character types:

char
signed char
unsigned char

If you are using character types for text, use the unqualified char:

it is the type of character literals like 'a' or '0' (in C++ only, in C their type is int)
it is the type that makes up C strings like "abcde"

It also works out as a number value, but it is unspecified whether that value is treated as signed or unsigned. Beware character comparisons through inequalities - although if you limit yourself to ASCII (0-127) you're just about safe.

If you are using character types as numbers, use:

signed char, which gives you at least the -127 to 127 range. (-128 to 127 is common)
unsigned char, which gives you at least the 0 to 255 range.

"At least", because the C++ standard only gives the minimum range of values that each numeric type is required to cover. sizeof (char) is required to be 1 (i.e. one byte), but a byte could in theory be for example 32 bits. sizeof would still be report its size as 1 - meaning that you could have sizeof (char) == sizeof (long) == 1.

answered Oct 04 '22 17:10

Fruny

This is implementation dependent, as the C standard does NOT define the signed-ness of char. Depending on the platform, char may be signed or unsigned, so you need to explicitly ask for signed char or unsigned char if your implementation depends on it. Just use char if you intend to represent characters from strings, as this will match what your platform puts in the string.

The difference between signed char and unsigned char is as you'd expect. On most platforms, signed char will be an 8-bit two's complement number ranging from -128 to 127, and unsigned char will be an 8-bit unsigned integer (0 to 255). Note the standard does NOT require that char types have 8 bits, only that sizeof(char) return 1. You can get at the number of bits in a char with CHAR_BIT in limits.h. There are few if any platforms today where this will be something other than 8, though.

There is a nice summary of this issue here.

As others have mentioned since I posted this, you're better off using int8_t and uint8_t if you really want to represent small integers.

answered Oct 04 '22 17:10

Todd Gamblin

Related questions
                            
                                Why is this program erroneously rejected by three C++ compilers?
                            
                                What is the meaning of prepended double colon "::"?
                            
                                Use of 'const' for function parameters
                            
                                What uses are there for "placement new"?
                            
                                What are the differences between struct and class in C++?
                            
                                What is Linux’s native GUI API?
                            
                                How does the compilation/linking process work?
                            
                                What's the difference between "STL" and "C++ Standard Library"?
                            
                                What is the difference between float and double?
                            
                                C++11 rvalues and move semantics confusion (return statement)
                            
                                Why does GCC generate 15-20% faster code if I optimize for size instead of speed?
                            
                                C++ multiline string literal
                            
                                When to use extern in C++
                            
                                Iteration over std::vector: unsigned vs signed index variable
                            
                                Static constant string (class member)
                            
                                Initializing a static std::map<int, int> in C++
                            
                                error: passing xxx as 'this' argument of xxx discards qualifiers
                            
                                error: request for member '..' in '..' which is of non-class type
                            
                                How do I use arrays in C++?
                            
                                What is the most effective way to get the index of an iterator of an std::vector?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is an unsigned char?

Tags:

c++

c

char

Landon Kuhn

People also ask

2 Answers

Fruny

Todd Gamblin

Recent Activity

Donate For Us