One often needs to read from memory one byte at a time, like in this naive memcpy()
implementation:
#include <stddef.h>  /* for size_t */

void *memcpy(void *dest, const void *src, size_t n)
{
    const char *from = (const char *)src;  /* keep src's const qualifier */
    char *to = (char *)dest;
    while (n--) *to++ = *from++;
    return dest;
}
However, I sometimes see people explicitly use unsigned char * instead of just char *. Of course, char and unsigned char are not necessarily the same type. But does it make a difference whether I use char *, signed char *, or unsigned char * when bytewise reading/writing memory?
UPDATE: Actually, I'm fully aware that c = 200 may yield different values depending on the type of c. What I am asking here is why people sometimes use unsigned char * instead of just char * when reading memory, e.g. in order to store a uint32_t in a char[4].
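For concreteness, here is a minimal sketch of that scenario (the names store_u32le and load_u32le are made up for this illustration): a uint32_t stored into and read back from a 4-byte buffer, one byte at a time, using unsigned char so the shifts and masks behave predictably.

#include <stdint.h>

/* Store v into buf, least significant byte first. */
void store_u32le(unsigned char buf[4], uint32_t v)
{
    buf[0] = (unsigned char)(v & 0xFF);
    buf[1] = (unsigned char)((v >> 8) & 0xFF);
    buf[2] = (unsigned char)((v >> 16) & 0xFF);
    buf[3] = (unsigned char)((v >> 24) & 0xFF);
}

/* Reassemble the value from the four bytes. */
uint32_t load_u32le(const unsigned char buf[4])
{
    return (uint32_t)buf[0]
         | ((uint32_t)buf[1] << 8)
         | ((uint32_t)buf[2] << 16)
         | ((uint32_t)buf[3] << 24);
}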
A signed char is a signed integer value and an unsigned char an unsigned one; both are typically smaller than, and are guaranteed not to be bigger than, a short. On typical implementations with 8-bit bytes, signed char values range from -128 to 127 and unsigned char values from 0 to 255. With Microsoft's compiler, the /J option changes the default type for char from signed char to unsigned char.
On x86 systems char is generally signed. On ARM systems it is generally unsigned (Apple iOS is an exception).
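A short example (my own, assuming an ordinary two's complement platform) that makes this difference observable: the byte 0xFF read through a plain char * sign-extends to -1 where char is signed, so the first comparison below fails there, while the unsigned char * view always sees 255.

#include <stdio.h>

int main(void)
{
    unsigned char byte = 0xFF;
    char          *sp = (char *)&byte;
    unsigned char *up = &byte;

    if (*sp == 0xFF)  /* false where char is signed: *sp is -1 */
        printf("plain char sees 0xFF\n");
    if (*up == 0xFF)  /* always true: *up is 255 */
        printf("unsigned char sees 0xFF\n");
    return 0;
}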
You should use unsigned char. The C99 standard says that unsigned char is the only type guaranteed to be dense (no padding bits), and it also defines that you may copy any object (except bitfields) exactly by copying it into an unsigned char array, which is the object representation in bytes.

The sensible interpretation of this, to me, is that if you use a pointer to access an object as bytes, you should use unsigned char.

Reference: http://blackshell.com/~msmud/cstd.html#6.2.6.1 (§6.2.6.1 of a C1x draft)
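As a sketch of what that guarantee buys you (dump_bytes is an illustrative helper of mine, not a standard function): any object can be copied into an unsigned char array and its exact object representation printed, byte by byte.

#include <stdio.h>
#include <string.h>

static void dump_bytes(const void *obj, size_t n)
{
    unsigned char bytes[64];   /* demo buffer; cap n so the copy fits */
    if (n > sizeof bytes)
        n = sizeof bytes;
    memcpy(bytes, obj, n);     /* copies the exact object representation */
    for (size_t i = 0; i < n; i++)
        printf("%02x ", bytes[i]);
    putchar('\n');
}

int main(void)
{
    double d = 1.0;
    dump_bytes(&d, sizeof d);  /* prints the bytes representing 1.0 */
    return 0;
}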
This is one point where C++ differs from C. Generally speaking, C only guarantees that raw memory access works for unsigned char; char may be signed, and on a 1's complement or signed magnitude machine, a -0 might be converted to +0 automatically, changing the bit pattern. For some reason (unknown to me), the C++ committee extends the guarantees supporting transparent copy (no change in bit patterns) to char as well as unsigned char; on a 1's complement or signed magnitude machine, the implementors therefore have no choice but to make plain char unsigned, in order to avoid such side effects. (And of course, most programmers today aren't concerned by such machines anyway.)
Anyway, the end result is that older programmers, who come from a C background (and may have actually worked on a 1's complement or signed magnitude machine), will automatically use unsigned char. It's also a frequent convention to reserve plain char for character data exclusively, with signed char for very small integral values and unsigned char for raw memory or when bit manipulation is intended. Such a rule allows the reader to distinguish between the different uses (provided it is followed religiously).
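A small example of that convention in practice (hexdump is a hypothetical helper, not a library function): plain char carries the text, while the raw-memory view goes through unsigned char *, so every byte reads as 0..255 regardless of whether the platform's char is signed.

#include <stdio.h>
#include <string.h>

static void hexdump(const void *mem, size_t n)
{
    const unsigned char *p = mem;  /* raw memory: unsigned char */
    for (size_t i = 0; i < n; i++)
        printf("%02x%c", p[i], (i + 1) % 16 ? ' ' : '\n');
    if (n % 16)
        putchar('\n');
}

int main(void)
{
    const char *text = "character data";   /* text: plain char */
    hexdump(text, strlen(text) + 1);       /* bytes, including the NUL */
    return 0;
}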