In C++, <code>sizeof('a') == sizeof(char) == 1</code>. This makes intuitive sense, since <code>'a'</code> is a character literal, and <code>sizeof(char) == 1</code> as defined by the standard. In C however, <code>sizeof('a') == sizeof(int)</code>. That is, it appears that C character literals are actually integers. Does anyone know why? I can find plenty of mentions of this C quirk but no explanation for why it exists.

The original question is "why?" The reason is that the definition of a literal character has evolved and changed, while trying to remain backwards compatible with existing code. In the dark days of early C there were no types at all. By the time I first learnt to program in C, types had been introduced, but functions didn't have prototypes to tell the caller what the argument types were. Instead it was standardised that everything passed as a parameter would either be the size of an int (this included all pointers) or it would be a double. This meant that when you were writing the function, all the parameters that weren't double were stored on the stack as ints, no matter how you declared them, and the compiler put code in the function to handle this for you. This made things somewhat inconsistent, so when K&R wrote their famous book, they put in the rule that a character literal would always be promoted to an int in any expression, not just a function parameter. When the ANSI committee first standardised C, they changed this rule so that a character literal would simply be an int, since this seemed a simpler way of achieving the same thing. When C++ was being designed, all functions were required to have full prototypes (this is still not required in C, although it is universally accepted as good practice). Because of this, it was decided that a character literal could be stored in a char. The advantage of this in C++ is that a function with a char parameter and a function with an int parameter have different signatures. This advantage is not the case in C. This is why they are different. Evolution...

Why are C character literals ints instead of chars?

Tags:

c++

c

char

sizeof

In C++, sizeof('a') == sizeof(char) == 1. This makes intuitive sense, since 'a' is a character literal, and sizeof(char) == 1 as defined by the standard.

In C however, sizeof('a') == sizeof(int). That is, it appears that C character literals are actually integers. Does anyone know why? I can find plenty of mentions of this C quirk but no explanation for why it exists.

317

asked Jan 11 '09 22:01

Joseph Garvin

3 Answers

discussion on same subject

"More specifically the integral promotions. In K&R C it was virtually (?) impossible to use a character value without it being promoted to int first, so making character constant int in the first place eliminated that step. There were and still are multi character constants such as 'abcd' or however many will fit in an int."

124

answered Nov 15 '22 22:11

Malx

The original question is "why?"

The reason is that the definition of a literal character has evolved and changed, while trying to remain backwards compatible with existing code.

In the dark days of early C there were no types at all. By the time I first learnt to program in C, types had been introduced, but functions didn't have prototypes to tell the caller what the argument types were. Instead it was standardised that everything passed as a parameter would either be the size of an int (this included all pointers) or it would be a double.

This meant that when you were writing the function, all the parameters that weren't double were stored on the stack as ints, no matter how you declared them, and the compiler put code in the function to handle this for you.

This made things somewhat inconsistent, so when K&R wrote their famous book, they put in the rule that a character literal would always be promoted to an int in any expression, not just a function parameter.

When the ANSI committee first standardised C, they changed this rule so that a character literal would simply be an int, since this seemed a simpler way of achieving the same thing.

When C++ was being designed, all functions were required to have full prototypes (this is still not required in C, although it is universally accepted as good practice). Because of this, it was decided that a character literal could be stored in a char. The advantage of this in C++ is that a function with a char parameter and a function with an int parameter have different signatures. This advantage is not the case in C.

This is why they are different. Evolution...

answered Nov 15 '22 23:11

John Vincent

I don't know the specific reasons why a character literal in C is of type int. But in C++, there is a good reason not to go that way. Consider this:

Click to copy

void print(int);
void print(char);

print('a');

You would expect that the call to print selects the second version taking a char. Having a character literal being an int would make that impossible. Note that in C++ literals having more than one character still have type int, although their value is implementation defined. So, 'ab' has type int, while 'a' has type char.

answered Nov 16 '22 00:11

Johannes Schaub - litb

Related questions
                            
                                Compiler error: memset was not declared in this scope
                            
                                CMake unable to determine linker language with C++
                            
                                Swapping two variable value without using third variable
                            
                                Faster code-completion with clang
                            
                                What is __gxx_personality_v0 for?
                            
                                (A + B + C) ≠ (A + C + B​) and compiler reordering
                            
                                Can I list-initialize a vector of move-only type?
                            
                                In C++, is it still bad practice to return a vector from a function?
                            
                                C++0x lambda capture by value always const?
                            
                                C++: what regex library should I use? [closed]
                            
                                Where is shared_ptr?
                            
                                How to clear ostringstream [duplicate]
                            
                                What makes this usage of pointers unpredictable?
                            
                                How do I check for C++11 support?
                            
                                Splitting templated C++ classes into .hpp/.cpp files--is it possible?
                            
                                How to disallow temporaries
                            
                                How to calculate a time difference in C++
                            
                                Placement of the asterisk in pointer declarations
                            
                                Why does code mutating a shared variable across threads apparently NOT suffer from a race condition?
                            
                                What's the best way to do a backwards loop in C/C#/C++?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With