Why are hexadecimal numbers prefixed with 0x? I understand how the prefix is used, but I don't understand why 0x in particular was chosen.


Why are hexadecimal numbers prefixed with 0x?

2 Answers

Short story: The 0 tells the parser it's dealing with a constant (and not an identifier/reserved word). Something is still needed to specify the number base: the x is an arbitrary choice.

Long story: In the 1960s, the prevalent programming number systems were decimal and octal: mainframes had 12-, 24- or 36-bit words, and each of those sizes is divisible by 3 = log2(8), the number of bits in one octal digit.
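The divisibility argument can be checked with a little arithmetic. This is only a sketch; the helper names `leftover_bits` and `digits_needed` are made up for illustration and are not from any library.

```c
/* Bits left over when a word is cut into digit groups of equal size.
   A result of 0 means the word splits cleanly into whole digits. */
unsigned leftover_bits(unsigned word_bits, unsigned bits_per_digit)
{
    return word_bits % bits_per_digit;
}

/* Number of digits needed to write a full word, rounding up when the
   last group is only partly filled. */
unsigned digits_needed(unsigned word_bits, unsigned bits_per_digit)
{
    return (word_bits + bits_per_digit - 1) / bits_per_digit;
}
```

With 3 bits per octal digit, 12-, 24- and 36-bit words leave no bits over. A 16-bit word leaves 1 bit over in octal but none with 4-bit hex digits, which matches the later point about the PDP-11.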

The BCPL language used the syntax 8 1234 for octal numbers. When Ken Thompson created B from BCPL, he used the 0 prefix instead. This is great because

1. an integer constant now always consists of a single token,
2. the parser can still tell right away it's got a constant,
3. the parser can immediately tell the base (0 is the same in both bases),
4. it's mathematically sane (00005 == 05), and
5. no precious special characters are needed (as in #123).
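To make the "tell right away" points concrete, here is a rough sketch of how a lexer could classify a token from its first one or two characters. The function `literal_base` is hypothetical and far simpler than a real C lexer (it does not validate the remaining digits or handle suffixes); it also includes the 0x case that C added later.

```c
#include <ctype.h>

/* Hypothetical helper: returns 0 if the token cannot be an integer
   constant (it does not start with a digit), otherwise the base
   implied by its prefix. */
int literal_base(const char *tok)
{
    if (!isdigit((unsigned char)tok[0]))
        return 0;                    /* identifier, keyword, or something else */
    if (tok[0] != '0')
        return 10;                   /* first digit 1..9: decimal */
    if (tok[1] == 'x' || tok[1] == 'X')
        return 16;                   /* 0x or 0X prefix: hexadecimal */
    return 8;                        /* any other leading 0: octal */
}
```

Under these assumptions, "1234" is classified as base 10, "0777" as base 8, "0xFF00" as base 16, and "FF00h" as not a number at all. No lookahead beyond the second character is needed.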

When C was created from B, the need for hexadecimal numbers arose (the PDP-11 had 16-bit words, which split evenly into 4-bit hex digits but not into 3-bit octal digits) and all of the points above were still valid. Since octals were still needed for other machines, 0x was arbitrarily chosen (00 was probably ruled out as awkward).
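The same prefix rules survive in the C standard library: calling `strtol` with base 0 asks it to infer the base from the text, just as the compiler does for integer constants. A minimal sketch, where the wrapper name `parse_c_style` is invented for this example:

```c
#include <stdlib.h>

/* Parse text using the prefix rules of C integer constants:
   with base 0, strtol picks hexadecimal for "0x"/"0X", octal for
   any other leading "0", and decimal otherwise. */
long parse_c_style(const char *text)
{
    return strtol(text, NULL, 0);
}
```

For instance, "0x1F" parses as 31, "017" as 15 and "17" as 17, while "00005" and "05" both parse as 5.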

C# is a descendant of C, so it inherits the syntax.


answered Nov 03 '22 00:11

Řrřola

Note: I don't know the correct answer; what follows is just my personal speculation!

As has been mentioned, a 0 before a number means it's octal:

04524 // octal, leading 0

Imagine needing to come up with a system to denote hexadecimal numbers, keeping in mind that we're working in a C-style environment. How about ending with h, as in assembly? Unfortunately you can't: it would allow tokens that are also valid identifiers (e.g. you could give a variable the same name), which would make for some nasty ambiguities.

8000h // hex
FF00h // oops - valid identifier!  Hex or a variable or type named FF00h?

You can't lead with a character for the same reason:

xFF00 // also valid identifier
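A quick way to see the clash is to use both spellings as variable names. The function name and the values below are arbitrary; this is only meant to show that a C compiler accepts these tokens as identifiers.

```c
/* Both names below are ordinary identifiers in C: each starts with a
   letter, so nothing marks it as a numeric constant. */
int identifier_demo(void)
{
    int FF00h = 3;   /* would collide with an assembly-style h suffix */
    int xFF00 = 4;   /* would collide with a bare x prefix */
    return FF00h + xFF00;
}
```

Since this compiles, a lexer that also treated FF00h or xFF00 as hex constants would have no way to tell the two uses apart.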

Using a hash was probably thrown out because it conflicts with the preprocessor:

#define ...
#FF00 // invalid preprocessor token?
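In present-day C the hash has a second preprocessor role beyond starting directives: inside a function-like macro it turns an argument into a string literal. That operator was standardized well after the 0x prefix was chosen, so the sketch below says nothing about the original reasoning; it only shows that # is already taken. `STRINGIFY` and `hash_demo` are names made up for this example.

```c
/* Inside a function-like macro, # converts the argument's tokens
   into a string literal. */
#define STRINGIFY(x) #x

/* Returns the string produced by applying # to the token FF00. */
const char *hash_demo(void)
{
    return STRINGIFY(FF00);
}
```

Here `hash_demo()` returns the text "FF00", not a number.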

In the end, for whatever reason, they decided to put an x after a leading 0 to denote hexadecimal. It is unambiguous: since it still starts with a digit, it can't be a valid identifier. It is probably based on the octal convention of a leading 0.

0xFF00 // definitely not an identifier!

answered Nov 03 '22 00:11

AshleysBrain

Tags: c, syntax, hex

unj2
