This is most probably the dumbest question anyone would ask, but regardless I hope I will find a clear answer for this.
My question is - How is an integer stored in computer memory?
In c# an integer is of size 32 bit. MSDN says we can store numbers from -2,147,483,648 to 2,147,483,647 inside an integer variable.
As per my understanding a bit can store only 2 values i.e 0 & 1. If I can store only 0 or 1 in a bit, how will I be able to store numbers 2 to 9 inside a bit?
More precisely, say I have this code int x = 5
; How will this be represented in memory or in other words how is 5 converted into 0's and 1's, and what is the convention behind it?
Numbers are stored on the computer in binary form. In other words, information is encoded as a sequence of 1's and 0's. On most computers, the memory is organized into 8-bit bytes. This means each 8-bit byte stored in memory will have a separate address.
Specifically, Java stores it using 32 bits of memory. In other words, it can represent values from -2,147,483,648 (-231) to 2,147,483,647 (231-1). In Java 8, it's possible to store an unsigned integer value up to 4,294,967,295 (232-1) by using new special helper functions.
Memory Integers are 16-bit integer operands that may be signed or unsigned. The range of an MI is -32768 to +32767. There are 2048 MIs (Address MI 0 - MI 2047).
In general, computers store real numbers in Scientific Notation, or Floating Point Format. That means that instead of storing the binary number 1010.1101 as it is written on the screen right now, the computer may represent it internally as 0.10101101*24.
It's represented in binary (base 2). Read more about number bases. In base 2 you only need 2 different symbols to represent a number. We usually use the symbols 0
and 1
. In our usual base we use 10
different symbols to represent all the numbers, 0
, 1
, 2
, ... 8
, and 9
.
For comparison, think about a number that doesn't fit in our usual system. Like 14. We don't have a symbol for 14, so how to we represent it? Easy, we just combine two of our symbols 1
and 4
. 14
in base 10 means 1*10^1 + 4*10^0
.
1110
in base 2 (binary) means 1*2^3 + 1*2^2 + 1*2^1 + 0*2^0 = 8 + 4 + 2 + 0 = 14
. So despite not having enough symbols in either base to represent 14
with a single symbol, we can still represent it in both bases.
In another commonly used base, base 16, which is also known as hexadecimal, we have enough symbols to represent 14
using only one of them. You'll usually see 14
written using the symbol e
in hexadecimal.
For negative integers we use a convenient representation called twos-complement which is the complement (all 1
s flipped to 0
and all 0
s flipped to 1
s) with one added to it.
There are two main reasons this is so convenient:
We know immediately if a number is positive of negative by looking at a single bit, the most significant bit out of the 32
we use.
It's mathematically correct in that x - y = x + -y
using regular addition the same way you learnt in grade school. This means that processors don't need to do anything special to implement subtraction if they already have addition. They can simply find the twos-complement of y
(recall, flip the bits and add one) and then add x
and y
using the addition circuit they already have, rather than having a special circuit for subtraction.
This is not a dumb question at all.
Let's start with uint because it's slightly easier. The convention is:
So five would be 00000000000000000000000000000101, because 5 = 1 x 20 + 0 x 21 + 1 x 22 + ... the rest are all zero.
That's a uint
. The convention for ints is:
uint
. This might seem like an odd convention. We use it because it turns out that it is easy to build chips that perform arithmetic in this format extremely quickly.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With