I didn't study IT, and only very recently came across bit shifts and an application for two's complement. So, can you please use simple English in your explanations and assume I know hardly anything about IP addresses, bit operations, and Java datatypes? Today, I found the following piece of code (abbreviated): <pre class="prettyprint"><code>long m = (-1) << (byte) 16; </code></pre> Now, this is for IP subnet masking. I know I need to start out with 4 blocks of 8 bits (i.e. 4 bytes), and all bits have to be "switched on": <code>11111111 11111111 1111111 1111111</code> Next, zeros are shifted in from the right, in this case 16 bits' worth; so we get <code>11111111 11111111 00000000 0000000</code>, the mask. But I do have a few questions: <ol> <li>Does the <code>16</code> have to be of type <code>byte</code> for this to work?</li> <li>The result is of type <code>long</code>. When the expression above runs, <code>-1</code> gets converted into - effectively - 4x8bit blocks. How does Java know it needs 32 positions/bits (an IP address' length), and not, say, 16, or 8, when applying two's complement? (I'm guessing that has to do with the <code>long</code> datatype?)</li> <li>Why is two's complement applied to <code>-1</code> to begin with? (Google gives you <code>-0b1</code> if you ask it what <code>-1</code> is in binary. I first thought it might be to do with overflow, but it isn't, is it...?)</li> <li>Really, what datatypes does the compiler convert this to while it's running the code, to make it all work?</li> </ol> UPDATE: The <code>16</code> is produced at runtime by a method; I just put a constant in here as an example. In hindsight probably a bad idea...

<blockquote> Really, what datatypes does the compiler convert this to while it's running the code, to make it all work? </blockquote> This <pre class="prettyprint"><code>(-1) << (byte) 16; </code></pre> is a constant expression. Its value is known at compile time. It's a <code>long</code> with the value <code>-65536</code> (in decimal representation). If the expression wasn't a constant expression, the type of the variable wouldn't matter when evaluating the expression. It would only matter later when its value is assigned to the variable. Take for example <pre class="prettyprint"><code>int i = -1; long m = i << (byte) 16; </code></pre> The expression above is one that involves a shift operator and two operands, one of type <code>int</code> and another of type <code>byte</code>. The JLS states the following concerning shift operators and their operands <blockquote> Unary numeric promotion (§5.6.1) is performed on each operand separately. </blockquote> which is <blockquote> Otherwise, if the operand is of compile-time type byte, short, or char, it is promoted to a value of type int by a widening primitive conversion (§5.1.2). </blockquote> So the <code>byte</code> value is widened to an <code>int</code>. So no to your first question. The result of the expression would be a value of type <code>int</code> (32 bits). It has to be assigned to a <code>long</code> (64 bits) variable, so the value would be widened to a <code>long</code> before being assigned. From the JLS again <blockquote> The integral types are byte, short, int, and long, whose values are 8-bit, 16-bit, 32-bit and 64-bit signed two's-complement integers, respectively, and char, whose values are 16-bit unsigned integers representing UTF-16 code units (§3.1). </blockquote> That's how they are stored.

It is actually confusing that your <code>m</code> variable is of <code>long</code> type because an IP address is 32-bit and corresponds to an <code>int</code>. Your right-hand side is indeed <code>int</code> and only after it's fully computed is it extended to <code>long</code> (64-bit). Answering your questions: <ol> <li>It doesn't. You can remove the cast.</li> <li>The result is actually of type <code>int</code>, but gets converted to <code>long</code> because the type of <code>m</code> requires it.</li> <li>Two's complement is not "applied" to anything, really. The number <code>-1</code> is encoded in two's complement. You need some way to represent negative numbers with nothing but bits. Plus, here two's complement plays a side role: it is just about <code>-1</code> being encoded as all 1-bits.</li> <li>It's all just a block of 32 one-bits being shifted to the left, zeroes filling in the vacancy. Then, to convert to <code>long</code>, 32 more 1-bits are added on the left side.</li> </ol>

How does Java's bit shift operator work under the hood?

Q: What does bit shifting by 1 do?

Bitshifting shifts the binary representation of each pixel to the left or to the right by a pre-defined number of positions. Shifting a binary number by one bit is equivalent to multiplying (when shifting to the left) or dividing (when shifting to the right) the number by 2.

Tags:

java

bit-manipulation

twos-complement

bit

domain-masking

I didn't study IT, and only very recently came across bit shifts and an application for two's complement. So, can you please use simple English in your explanations and assume I know hardly anything about IP addresses, bit operations, and Java datatypes?

Today, I found the following piece of code (abbreviated):

long m = (-1) << (byte) 16;

Now, this is for IP subnet masking. I know I need to start out with 4 blocks of 8 bits (i.e. 4 bytes), and all bits have to be "switched on": 11111111 11111111 1111111 1111111 Next, zeros are shifted in from the right, in this case 16 bits' worth; so we get 11111111 11111111 00000000 0000000, the mask.

But I do have a few questions:

Does the 16 have to be of type byte for this to work?
The result is of type long. When the expression above runs, -1 gets converted into - effectively - 4x8bit blocks. How does Java know it needs 32 positions/bits (an IP address' length), and not, say, 16, or 8, when applying two's complement? (I'm guessing that has to do with the long datatype?)
Why is two's complement applied to -1 to begin with? (Google gives you -0b1 if you ask it what -1 is in binary. I first thought it might be to do with overflow, but it isn't, is it...?)
Really, what datatypes does the compiler convert this to while it's running the code, to make it all work?

UPDATE: The 16 is produced at runtime by a method; I just put a constant in here as an example. In hindsight probably a bad idea...

593

asked Sep 08 '15 15:09

Christian

2 Answers

Really, what datatypes does the compiler convert this to while it's running the code, to make it all work?

This

(-1) << (byte) 16;

is a constant expression. Its value is known at compile time. It's a long with the value -65536 (in decimal representation).

If the expression wasn't a constant expression, the type of the variable wouldn't matter when evaluating the expression. It would only matter later when its value is assigned to the variable.

Take for example

int i = -1;
long m = i << (byte) 16;

The expression above is one that involves a shift operator and two operands, one of type int and another of type byte.

The JLS states the following concerning shift operators and their operands

Unary numeric promotion (§5.6.1) is performed on each operand separately.

which is

Otherwise, if the operand is of compile-time type byte, short, or char, it is promoted to a value of type int by a widening primitive conversion (§5.1.2).

So the byte value is widened to an int. So no to your first question.

The result of the expression would be a value of type int (32 bits). It has to be assigned to a long (64 bits) variable, so the value would be widened to a long before being assigned.

From the JLS again

The integral types are byte, short, int, and long, whose values are 8-bit, 16-bit, 32-bit and 64-bit signed two's-complement integers, respectively, and char, whose values are 16-bit unsigned integers representing UTF-16 code units (§3.1).

That's how they are stored.

174

answered Oct 28 '22 18:10

Sotirios Delimanolis

It is actually confusing that your m variable is of long type because an IP address is 32-bit and corresponds to an int. Your right-hand side is indeed int and only after it's fully computed is it extended to long (64-bit). Answering your questions:

It doesn't. You can remove the cast.
The result is actually of type int, but gets converted to long because the type of m requires it.
Two's complement is not "applied" to anything, really. The number -1 is encoded in two's complement. You need some way to represent negative numbers with nothing but bits. Plus, here two's complement plays a side role: it is just about -1 being encoded as all 1-bits.
It's all just a block of 32 one-bits being shifted to the left, zeroes filling in the vacancy. Then, to convert to long, 32 more 1-bits are added on the left side.

answered Oct 28 '22 18:10

Marko Topolnik

Related questions
                            
                                Java generics: Bound mismatch
                            
                                What's wrong with using Inheritance Equality in Java?
                            
                                How configure Spring-Boot app to continue to use RestEasy?
                            
                                Java: Covariant Wildcard Bounds in Method parameters
                            
                                Change all tabs with whitespaces in IntelliJ for 10K+ classes
                            
                                Log4j2.xml not found but log4j2-test.xml is
                            
                                hibernate - could not execute statement; SQL [n/a] - saving nested object
                            
                                HttpURLConnection keeping cache
                            
                                Gradle deploy project to ear
                            
                                NullPointerException instead of null (JVM Bug?)
                            
                                Can I use java 8 in an mixed scala 2.10 / java project built by sbt?
                            
                                NullPointer in log during first connection to database
                            
                                Generic Chaos Java
                            
                                Apache Spark: ERROR local class incompatible when initiating a SparkContext class
                            
                                How to create a button in PDF BOX?
                            
                                Java8 Effectively Final compile time error on non final variable
                            
                                index.jsp file opens even when the <welcome-file-list> is not defined
                            
                                Tomcat JVM version different from JAVA_HOME
                            
                                Java 8, Convert file name array to file array
                            
                                Why is my `unmodifiableList` modifiable? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With