Why does adding a value to Float.MAX_VALUE not reach infinity?

Question

According to the standard, overflow in java is handled using a special value called infinity, but here the sum is 3.4028235E38. Why is this the case?

public class FloatingPointTest {
    public static void main(String[] args) {
        float a = Float.MAX_VALUE;
        float b = 1_000_000_000f; // Note the 'f' to ensure it's a float literal
        float sum = a + b;

        System.out.println("Float.MAX_VALUE: " + a);
        System.out.println("Large number: " + b);
        System.out.println("Sum: " + sum);
        System.out.println("Is sum equal to Float.POSITIVE_INFINITY? " + (sum == Float.POSITIVE_INFINITY));
    }
}

Eric Postpischil · Accepted Answer

Summary

An arithmetic result is rounded before it is tested for overflow. If the rounded result is representable, there is no overflow.

Details

For rounding, Java uses IEEE 754’s round-to-nearest, ties-to-even method (per The Java Virtual Machine Specification, JAVA SE 18 Edition 2022-02-23, clause 2.8 Floating-Point Arithmetic, page 20).

Rounding in IEEE-754 behaves as if the range were unbounded (per IEEE 754-2019 draft D2.47, clause 4.3, first paragraph, page 18). That is, the rounding is performed as if there were no bound on the exponent, and then a check for overflow is performed. For illustration, suppose we had a two-digit decimal format that had finite numbers up to, but not including, 100. So 99 is representable but 100 is out of bounds. Consider adding 99 and .23. The exact result would be 99.23. This is above 99, but we do not declare overflow yet. First, we round 99.23 to two digits. The result is 99. This is within the finite range, so it does not overflow.

In the binary32 format that Java uses for Float, Float.MAX_VALUE is 2¹²⁸−2¹⁰⁴. If the exponent were not bounded, the next greater representable value would be 2¹²⁸. When we add 10⁹ to Float.MAX_VALUE, the exact result would be 2¹²⁸−2¹⁰⁴+10⁹. When we round, this is nearer to 2¹²⁸−2¹⁰⁴ than it is to 2¹²⁸. So it is rounded to 2¹²⁸−2¹⁰⁴, and there is no overflow.

Discussion

The rule that we round before we test for overflow makes sense: There is no reason for rounding to change at the edge of the finite range or to produce an infinity when the ordinary result of rounding would be finite. If we changed the rules to make any result outside the finite range an overflow, then floating-point arithmetic would behave differently near the edge of the range than it does inside the finite range.

If you added 2¹⁰³ to Float.MAX_VALUE, the exact result, 2¹²⁸−2¹⁰⁴+2¹⁰³ would be exactly halfway between 2¹²⁸−2¹⁰⁴ and 2¹²⁸. The rule for tied results is to round to the choice with the even low digit in its significand. The low digit of 2¹²⁸−2¹⁰⁴ is 1, and the low digit of 2¹²⁸ is 0, so the result would be 2¹²⁸. That is out of the finite range, so an overflow would be declared.

Why does adding a value to Float.MAX_VALUE not reach infinity?

Tags:

java

floating-point

saul goodman

1 Answers

Summary

Details

Discussion

Eric Postpischil

Recent Activity

Donate For Us

Why does adding a value to Float.MAX_VALUE not reach infinity?

Tags:

java

floating-point

saul goodman

1 Answers

Summary

Details

Discussion

Eric Postpischil

Related questions

Recent Activity

Donate For Us