How to perform round to even with floating point numbers

With regard to IEEE-754 single-precision floating point, how do you perform round to nearest, where ties round to the value with an even digit in the required position (the default and by far the most common mode)?

Basically I have the guard bit, round bit, and sticky bit. So if we form those into a vector and call it GRS, then the following rules apply:

If G = 0, round down (do nothing)
If G = 1, and RS = 10 or RS = 01, round up (add one to mantissa)
If GRS = 111, round to even

So I am not sure how to perform the round to nearest. Any help is greatly appreciated.

298

asked Jan 24 '12 04:01

Veridian

1 Answer

Just to make sure we're on the same page, G is the most significant bit of the three, R comes next and S can be thought of as the least significant bit because its value partially represents the even less significant bits that have been truncated in the calculations. These three bits are only used while doing calculations and aren't stored in the floating-point variable before or after the calculations.
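To make the roles of the three bits concrete, here is a minimal sketch of how they could be extracted from a wider intermediate result. The function name `split_grs` and the convention of passing the number of discarded bits are assumptions made for illustration. The sticky bit is the OR of everything below R.

```python
def split_grs(wide, drop):
    """Split an unsigned integer significand into kept bits plus G, R, S.

    `wide` holds the full-width intermediate result and `drop` is the number
    of low-order bits that will not be stored (this sketch assumes drop >= 2).
    """
    if drop < 2:
        raise ValueError("this sketch assumes at least two discarded bits")
    kept = wide >> drop                      # bits that will be stored
    g = (wide >> (drop - 1)) & 1             # guard: first discarded bit
    r = (wide >> (drop - 2)) & 1             # round: second discarded bit
    below_r = wide & ((1 << (drop - 2)) - 1) # everything less significant than R
    s = 1 if below_r != 0 else 0             # sticky: OR of those bits
    return kept, g, r, s
```

For example, dropping the low four bits of `0b10111001` keeps `0b1011` and gives G = 1, R = 0, S = 1.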

This is what you should do in order to round the result to nearest, with ties going to even, using G, R and S:

GRS - Action
0xx - round down = do nothing (x means any bit value, 0 or 1)
100 - this is a tie: round up if the mantissa's bit just before G (its least significant kept bit) is 1, else round down = do nothing
101 - round up
110 - round up
111 - round up
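
The table above collapses into a single boolean expression. A small sketch, where the function name `should_round_up` and the parameter `lsb` (the mantissa's bit just before G) are names chosen here for illustration:

```python
def should_round_up(lsb, g, r, s):
    """Return True when round-to-nearest, ties-to-even says to add 1.

    lsb is the least significant kept mantissa bit; g, r, s are the guard,
    round and sticky bits. Each argument is 0 or 1.
    """
    # G = 0: below the halfway point, never round up.
    # G = 1 with R or S set: above halfway, always round up.
    # G = 1 with R = S = 0: exact tie, round up only if the mantissa is odd.
    return bool(g and (r or s or lsb))
```

Only the 100 row depends on `lsb`. In every other row the kept mantissa bits do not affect the decision.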

Rounding up is done by adding 1 to the mantissa at its least significant bit position, the one just before G. If the mantissa overflows (the 23 least significant bits that you will store all become zeroes), you have to add 1 to the exponent. If the exponent then overflows (for single precision, the biased exponent reaches 255), you set the number to +infinity or -infinity depending on the number's sign.
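
The carry handling described above can be sketched as follows. This is an illustration for normal numbers only, not a full implementation: it assumes a 24-bit significand with the implicit leading 1 included and a biased exponent in 1..254. The names `round_pack_single` and `bits_to_float` are made up for this example.

```python
import struct

def round_pack_single(sign, biased_exp, sig24, g, r, s):
    """Round a normal single-precision value and pack it into 32 bits.

    Assumes sig24 is in [2**23, 2**24) and biased_exp is in 1..254.
    Subnormals, zeros and NaNs are deliberately left out of this sketch.
    """
    if g and (r or s or (sig24 & 1)):        # nearest, ties to even
        sig24 += 1
        if sig24 == 1 << 24:                 # mantissa overflowed to 10.000...0
            sig24 >>= 1                      # renormalise to 1.000...0
            biased_exp += 1                  # and bump the exponent
    if biased_exp >= 255:                    # exponent overflow: signed infinity
        return (sign << 31) | (0xFF << 23)
    return (sign << 31) | (biased_exp << 23) | (sig24 & 0x7FFFFF)

def bits_to_float(bits):
    """Reinterpret a 32-bit pattern as a float, for inspecting results."""
    return struct.unpack(">f", struct.pack(">I", bits))[0]
```

With `sig24 = 0xFFFFFF` and `biased_exp = 127`, a round-up carries all the way out of the mantissa and the result becomes 2.0. With `biased_exp = 254`, the same carry produces infinity.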

In the case of a tie, you add 1 to the mantissa if the mantissa is odd and you add nothing if it's even. That's what makes the result rounded to the nearest even value.
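
One way to see the tie rule in action is to let the platform do a double-to-single conversion. This sketch assumes an IEEE-754 platform running in the default rounding mode, and that `struct.pack` with the `f` format performs that conversion. The helper name `to_single` is made up for this example.

```python
import struct

def to_single(x):
    """Round a Python float (a double) to single precision and back."""
    return struct.unpack("<f", struct.pack("<f", x))[0]

ulp = 2.0 ** -23      # spacing of single-precision values in [1, 2)
half = ulp / 2        # exactly halfway between two neighbours

# Tie with an even mantissa (GRS = 100, last kept bit 0): stays put.
tie_even = to_single(1.0 + half)
# Tie with an odd mantissa (GRS = 100, last kept bit 1): goes up to the even neighbour.
tie_odd = to_single(1.0 + ulp + half)
# Just above halfway (GRS = 101): rounds up regardless of parity.
above = to_single(1.0 + half + 2.0 ** -40)

print(tie_even == 1.0, tie_odd == 1.0 + 2 * ulp, above == 1.0 + ulp)
```

All three inputs are exactly representable as doubles, so the only rounding that happens is the one under test.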

163

answered Sep 22 '22 12:09

Alexey Frunze

Tags: floating-point, rounding, binary
