
How to correctly normalize a floating point value in C++?

Maybe I don't understand the IEEE 754 standard that well, but given a set of floating-point values that are float or double, for example:

56.543f 3238.124124f 121.3f ...

you can convert them into values ranging from 0 to 1, i.e. normalize them, by choosing an appropriate common factor based on the maximum and minimum values in the set.

My point is that in this transformation I need much higher precision in the destination set, which ranges from 0 to 1, than in the original set, especially when the original values cover a wide numerical range (really big and really small values).

How can the float or double type (or the IEEE 754 standard, if you like) handle this situation and provide more precision for the second set of values, given that I basically won't need an integer part?

Or does it not handle this at all, and I need fixed-point math with a totally different type?

asked Dec 09 '13 by user2485710



1 Answer

Floating point numbers are stored in a format similar to scientific notation. Internally, they align the leading 1 of the binary representation to the top of the significand. Each value is carried with the same number of binary digits of precision relative to its own magnitude.

When you compress your set of floating point values to the range 0..1, the only precision loss you will get will be due to the rounding that occurs in the various steps of the process.
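A plain min-max rescaling of that kind might look like the sketch below; the function name and the use of std::minmax_element are my own illustration, not something given in the question or the answer:

```cpp
#include <algorithm>
#include <vector>

// Minimal min-max rescaling sketch: maps the smallest input to 0 and the
// largest to 1. Assumes a non-empty vector whose max differs from its min.
std::vector<float> normalize(const std::vector<float>& values)
{
    auto [lo, hi] = std::minmax_element(values.begin(), values.end());
    const float range = *hi - *lo;

    std::vector<float> result;
    result.reserve(values.size());
    for (float v : values)
        result.push_back((v - *lo) / range);   // each result lies in [0, 1]
    return result;
}
```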

If you're merely compressing by scaling, you will lose only a small amount of precision near the LSBs of the mantissa (around 1 or 2 ulp, where ulp means "unit in the last place").

If you also need to shift your data, then things get trickier. If your data is all positive, then subtracting off the smallest number will not damage anything. But, if your data is a mixture of positive and negative data, then some of your values near zero may suffer a loss in precision.
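Here is a small illustration of that effect; the concrete numbers are mine, chosen only to make the rounding visible:

```cpp
#include <cstdio>

int main()
{
    // Mixed-sign data: a large negative minimum and a tiny value near zero.
    float min_value = -1000.0f;
    float tiny      = 1e-7f;

    // Shifting by the minimum pushes the tiny value next to 1000, where a
    // float has only about 6e-5 of resolution, so the tiny part rounds away.
    float shifted   = tiny - min_value;     // ideally 1000.0000001
    float recovered = shifted + min_value;  // ideally 1e-7 again

    std::printf("original: %g, recovered: %g\n", tiny, recovered);
    // Typical output: original: 1e-07, recovered: 0  (the value was lost)
}
```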

If you do all the arithmetic at double precision, you'll carry 53 bits of precision through the calculation. If your precision needs fit within that (which likely they do), then you'll be fine. Otherwise, the exact numerical performance will depend on the distribution of your data.
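If the inputs arrive as float, one way to apply that advice is to carry the shift and scale in double and only convert back at the end; again, this is just a sketch of the idea, not code from the answer:

```cpp
#include <algorithm>
#include <vector>

// Sketch: normalize float data while carrying the intermediate arithmetic
// in double, so the subtraction and division keep 53 bits of precision.
std::vector<float> normalize_via_double(const std::vector<float>& values)
{
    auto [lo, hi] = std::minmax_element(values.begin(), values.end());
    const double min   = *lo;
    const double range = static_cast<double>(*hi) - min;

    std::vector<float> result;
    result.reserve(values.size());
    for (float v : values)
        result.push_back(static_cast<float>((v - min) / range));
    return result;
}
```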

answered Nov 14 '22 by Joe Z