
Is it possible to force exponent or significand of a float to match another float (Python)?

This is an interesting question that I was trying to work through the other day. Is it possible to force the significand or exponent of one float to be the same as another float in Python?

The question arises because I was trying to rescale some data so that the min and max match another data set. However, my rescaled data was slightly off (after about 6 decimal places) and it was enough to cause problems down the line.

To give an idea, I have f1 and f2 (type(f1) == type(f2) == numpy.ndarray). I want np.max(f1) == np.max(f2) and np.min(f1) == np.min(f2). To achieve this, I do:

import numpy as np

f2 = (f2-np.min(f2))/(np.max(f2)-np.min(f2)) # f2 is now between 0.0 and 1.0
f2 = f2*(np.max(f1)-np.min(f1)) + np.min(f1)  # f2 is now between min(f1) and max(f1)

The result (just as an example) would be:

np.max(f1) # 5.0230593
np.max(f2) # 5.0230602 but I need 5.0230593 

My initial thought is that forcing the exponent of the float would be the correct solution. I couldn't find much on it, so I made a workaround for my need:

exp = 0
mm = np.max(f1)

# count decimal places until the first nonzero digit (0 if mm >= 1)
while int(10**exp*mm) == 0:
    exp += 1

# add 4 digits of precision
exp += 4

scale = 10**exp

f2 = np.round(f2*scale)/scale
f1 = np.round(f1*scale)/scale

Now np.max(f2) == np.max(f1) holds.

However, is there a better way? Did I do something wrong? Is it possible to reshape a float to be similar to another float (exponent or other means)?

EDIT: as was suggested, I am now using:

scale = 10**(-np.floor(np.log10(np.max(f1))) + 4)

While my solution above will work (for my application), I'm interested to know if there's a solution that can somehow force the float to have the same exponent and/or significand so that the numbers will become identical.

asked Jan 28 '16 by Jason



3 Answers

Try replacing the second line by

f2 = f2*np.max(f1) + (1.0-f2)*np.min(f1)

Explanation: There are 2 places where the difference could creep in:

Step 1) f2 = (f2-np.min(f2))/(np.max(f2)-np.min(f2))

When you inspect np.min(f2) and np.max(f2), do you get exactly 0 and 1 or something like 1.0000003?

Step 2) f2 = f2*(np.max(f1)-np.min(f1)) + np.min(f1)

An expression like (a-b)+b doesn't always produce exactly a, due to rounding error. The suggested expression is slightly more stable.
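To see this concretely (a minimal sketch with arbitrary values, not taken from the question's data):

a, b = 0.1, -0.2
print((a - b) + b)       # 0.10000000000000003, not 0.1
print((a - b) + b == a)  # False: subtracting and re-adding b loses a bit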

For a very detailed explanation, please see What Every Computer Scientist Should Know About Floating-Point Arithmetic by David Goldberg.

answered Oct 26 '22 by Paisa Seeluangsawat


It depends what you mean by "mantissa."

Internally, floats are stored using scientific notation in base 2. So if you mean the base 2 mantissa, it is actually very easy: Just multiply or divide by powers of two (not powers of 10), and the mantissa will stay the same (provided the exponent doesn't get out of range; if it does, you'll get clamped to infinity or zero, or possibly go into denormal numbers depending on architectural details). It's important to understand that the decimal expansions will not match up when you rescale on powers of two. It's the binary expansion that's preserved with this method.
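As a small illustration (a sketch using only the standard library; 5.0230593 is just the example value from the question), math.frexp exposes the base-2 significand and exponent, and scaling by a power of two leaves the significand untouched:

import math

x = 5.0230593
m, e = math.frexp(x)    # x == m * 2**e, with 0.5 <= abs(m) < 1
y = x * 2.0**10         # multiplying by a power of two only shifts the exponent
m2, e2 = math.frexp(y)
print(m == m2, e2 - e)  # True 10: same significand, exponent shifted by 10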

But if you mean the base 10 mantissa, no, it's not possible with floats, because the rescaled value may not be exactly representable. For example, 1.1 cannot be represented exactly in base 2 (with a finite number of digits) in much the same way as 1/3 cannot be represented in base 10 (with a finite number of digits). So rescaling 11 down by 1/10 cannot be done perfectly accurately:

>>> print("%1.29f" % (11 * 0.1))
1.10000000000000008881784197001

You can, however, do the latter with decimals. Decimals work in base 10, and will behave as expected in terms of base 10 rescaling. They also provide a fairly large amount of specialized functionality to detect and handle various kinds of loss of precision. But decimals don't benefit from NumPy speedups, so if you have a very large volume of data to work with, they may not be efficient enough for your use case. Since NumPy depends on hardware support for floating point, and most (all?) modern architectures provide no hardware support for base 10, this is not easily remedied.
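A minimal sketch of the contrast (the values are illustrative):

from decimal import Decimal

print(Decimal("11") * Decimal("0.1"))  # 1.1, exact in base 10
print(11 * 0.1)                        # 1.1000000000000001 with binary floats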

answered Oct 26 '22 by Kevin


TL;DR

Use

f2 = f2*np.max(f1)-np.min(f1)*(f2-1)  # f2 is now between min(f1) and max(f1)

and make sure you're using double precision, compare floating point numbers by looking at absolute or relative differences, avoid rounding for adjusting (or comparing) floating point numbers, and don't set the underlying components of floating point numbers manually.

Details

This isn't a very easy error to reproduce, as you have discovered. However, working with floating point numbers is subject to error. E.g., adding together 1000000000 + 0.0000000001 gives 1000000000.0000000001, but this is too many significant figures even for double precision (which supports around 15 significant figures), so the trailing decimal is dropped. Moreover, some "short" numbers can't be represented exactly, as noted in @Kevin's answer. (Search for something like "floating point truncation roundoff error" for more.)
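You can check the first effect directly (a one-line sketch using the numbers from the sentence above):

print(1_000_000_000.0 + 0.000_000_000_1 == 1_000_000_000.0)  # True: the tiny addend falls below double precision's resolution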

Here's an example which does demonstrate a problem:

import numpy as np

np.set_printoptions(precision=16)

dtype=np.float32                     
f1 = np.linspace(-1000, 0.001, 3, dtype=dtype)
f2 = np.linspace(0, 1, 3, dtype=dtype)

f2 = (f2-np.min(f2))/(np.max(f2)-np.min(f2)) # f2 is now between 0.0 and 1.0
f2 = f2*(np.max(f1)-np.min(f1)) + np.min(f1)  # f2 is now between min(f1) and max(f1)

print (f1)
print (f2)

output

[ -1.0000000000000000e+03  -4.9999951171875000e+02   1.0000000474974513e-03]
[ -1.0000000000000000e+03  -4.9999951171875000e+02   9.7656250000000000e-04]

Following @Mark Dickinson's comment, I have used 32-bit floating point. This is consistent with the error you reported: a relative error of around 10^-7, at around the 7th significant figure.

In: (5.0230602 - 5.0230593) / 5.0230593
Out: 1.791736760621852e-07

Going to dtype=np.float64 makes things better but it still isn't perfect. The program above then gives

[ -1.0000000000000000e+03  -4.9999950000000001e+02   1.0000000000000000e-03]
[ -1.0000000000000000e+03  -4.9999950000000001e+02   9.9999999997635314e-04]

This isn't perfect, but is generally close enough. When comparing floating point numbers you almost never want to use strict equality, because of the possibility of small errors as noted above. Instead, subtract one number from the other and check that the absolute difference is less than some tolerance, and/or look at the relative error. See, e.g., numpy.isclose.
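For example (a small sketch using the two maxima from the question):

import numpy as np

a, b = 5.0230593, 5.0230602
print(a == b)                       # False: strict equality is fragile
print(np.isclose(a, b, rtol=1e-6))  # True: equal to within a relative tolerance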

Returning to your problem, it seems like it should be possible to do better. After all, f2 has the range 0 to 1, so you should be able to replicate the maximum in f1. The problem comes in the line

f2 = f2*(np.max(f1)-np.min(f1)) + np.min(f1)  # f2 is now between min(f1) and max(f1)

because when an element of f2 is 1 you're doing a lot more to it than just multiplying 1 by the max of f1, leading to the possibility of floating point arithmetic errors occurring. Notice that you can multiply out the brackets f2*(np.max(f1)-np.min(f1)) to f2*np.max(f1) - f2*np.min(f1), and then factorize the resulting - f2*np.min(f1) + np.min(f1) to -np.min(f1)*(f2-1), giving

f2 = f2*np.max(f1)-np.min(f1)*(f2-1)  # f2 is now between min(f1) and max(f1)

So when an element of f2 is 1, we have 1*np.max(f1) - np.min(f1)*0 = np.max(f1). Conversely, when an element of f2 is 0, we have 0*np.max(f1) - np.min(f1)*(-1) = np.min(f1). The numbers 1 and 0 can be represented exactly, so there should be no errors.

The modified program outputs

[ -1.0000000000000000e+03  -4.9999950000000001e+02   1.0000000000000000e-03]
[ -1.0000000000000000e+03  -4.9999950000000001e+02   1.0000000000000000e-03]

i.e. as desired.

Nevertheless I would still strongly recommend only using inexact floating point comparison (with tight tolerances if you need them) unless you have a very good reason not to do so. There are all sorts of subtle errors that can occur in floating point arithmetic, and the easiest way to avoid them is never to use exact comparison.

An alternative approach to that given above, which might be preferable, would be to rescale both arrays to between 0 and 1. This might be the most suitable form to use within the program. (And both arrays could be multiplied by a scaling factor, such as the original range of f1, if necessary.)
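A sketch of that alternative (to_unit_range is a hypothetical helper name; it assumes the array isn't constant, or the division would be 0/0):

import numpy as np

def to_unit_range(a):
    # Linearly rescale an array onto [0, 1]; assumes np.max(a) > np.min(a).
    lo, hi = np.min(a), np.max(a)
    return (a - lo) / (hi - lo)

f1 = np.array([-1000.0, -499.9995, 0.001])
f2 = np.array([3.0, 7.5, 12.0])
print(to_unit_range(f1), to_unit_range(f2))  # endpoints are exactly 0.0 and 1.0 in both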

Re using rounding to solve your problem, I would not recommend it. The problem with rounding -- apart from the fact that it unnecessarily reduces the accuracy of your data -- is that numbers that are very close can round in different directions. E.g.

f1 = np.array([1.000049])
f2 = np.array([1.000051])
print (f1)
print (f2)
scale = 10**(-np.floor(np.log10(np.max(f1))) + 4)
f2 = np.round(f2*scale)/scale
f1 = np.round(f1*scale)/scale
print (f1)
print (f2)

Output

[ 1.000049]
[ 1.000051]
[ 1.]
[ 1.0001]

This is related to the fact that although it's common to talk about numbers matching to so many significant figures, people don't actually compare them this way in the computer: you calculate the difference and then divide by the reference value (for a relative error).

Re mantissas and exponents, see math.frexp and math.ldexp in the standard math module documentation. However, I would not recommend setting these yourself (consider two numbers that are very close but have different exponents, for example -- do you really want to set the mantissa?). Much better to just directly set the maximum of f2 explicitly to the maximum of f1, if you want to ensure the numbers are exactly the same (and similarly for the minimum).
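A sketch of pinning the endpoints directly (the data here are illustrative; the index lookup assumes the extremes of f2 are unique):

import numpy as np

f1 = np.array([-1000.0, -499.9995, 0.001])
f2 = np.linspace(0.0, 1.0, 3)
f2 = f2*np.max(f1) - np.min(f1)*(f2 - 1)

# Force the endpoints to match exactly, in case any rounding crept in:
f2[np.argmax(f2)] = np.max(f1)
f2[np.argmin(f2)] = np.min(f1)
print(np.max(f2) == np.max(f1), np.min(f2) == np.min(f1))  # True True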

answered Oct 25 '22 by TooTone