Comparing the results of a floating point computation across a couple of different machines, they are consistently producing different results. Here is a stripped down example that reproduces the behavior: <pre class="prettyprint"><code>import numpy as np from numpy.random import randn as rand M = 1024 N = 2048 np.random.seed(0) a = rand(M,N).astype(dtype=np.float32) w = rand(N,M).astype(dtype=np.float32) b = np.dot(a, w) for i in range(10): b = b + np.dot(b, a)[:, :1024] np.divide(b, 100., out=b) print b[0,:3] </code></pre> Different machines produce different results like <ul> <li>[ -2.85753540e-05 -5.94204867e-05 -2.62337649e-04]</li> <li>[ -2.85751412e-05 -5.94208468e-05 -2.62336689e-04]</li> <li>[ -2.85754559e-05 -5.94202756e-05 -2.62337562e-04]</li> </ul> but I can also get identical results, e.g. by running on two MacBooks of the same vintage. This happens with machines that have the same version of Python and numpy, but not necessarily linked against the same BLAS libraries (e.g accelerate framework on Mac, OpenBLAS on Ubuntu). However, shouldn't different numerical libraries all conform to the same IEEE floating point standard and give exactly the same results?

Floating point calculations are not always reproducible. You may get reproducible results for floating calculations across different machines if you use the same executable image, inputs, libraries built with the same compiler and identical compiler settings (switches). However if you use a dynamically linked library you may get different results, because of numerous reasons. First of all, as Veedrac pointed in comments it might use different algorithms for its routines on different architectures. Second, a compiler might produce different code depending on switches (various optimizations, control settings). Even <code>a+b+c</code> yields non-deterministic results across machines and compilers, because we can not be sure about order of evaluation, precision in intermediate calculations. Read here why it is not guaranteed to get identical results on different <code>IEEE 754-1985</code> implementations. New standard (<code>IEEE 754-2008</code>) tries to go further, but it still doesn't guarantee identical results among different implementations, because for example it allows implementers to choose when tinyness (underflow exception) is detected More information about floating point determinism can be found in this article.

Floating point math in python / numpy not reproducible across machines

Tags:

python

floating-point

numpy

blas

Comparing the results of a floating point computation across a couple of different machines, they are consistently producing different results. Here is a stripped down example that reproduces the behavior:

import numpy as np
from numpy.random import randn as rand

M = 1024
N = 2048
np.random.seed(0)

a = rand(M,N).astype(dtype=np.float32)
w = rand(N,M).astype(dtype=np.float32)

b = np.dot(a, w)
for i in range(10):
    b = b + np.dot(b, a)[:, :1024]
    np.divide(b, 100., out=b)

print b[0,:3]

Different machines produce different results like

[ -2.85753540e-05 -5.94204867e-05 -2.62337649e-04]
[ -2.85751412e-05 -5.94208468e-05 -2.62336689e-04]
[ -2.85754559e-05 -5.94202756e-05 -2.62337562e-04]

but I can also get identical results, e.g. by running on two MacBooks of the same vintage. This happens with machines that have the same version of Python and numpy, but not necessarily linked against the same BLAS libraries (e.g accelerate framework on Mac, OpenBLAS on Ubuntu). However, shouldn't different numerical libraries all conform to the same IEEE floating point standard and give exactly the same results?

338

asked May 06 '15 00:05

Urs

1 Answers

Floating point calculations are not always reproducible.

You may get reproducible results for floating calculations across different machines if you use the same executable image, inputs, libraries built with the same compiler and identical compiler settings (switches).

However if you use a dynamically linked library you may get different results, because of numerous reasons. First of all, as Veedrac pointed in comments it might use different algorithms for its routines on different architectures. Second, a compiler might produce different code depending on switches (various optimizations, control settings). Even a+b+c yields non-deterministic results across machines and compilers, because we can not be sure about order of evaluation, precision in intermediate calculations.

Read here why it is not guaranteed to get identical results on different IEEE 754-1985 implementations. New standard (IEEE 754-2008) tries to go further, but it still doesn't guarantee identical results among different implementations, because for example it allows implementers to choose when tinyness (underflow exception) is detected

More information about floating point determinism can be found in this article.

187

answered Oct 18 '22 11:10

Alik

Related questions
                            
                                Sqlite load_extension fail for spatialite in Python
                            
                                Why do python exceptions typically not print offending values?
                            
                                segfault using numpy's lapack_lite with multiprocessing on osx, not linux
                            
                                Omit (or format) the value of a variable when documenting with Sphinx
                            
                                IOError: [Errno 22] Invalid argument when reading/writing large bytestring
                            
                                How to find leaks in Python ctypes libraries
                            
                                Parallel many dimensional optimization
                            
                                Add custom Django admin action
                            
                                How can I use Django Social Auth to connect with Twitter?
                            
                                Flask User Management : How to make Stateless Server using better authentication ways?
                            
                                How to speed up Levenshtein distance calculation
                            
                                Django - Distinguish different types of IntegrityError
                            
                                matplotlib exit after animation
                            
                                How to verify a .__getitem__() call in a Mock mock_calls list during unit testing
                            
                                Capture 192 kHz audio using Python 3
                            
                                How to close the browser after completing a download?
                            
                                How to make Sphinx Respect Importing Classes Into Package with __init__.py
                            
                                Are python 3.x venv environments relocatable?
                            
                                Pandas interpolate NaNs based on different column
                            
                                numpy.arctanh(x) for x >= 1 returns NaN but I want complex

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With