I am using Theano/NumPy for some deep learning work, and I have run into a very annoying problem. I have a weight matrix A (say 50×2048) and a feature vector b (2048-dimensional).

A is initialized with

self.alpha = np.random.random((50, 2048)).astype(np.float32) * 2 - 1.0

and b is a 2048-dimensional numpy.ndarray coming from Theano.
The problem is that

X = numpy.dot(A, b)
Y = [numpy.dot(A[i], b) for i in range(50)]

do not agree exactly: some entries of X and Y differ, and the differences are on the order of 1e-6 to 1e-7. Currently I prefer the second form to compute the dot product, since the model seems to learn better weights with it, but the first is much faster. So I am wondering why there is such a difference. Is it caused by different implementations of dot(matrix, vector) and dot(vector, vector)? Thanks a lot!
Edit: As uhoh suggested, here is a script that reproduces the problem.
import numpy as np

test_time = 1000
vector_size = 100
matrix_size = (100, 100)

for i in range(test_time):
    # fresh random float32 inputs for each trial
    a = np.random.random(matrix_size).astype(np.float32) * 2 - 1.0
    b = np.random.random(vector_size).astype(np.float32)
    # full matrix-vector product vs. row-by-row dot products
    x = np.dot(a, b)
    y = [np.dot(a[j], b) for j in range(a.shape[0])]
    for k in range(len(y)):
        epsilon = x[k] - y[k]
        if abs(epsilon) > 1e-7:
            print('Diff: {0}\t{1}\t{2}'.format(x[k], y[k], epsilon))
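For what it's worth, this kind of discrepancy does not require np.dot at all: float32 addition is not associative, so any two routines that accumulate the same products in a different order can legitimately disagree in the last few bits. A standalone sketch (the seed and size here are only illustrative, and the exact magnitude will vary by machine):

import numpy as np

np.random.seed(0)  # only to make the example repeatable
v = np.random.random(2048).astype(np.float32) * 2 - 1.0

forward = v.sum()             # NumPy's built-in (typically pairwise) reduction
sequential = np.float32(0.0)
for value in v:               # naive left-to-right accumulation
    sequential += value

print(forward - sequential)   # usually nonzero, at roughly the 1e-6/1e-7 level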
Well, there is usually a trade-off between performance and precision, and you may have to sacrifice one in favor of the other. That said, I personally do not believe a difference of 0.0000001 is a big deal in most applications. If you need higher precision, you had better go with float64, but note that float64 operations are extremely slow on GPUs, especially the NVIDIA 9xx series.
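If you want to check how much of the gap is explained by float32 rounding, a quick experiment (a sketch along the lines of the reproduction script above; the shapes are illustrative) is to run the same comparison in both precisions and look at the largest disagreement. In float64 it should shrink by many orders of magnitude, to around the 1e-15 level:

import numpy as np

a32 = np.random.random((100, 100)).astype(np.float32) * 2 - 1.0
b32 = np.random.random(100).astype(np.float32)

for dtype in (np.float32, np.float64):
    a = a32.astype(dtype)
    b = b32.astype(dtype)
    x = np.dot(a, b)                                             # matrix-vector product
    y = np.array([np.dot(a[i], b) for i in range(a.shape[0])])   # row-by-row
    print(dtype.__name__, np.abs(x - y).max())                   # max disagreement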
I should also note that the issue seems to depend on your hardware setup, because I do not encounter it on my machine.

You can also use np.allclose(x, y) to check whether the difference is actually significant.
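For example, with the x and y from the reproduction script above (a usage sketch; y is converted to an array for the element-wise comparison):

y = np.asarray(y)
# The default tolerances (rtol=1e-5, atol=1e-8) are looser than the observed
# 1e-6 to 1e-7 differences, so this should print True:
print(np.allclose(x, y))
# A much stricter absolute tolerance would likely flag them:
print(np.allclose(x, y, rtol=0.0, atol=1e-9))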