Precision, why do Matlab and Python numpy give so different outputs?

Tags:

I know about basic data types and that float types (float,double) can not hold some numbers exactly.

In porting some code from Matlab to Python (Numpy) I however found some significant differences in calculations, and I think it's going back to precision.

Take the following code, z-normalizing a 500 dimensional vector with only first two elements having a non-zero value.

Matlab:

Z = repmat(0,500,1); Z(1)=3;Z(2)=1;
Za = (Z-repmat(mean(Z),500,1)) ./ repmat(std(Z),500,1);
Za(1)
>>> 21.1694

Python:

from numpy import zeros,mean,std
Z = zeros((500,))
Z[0] = 3
Z[1] = 1
Za = (Z - mean(Z)) / std(Z)
print Za[0]
>>> 21.1905669677

Besides that the formatting shows a bit more digits in Python, there is a huge difference (imho), more than 0.02

Both Python and Matlab are using a 64 bit data type (afaik). Python uses 'numpy.float64' and Matlab 'double'.

Why is the difference so huge? Which one is more correct?

557

asked Sep 20 '11 08:09

Peter Smit

1 Answers

Maybe the difference comes from the mean and std calls. Compare those first.

There are several definitions for std, some use the sqaure root of

1 / n * sum((xi - mean(x)) ** 2)

others use

1 / (n - 1) * sum((xi - mean(x)) ** 2)

instead.

From a mathematical point: these formulas are estimators of the variance of a normal distributed random variable. The distribution has two parameters sigma and mu. If you know mu exactly the optimal estimator for sigma ** 2 is

1 / n * sum((xi - mu) ** 2)

If you have to estimate mu from the data using mu = mean(xi), the optimal estimator for sigma**2 is

1 / (n - 1) * sum((xi- mean(x))**2)

133

answered Sep 29 '22 11:09

rocksportrocker

Related questions
                            
                                pip, proxy authentication and "Not supported proxy scheme"
                            
                                Django custom command error: unrecognized arguments
                            
                                sklearn: how to get coefficients of polynomial features
                            
                                How do I use Python and lxml to parse a local html file?
                            
                                Add newline to string, cross-platform
                            
                                How to install python module extras with pip requirements.txt file
                            
                                What is a practical difference between check_call check_output call, and Popen methods in the subprocess module?
                            
                                django TypeError: get() got multiple values for keyword argument 'invoice_id'
                            
                                How do I print the local and remote address and port of a connected socket?
                            
                                Scrapy: HTTP status code is not handled or not allowed?
                            
                                How to check if a pandas dataframe contains only numeric column wise?
                            
                                How to remove list items depending on predecessor in python
                            
                                How do I pass an async function to a thread target in Python?
                            
                                What is the difference between np.linspace and np.arange?
                            
                                No code completion and syntax highlighting in Pydev
                            
                                WxPython Incompatible With Snow Leopard?
                            
                                How to capture the output from "subprocess.call" to a file?
                            
                                Automatically growing lists in Python
                            
                                Error when creating a PostgreSQL database using python, sqlalchemy and psycopg2
                            
                                Problems importing python-Xlib

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Precision, why do Matlab and Python numpy give so different outputs?

Tags:

python

statistics

matlab

floating-point-precision

Peter Smit

People also ask

1 Answers

rocksportrocker

Recent Activity

Donate For Us