I have to multiply matrices A (100x8000), B (8000x27) and C (27x1). Since matrices B and C are constant and A is variable, I prefer to calculate it as ABC = np.dot(A, np.dot(B, C)). However, I wonder whether this may be numerically worse (in terms of accuracy) than np.dot(np.dot(A, B), C).
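For illustration, here is a minimal sketch of the two groupings, using random placeholder data in place of the real matrices (only the shapes are taken from above):

    import numpy as np

    # Placeholder data with the shapes from the question.
    rng = np.random.default_rng(0)
    A = rng.random((100, 8000))   # variable
    B = rng.random((8000, 27))    # constant
    C = rng.random((27, 1))       # constant

    BC = np.dot(B, C)               # can be precomputed once, since B and C are constant
    right = np.dot(A, BC)           # A(BC)
    left = np.dot(np.dot(A, B), C)  # (AB)C

    # Mathematically identical, but the floating-point results can differ slightly.
    print(np.max(np.abs(right - left)))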
What may be important: matrices A and B contain 8000 samples of (respectively) 100 and 27 correlated features.

Is there a numerically optimal (in terms of accuracy) order of the multiplication? If yes, how may I determine it?

It may be assumed that both the A and B matrices are nonnegative.
Moreover:
C = np.linalg.solve(cov(B, k), X), where X is a 27x1 matrix of 27 (possibly correlated) random variables of unknown distribution, cov = lambda X, k: np.dot(X.T, X) + k * np.eye(X.shape[1]), and k is a nonnegative constant minimizing the expression:
    sum((X[i, 0] - np.dot(np.dot(B[:, [i]].T, drop(B, i)),
                          np.linalg.solve(cov(drop(B, i), k),
                                          np.delete(X, i, axis=0)))) ** 2
        for i in range(27))
The drop() function is defined as lambda X, i: np.delete(X, i, axis=1).
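For completeness, a minimal runnable sketch of this selection of k. The bounded scipy.optimize.minimize_scalar search and the random placeholder data are my own assumptions, not part of the original setup:

    import numpy as np
    from scipy.optimize import minimize_scalar  # assumption: a bounded 1-D search is acceptable

    cov = lambda X, k: np.dot(X.T, X) + k * np.eye(X.shape[1])
    drop = lambda X, i: np.delete(X, i, axis=1)

    def loo_objective(k, B, X):
        # Leave-one-column-out squared error from the expression above.
        total = 0.0
        for i in range(B.shape[1]):
            Bi = drop(B, i)                                   # B without column i
            pred = np.dot(np.dot(B[:, [i]].T, Bi),
                          np.linalg.solve(cov(Bi, k), np.delete(X, i, axis=0)))
            total += (X[i, 0] - pred[0, 0]) ** 2
        return total

    # Placeholder data standing in for the real B and X.
    rng = np.random.default_rng(0)
    B = rng.random((8000, 27))
    X = rng.standard_normal((27, 1))

    res = minimize_scalar(loo_objective, bounds=(0.0, 1e3), args=(B, X), method='bounded')
    k = res.x
    C = np.linalg.solve(cov(B, k), X)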
It may be assumed that np.cov(B.T, B) is a covariance matrix of X, which follows a multivariate Gaussian distribution.
At the moment the best idea I have (for a particular set of matrices) is to perform the following numerical experiment: compute both orderings in several floating-point precisions (np.float64, np.float32, even np.float16) and compare the results against the most precise one available.
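A minimal sketch of such an experiment, again with random placeholder data standing in for the real A, B and C, and np.longdouble as the reference precision (on some platforms np.longdouble is just float64):

    import numpy as np

    # Placeholder data with the shapes from the question.
    rng = np.random.default_rng(0)
    A = rng.random((100, 8000))
    B = rng.random((8000, 27))
    C = rng.random((27, 1))

    # Reference result in the highest precision conveniently available.
    ref = np.dot(np.dot(A.astype(np.longdouble), B.astype(np.longdouble)),
                 C.astype(np.longdouble))

    for dt in (np.float64, np.float32, np.float16):
        Ad, Bd, Cd = A.astype(dt), B.astype(dt), C.astype(dt)
        left = np.dot(np.dot(Ad, Bd), Cd)   # (AB)C
        right = np.dot(Ad, np.dot(Bd, Cd))  # A(BC)
        err_left = np.max(np.abs(left.astype(np.longdouble) - ref))
        err_right = np.max(np.abs(right.astype(np.longdouble) - ref))
        print(dt.__name__, err_left, err_right)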