I do not quite understand why numpy.linalg.solve() gives the more precise answer, whereas numpy.linalg.inv() breaks down somewhat, giving (what I believe are) estimates.

For a concrete example, I am computing x = C^{-1} * d, where C is a matrix and d is a vector. For the sake of discussion, C has shape (1000, 1000) and d has shape (1, 1000).
numpy.linalg.solve(A, b) solves the equation A * x = b for x, i.e. x = A^{-1} * b.
Therefore, I could solve this equation either by

(1)

inverse = numpy.linalg.inv(C)
result = numpy.dot(inverse, d)  # matrix-vector product, not element-wise `*`

or by (2)

result = numpy.linalg.solve(C, d)
Method (2) gives far more precise results. Why is this?
What exactly is happening such that one "works better" than the other?
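For reference, a minimal script that reproduces the comparison. I use a random C and d here rather than my real data, and make d a flat vector of length 1000 so both products are well-defined:

```python
import numpy as np

rng = np.random.default_rng(0)
C = rng.standard_normal((1000, 1000))
d = rng.standard_normal(1000)

# Method (1): explicit inverse, then a matrix-vector product
x_inv = np.linalg.inv(C) @ d

# Method (2): direct solve
x_solve = np.linalg.solve(C, d)

# Residual norms ||C @ x - d||; smaller is better, and solve
# typically comes out noticeably smaller
print("inv:  ", np.linalg.norm(C @ x_inv - d))
print("solve:", np.linalg.norm(C @ x_solve - d))
```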
np.linalg.solve(A, b) does not compute the inverse of A. Instead it calls one of the gesv LAPACK routines, which first factorizes A using LU decomposition, then solves for x using forward and backward substitution (see here).
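To make those two steps concrete, here is a rough sketch of what gesv does, expressed with SciPy's LU helpers. SciPy is used purely for illustration; np.linalg.solve calls LAPACK directly rather than going through these functions:

```python
import numpy as np
from scipy.linalg import lu_factor, lu_solve

rng = np.random.default_rng(1)
A = rng.standard_normal((5, 5))
b = rng.standard_normal(5)

# Factorization step: P A = L U (LAPACK getrf)
lu, piv = lu_factor(A)

# Substitution step: forward-solve L y = P b,
# then back-solve U x = y (LAPACK getrs)
x = lu_solve((lu, piv), b)

print(np.allclose(x, np.linalg.solve(A, b)))  # True
```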
np.linalg.inv uses the same method to compute the inverse of A, by solving for A^{-1} in A · A^{-1} = I, where I is the identity*. The factorization step is exactly the same as above, but it takes more floating point operations to solve for A^{-1} (an n×n matrix) than for x (an n-long vector). Additionally, if you then wanted to obtain x via the identity A^{-1} · b = x, the extra matrix multiplication would incur yet more floating point operations, and therefore slower performance and more numerical error.

There's no need for the intermediate step of computing A^{-1}: it is faster and more accurate to obtain x directly.
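If you want to check the speed claim yourself, here is a quick timing sketch; the exact numbers will vary with your machine and BLAS/LAPACK build, but solve should come out ahead:

```python
import numpy as np
import timeit

rng = np.random.default_rng(2)
A = rng.standard_normal((1000, 1000))
b = rng.standard_normal(1000)

# Time both approaches over several repetitions
t_solve = timeit.timeit(lambda: np.linalg.solve(A, b), number=20)
t_inv = timeit.timeit(lambda: np.linalg.inv(A) @ b, number=20)
print(f"solve:      {t_solve:.3f} s")
print(f"inv+matmul: {t_inv:.3f} s")
```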
* The relevant bit of source for inv is here. Unfortunately it's a bit tricky to understand, since it's templated C. The important thing to note is that an identity matrix is being passed to the LAPACK solver as parameter B.
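As a sanity check of that footnote, solving against the identity does reproduce the inverse. This is a toy demonstration of the relationship, not the actual code path inv takes:

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 4))

# inv(A) is effectively the solution X of A @ X = I
X = np.linalg.solve(A, np.eye(4))
print(np.allclose(X, np.linalg.inv(A)))  # True
```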