What are the advantages and disadvantages of each? From what I've seen, either one can work as a replacement for the other if need be, so should I bother using both or should I stick to just one of them? Will the style of the program influence my choice? I am doing some machine learning using numpy, so there are indeed lots of matrices, but also lots of vectors (arrays).

Numpy matrices are strictly 2-dimensional, while numpy arrays (ndarrays) are N-dimensional. Matrix objects are a subclass of ndarray, so they inherit all the attributes and methods of ndarrays. The main advantage of numpy matrices is that they provide a convenient notation for matrix multiplication: if a and b are matrices, then <code>a*b</code> is their matrix product. <pre class="prettyprint"><code>import numpy as np a = np.mat('4 3; 2 1') b = np.mat('1 2; 3 4') print(a) # [[4 3] # [2 1]] print(b) # [[1 2] # [3 4]] print(a*b) # [[13 20] # [ 5 8]] </code></pre> On the other hand, as of Python 3.5, NumPy supports infix matrix multiplication using the <code>@</code> operator, so you can achieve the same convenience of matrix multiplication with ndarrays in Python >= 3.5. <pre class="prettyprint"><code>import numpy as np a = np.array([[4, 3], [2, 1]]) b = np.array([[1, 2], [3, 4]]) print(a@b) # [[13 20] # [ 5 8]] </code></pre> Both matrix objects and ndarrays have <code>.T</code> to return the transpose, but matrix objects also have <code>.H</code> for the conjugate transpose, and <code>.I</code> for the inverse. In contrast, numpy arrays consistently abide by the rule that operations are applied element-wise (except for the new <code>@</code> operator). Thus, if <code>a</code> and <code>b</code> are numpy arrays, then <code>a*b</code> is the array formed by multiplying the components element-wise: <pre class="prettyprint"><code>c = np.array([[4, 3], [2, 1]]) d = np.array([[1, 2], [3, 4]]) print(c*d) # [[4 6] # [6 4]] </code></pre> To obtain the result of matrix multiplication, you use <code>np.dot</code> (or <code>@</code> in Python >= 3.5, as shown above): <pre class="prettyprint"><code>print(np.dot(c,d)) # [[13 20] # [ 5 8]] </code></pre> The <code>**</code> operator also behaves differently: <pre class="prettyprint"><code>print(a**2) # [[22 15] # [10 7]] print(c**2) # [[16 9] # [ 4 1]] </code></pre> Since <code>a</code> is a matrix, <code>a**2</code> returns the matrix product <code>a*a</code>. Since <code>c</code> is an ndarray, <code>c**2</code> returns an ndarray with each component squared element-wise. There are other technical differences between matrix objects and ndarrays (having to do with <code>np.ravel</code>, item selection and sequence behavior). The main advantage of numpy arrays is that they are more general than 2-dimensional matrices. What happens when you want a 3-dimensional array? Then you have to use an ndarray, not a matrix object. Thus, learning to use matrix objects is more work -- you have to learn matrix object operations, and ndarray operations. Writing a program that mixes both matrices and arrays makes your life difficult because you have to keep track of what type of object your variables are, lest multiplication return something you don't expect. In contrast, if you stick solely with ndarrays, then you can do everything matrix objects can do, and more, except with slightly different functions/notation. If you are willing to give up the visual appeal of NumPy matrix product notation (which can be achieved almost as elegantly with ndarrays in Python >= 3.5), then I think NumPy arrays are definitely the way to go. PS. Of course, you really don't have to choose one at the expense of the other, since <code>np.asmatrix</code> and <code>np.asarray</code> allow you to convert one to the other (as long as the array is 2-dimensional). <hr> There is a synopsis of the differences between NumPy <code>arrays</code> vs NumPy <code>matrix</code>es here.

Scipy.org recommends that you use arrays: <blockquote> *'array' or 'matrix'? Which should I use? - Short answer Use arrays. <ul> <li>They support multidimensional array algebra that is supported in MATLAB</li> <li>They are the standard vector/matrix/tensor type of NumPy. Many NumPy functions return arrays, not matrices.</li> <li>There is a clear distinction between element-wise operations and linear algebra operations.</li> <li>You can have standard vectors or row/column vectors if you like.</li> </ul> Until Python 3.5 the only disadvantage of using the array type was that you had to use <code>dot</code> instead of <code>*</code> to multiply (reduce) two tensors (scalar product, matrix vector multiplication etc.). Since Python 3.5 you can use the matrix multiplication <code>@</code> operator. Given the above, we intend to deprecate <code>matrix</code> eventually. </blockquote>

As per the official documents, it's not anymore advisable to use matrix class since it will be removed in the future. https://numpy.org/doc/stable/reference/generated/numpy.matrix.html As other answers already state that you can achieve all the operations with NumPy arrays.

Just to add one case to unutbu's list. One of the biggest practical differences for me of numpy ndarrays compared to numpy matrices or matrix languages like matlab, is that the dimension is not preserved in reduce operations. Matrices are always 2d, while the mean of an array, for example, has one dimension less. For example demean rows of a matrix or array: with matrix <pre class="prettyprint"><code>>>> m = np.mat([[1,2],[2,3]]) >>> m matrix([[1, 2], [2, 3]]) >>> mm = m.mean(1) >>> mm matrix([[ 1.5], [ 2.5]]) >>> mm.shape (2, 1) >>> m - mm matrix([[-0.5, 0.5], [-0.5, 0.5]]) </code></pre> with array <pre class="prettyprint"><code>>>> a = np.array([[1,2],[2,3]]) >>> a array([[1, 2], [2, 3]]) >>> am = a.mean(1) >>> am.shape (2,) >>> am array([ 1.5, 2.5]) >>> a - am #wrong array([[-0.5, -0.5], [ 0.5, 0.5]]) >>> a - am[:, np.newaxis] #right array([[-0.5, 0.5], [-0.5, 0.5]]) </code></pre> I also think that mixing arrays and matrices gives rise to many "happy" debugging hours. However, scipy.sparse matrices are always matrices in terms of operators like multiplication.

As others have mentioned, perhaps the main advantage of <code>matrix</code> was that it provided a convenient notation for matrix multiplication. However, in Python 3.5 there is finally a dedicated infix operator for matrix multiplication: <code>@</code>. With recent NumPy versions, it can be used with <code>ndarray</code>s: <pre class="prettyprint"><code>A = numpy.ones((1, 3)) B = numpy.ones((3, 3)) A @ B </code></pre> So nowadays, even more, when in doubt, you should stick to <code>ndarray</code>.

<h3>Matrix Operations with Numpy Arrays:</h3> I would like to keep updating this answer about matrix operations with numpy arrays if some users are interested looking for information about matrices and numpy. As the accepted answer, and the numpy-ref.pdf said: <blockquote> class numpy.matrix will be removed in the future. </blockquote> So now matrix algebra operations has to be done with Numpy Arrays. <pre class="prettyprint"><code>a = np.array([[1,3],[-2,4]]) b = np.array([[3,-2],[5,6]]) </code></pre> Matrix Multiplication (infix matrix multiplication) <pre class="prettyprint"><code>a@b array([[18, 16], [14, 28]]) </code></pre> Transpose: <pre class="prettyprint"><code>ab = a@b ab.T array([[18, 14], [16, 28]]) </code></pre> Inverse of a matrix: <pre class="prettyprint"><code>np.linalg.inv(ab) array([[ 0.1 , -0.05714286], [-0.05 , 0.06428571]]) ab_i=np.linalg.inv(ab) ab@ab_i # proof of inverse array([[1., 0.], [0., 1.]]) # identity matrix </code></pre> Determinant of a matrix. <pre class="prettyprint"><code>np.linalg.det(ab) 279.9999999999999 </code></pre> Solving a Linear System: <pre class="prettyprint"><code>1. x + y = 3, x + 2y = -8 b = np.array([3,-8]) a = np.array([[1,1], [1,2]]) x = np.linalg.solve(a,b) x array([ 14., -11.]) # Solution x=14, y=-11 </code></pre> Eigenvalues and Eigenvectors: <pre class="prettyprint"><code>a = np.array([[10,-18], [6,-11]]) np.linalg.eig(a) (array([ 1., -2.]), array([[0.89442719, 0.83205029], [0.4472136 , 0.5547002 ]]) </code></pre>

What are the differences between numpy arrays and matrices? Which one should I use?

6 Answers

Numpy matrices are strictly 2-dimensional, while numpy arrays (ndarrays) are N-dimensional. Matrix objects are a subclass of ndarray, so they inherit all the attributes and methods of ndarrays.

The main advantage of numpy matrices is that they provide a convenient notation for matrix multiplication: if a and b are matrices, then a*b is their matrix product.

import numpy as np

a = np.mat('4 3; 2 1')
b = np.mat('1 2; 3 4')
print(a)
# [[4 3]
#  [2 1]]
print(b)
# [[1 2]
#  [3 4]]
print(a*b)
# [[13 20]
#  [ 5  8]]

On the other hand, as of Python 3.5, NumPy supports infix matrix multiplication using the @ operator, so you can achieve the same convenience of matrix multiplication with ndarrays in Python >= 3.5.

import numpy as np

a = np.array([[4, 3], [2, 1]])
b = np.array([[1, 2], [3, 4]])
print(a@b)
# [[13 20]
#  [ 5  8]]

Both matrix objects and ndarrays have .T to return the transpose, but matrix objects also have .H for the conjugate transpose, and .I for the inverse.

In contrast, numpy arrays consistently abide by the rule that operations are applied element-wise (except for the new @ operator). Thus, if a and b are numpy arrays, then a*b is the array formed by multiplying the components element-wise:

c = np.array([[4, 3], [2, 1]])
d = np.array([[1, 2], [3, 4]])
print(c*d)
# [[4 6]
#  [6 4]]

To obtain the result of matrix multiplication, you use np.dot (or @ in Python >= 3.5, as shown above):

print(np.dot(c,d))
# [[13 20]
#  [ 5  8]]

The ** operator also behaves differently:

print(a**2)
# [[22 15]
#  [10  7]]
print(c**2)
# [[16  9]
#  [ 4  1]]

Since a is a matrix, a**2 returns the matrix product a*a. Since c is an ndarray, c**2 returns an ndarray with each component squared element-wise.

There are other technical differences between matrix objects and ndarrays (having to do with np.ravel, item selection and sequence behavior).

The main advantage of numpy arrays is that they are more general than 2-dimensional matrices. What happens when you want a 3-dimensional array? Then you have to use an ndarray, not a matrix object. Thus, learning to use matrix objects is more work -- you have to learn matrix object operations, and ndarray operations.

Writing a program that mixes both matrices and arrays makes your life difficult because you have to keep track of what type of object your variables are, lest multiplication return something you don't expect.

In contrast, if you stick solely with ndarrays, then you can do everything matrix objects can do, and more, except with slightly different functions/notation.

If you are willing to give up the visual appeal of NumPy matrix product notation (which can be achieved almost as elegantly with ndarrays in Python >= 3.5), then I think NumPy arrays are definitely the way to go.

PS. Of course, you really don't have to choose one at the expense of the other, since np.asmatrix and np.asarray allow you to convert one to the other (as long as the array is 2-dimensional).

There is a synopsis of the differences between NumPy arrays vs NumPy matrixes here.

153

answered Oct 05 '22 23:10

unutbu

Scipy.org recommends that you use arrays:

*'array' or 'matrix'? Which should I use? - Short answer

Use arrays.

They support multidimensional array algebra that is supported in MATLAB

They are the standard vector/matrix/tensor type of NumPy. Many NumPy functions return arrays, not matrices.

There is a clear distinction between element-wise operations and linear algebra operations.

You can have standard vectors or row/column vectors if you like.

Until Python 3.5 the only disadvantage of using the array type was that you had to use dot instead of * to multiply (reduce) two tensors (scalar product, matrix vector multiplication etc.). Since Python 3.5 you can use the matrix multiplication @ operator.

Given the above, we intend to deprecate matrix eventually.

answered Oct 06 '22 00:10

atomh33ls

As per the official documents, it's not anymore advisable to use matrix class since it will be removed in the future.

https://numpy.org/doc/stable/reference/generated/numpy.matrix.html

As other answers already state that you can achieve all the operations with NumPy arrays.

answered Oct 05 '22 22:10

Aks

Just to add one case to unutbu's list.

One of the biggest practical differences for me of numpy ndarrays compared to numpy matrices or matrix languages like matlab, is that the dimension is not preserved in reduce operations. Matrices are always 2d, while the mean of an array, for example, has one dimension less.

For example demean rows of a matrix or array:

with matrix

>>> m = np.mat([[1,2],[2,3]])
>>> m
matrix([[1, 2],
        [2, 3]])
>>> mm = m.mean(1)
>>> mm
matrix([[ 1.5],
        [ 2.5]])
>>> mm.shape
(2, 1)
>>> m - mm
matrix([[-0.5,  0.5],
        [-0.5,  0.5]])

with array

>>> a = np.array([[1,2],[2,3]])
>>> a
array([[1, 2],
       [2, 3]])
>>> am = a.mean(1)
>>> am.shape
(2,)
>>> am
array([ 1.5,  2.5])
>>> a - am #wrong
array([[-0.5, -0.5],
       [ 0.5,  0.5]])
>>> a - am[:, np.newaxis]  #right
array([[-0.5,  0.5],
       [-0.5,  0.5]])

I also think that mixing arrays and matrices gives rise to many "happy" debugging hours. However, scipy.sparse matrices are always matrices in terms of operators like multiplication.

answered Oct 06 '22 00:10

Josef

As others have mentioned, perhaps the main advantage of matrix was that it provided a convenient notation for matrix multiplication.

However, in Python 3.5 there is finally a dedicated infix operator for matrix multiplication: @.

With recent NumPy versions, it can be used with ndarrays:

A = numpy.ones((1, 3))
B = numpy.ones((3, 3))
A @ B

So nowadays, even more, when in doubt, you should stick to ndarray.

answered Oct 05 '22 23:10

Peque

Matrix Operations with Numpy Arrays:

I would like to keep updating this answer about matrix operations with numpy arrays if some users are interested looking for information about matrices and numpy.

As the accepted answer, and the numpy-ref.pdf said:

class numpy.matrix will be removed in the future.

So now matrix algebra operations has to be done with Numpy Arrays.

a = np.array([[1,3],[-2,4]])
b = np.array([[3,-2],[5,6]])

Matrix Multiplication (infix matrix multiplication)

a@b
array([[18, 16],
       [14, 28]])

Transpose:

ab = a@b
ab.T       
array([[18, 14],
       [16, 28]])

Inverse of a matrix:

np.linalg.inv(ab)
array([[ 0.1       , -0.05714286],
       [-0.05      ,  0.06428571]])      

ab_i=np.linalg.inv(ab) 
ab@ab_i  # proof of inverse
array([[1., 0.],
       [0., 1.]]) # identity matrix

Determinant of a matrix.

np.linalg.det(ab)
279.9999999999999

Solving a Linear System:

1.   x + y = 3,
    x + 2y = -8
b = np.array([3,-8])
a = np.array([[1,1], [1,2]])
x = np.linalg.solve(a,b)
x
array([ 14., -11.])
# Solution x=14, y=-11

Eigenvalues and Eigenvectors:

a = np.array([[10,-18], [6,-11]])
np.linalg.eig(a)
(array([ 1., -2.]), array([[0.89442719, 0.83205029],
        [0.4472136 , 0.5547002 ]])

answered Oct 05 '22 23:10

rubengavidia0x

Related questions
                            
                                How can I find all matches to a regular expression in Python?
                            
                                How do I run all Python unit tests in a directory?
                            
                                time.sleep -- sleeps thread or process?
                            
                                What's the difference between select_related and prefetch_related in Django ORM?
                            
                                What is the difference between an expression and a statement in Python?
                            
                                Is there a standardized method to swap two variables in Python?
                            
                                Convert DataFrame column type from string to datetime
                            
                                Converting Python dict to kwargs?
                            
                                defaultdict of defaultdict?
                            
                                How to update/upgrade a package using pip?
                            
                                Re-ordering columns in pandas dataframe based on column name [duplicate]
                            
                                Remove xticks in a matplotlib plot?
                            
                                What are the differences between json and simplejson Python modules?
                            
                                How to reversibly store and load a Pandas dataframe to/from disk
                            
                                Installing python module within code
                            
                                How do I detect the Python version at runtime? [duplicate]
                            
                                Pylint, PyChecker or PyFlakes? [closed]
                            
                                How to specify multiple return types using type-hints
                            
                                How do I get a value of datetime.today() in Python that is "timezone aware"?
                            
                                Stripping everything but alphanumeric chars from a string in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What are the differences between numpy arrays and matrices? Which one should I use?

Tags:

python

arrays

matrix

numpy

levesque

People also ask

6 Answers

unutbu

atomh33ls

Aks

Josef

Peque

Matrix Operations with Numpy Arrays:

rubengavidia0x

Recent Activity

Donate For Us