Difference in matrix multiplication tensorflow vs numpy

Tags: python, matrix, numpy, tensorflow

I have a case where multiplying two matrices with certain dimensions works in NumPy but not in TensorFlow.

import numpy as np

x = np.ndarray(shape=(10,20,30), dtype=float)
y = np.ndarray(shape=(30,40), dtype=float)
z = np.matmul(x,y)
print("np shapes: %s x %s = %s" % (np.shape(x), np.shape(y), np.shape(z)))

This works as expected and prints:

np shapes: (10, 20, 30) x (30, 40) = (10, 20, 40)

However, in TensorFlow, when I try to multiply a placeholder and a variable with the same shapes as the NumPy arrays above, I get an error:

import tensorflow as tf

x = tf.placeholder(tf.float32, shape=(10,20,30))
y = tf.Variable(tf.truncated_normal([30,40], name='w'))
print("tf shapes: %s x %s" % (x.get_shape(), y.get_shape()))
tf.matmul(x,y)

This results in:

tf shapes: (10, 20, 30) x (30, 40)
InvalidArgumentError: 
Shape must be rank 2 but is rank 3 for 'MatMul_12' 
(op: 'MatMul') with input shapes: [10,20,30], [30,40].

Why does this operation fail?

asked Feb 12 '17 21:02

Kuba

1 Answer

I don't know why tf.matmul does not support this kind of multiplication (maybe one of the core developers could provide a meaningful answer).

But if you just want to be able to multiply tensors in this way, take a look at the tf.einsum function. It can operate on tensors of arbitrary rank.
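As a rough illustration of the einsum approach, here is a minimal sketch of the subscript string for the shapes in the question. It is written with NumPy's np.einsum so that the result can be checked against np.matmul; the random input data and the variable names are assumptions made for the example, not part of the original post.

```python
import numpy as np

# Same shapes as in the question; random data is an assumption,
# used only so that the result can be compared against np.matmul.
rng = np.random.default_rng(0)
x = rng.standard_normal((10, 20, 30))
y = rng.standard_normal((30, 40))

# "ijk,kl->ijl": sum over the shared axis k (last axis of x, first axis of y),
# keeping the leading axes i, j of x and the trailing axis l of y.
z = np.einsum("ijk,kl->ijl", x, y)

print(z.shape)  # (10, 20, 40)

# The contraction matches NumPy's broadcasting matmul.
print(np.allclose(z, np.matmul(x, y)))
```

tf.einsum uses the same subscript notation, so the TensorFlow call for the tensors in the question should be tf.einsum("ijk,kl->ijl", x, y).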

answered Oct 18 '22 22:10

Dmitriy Danevskiy
