In numpy, we can do this:
x = np.random.random((10, 10))
a = np.random.randint(0, 10, 5)
b = np.random.randint(0, 10, 5)
x[a, b]  # gives 5 entries from x, indexed according to the corresponding entries in a and b
When I try something equivalent in TensorFlow:
xt = tf.constant(x)
at = tf.constant(a)
bt = tf.constant(b)
xt[at, bt]
The last line raises a "Bad slice index tensor" exception. It seems TensorFlow doesn't support indexing the way NumPy or Theano do.
Does anybody know if there is a TensorFlow way of doing this (indexing a tensor by arbitrary values)? I've seen the tf.nn.embedding part, but I'm not sure it can be used for this, and even if it can, it's a huge workaround for something this straightforward.
(Right now, I'm feeding the data from x as an input and doing the indexing in NumPy, but I hoped to put x inside TensorFlow to get higher efficiency.)
In the TensorFlow for R interface, single-element indexing of a 1-D tensor works mostly as expected. Like R, it is 1-based. Unlike R, though, it accepts negative indices for indexing from the end of the array. (In R, negative indices are used to remove elements.)
TensorFlow implements a subset of the NumPy API, available as tf.experimental.numpy. This allows running NumPy code, accelerated by TensorFlow, while also allowing access to all of TensorFlow's APIs.
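A minimal sketch of this interop, assuming TensorFlow 2.4+ where tf.experimental.numpy is available (how much of NumPy's advanced indexing it supports depends on the version):

import tensorflow.experimental.numpy as tnp

x = tnp.ones((10, 10))     # ND array backed by a tf.Tensor
print(x.shape, x.dtype)    # NumPy-like attributes
print(tnp.sum(x, axis=0))  # NumPy-style call, executed by TensorFlow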
NumPy and TensorFlow are actually very similar in many respects. Both are, essentially, array manipulation libraries, built around the concept of tensors (or nd-arrays, in NumPy terms).
TensorFlow can't do much magic to be better here (while guaranteeing the same accuracy); in my tests, TensorFlow is consistently much slower than NumPy.
You can actually do that now with tf.gather_nd. Let's say you have a matrix m like the following:
| 1 2 3 4 |
| 5 6 7 8 |
And you want to build a matrix r of size, let's say, 4x2, built from elements of m, like this:
| 3 6 |
| 2 7 |
| 5 3 |
| 1 1 |
Each element of r corresponds to a row and column of m, and you can have matrices rows and cols with these indices (zero-based, since we are programming, not doing math!):
       | 0 1 |        | 2 1 |
rows = | 0 1 | cols = | 1 2 |
       | 1 0 |        | 0 2 |
       | 0 0 |        | 0 0 |
Which you can stack into a 3-dimensional tensor like this:
| | 0 2 | | 1 1 | |
| | 0 1 | | 1 2 | |
| | 1 0 | | 0 2 | |
| | 0 0 | | 0 0 | |
This way, you can get from m to r through rows and cols as follows:
import numpy as np
import tensorflow as tf

m = np.array([[1, 2, 3, 4], [5, 6, 7, 8]])
rows = np.array([[0, 1], [0, 1], [1, 0], [0, 0]])
cols = np.array([[2, 1], [1, 2], [0, 2], [0, 0]])

x = tf.placeholder('float32', (None, None))
idx1 = tf.placeholder('int32', (None, None))
idx2 = tf.placeholder('int32', (None, None))
result = tf.gather_nd(x, tf.stack((idx1, idx2), -1))

with tf.Session() as sess:
    r = sess.run(result, feed_dict={
        x: m,
        idx1: rows,
        idx2: cols,
    })
print(r)
Output:
[[ 3.  6.]
 [ 2.  7.]
 [ 5.  3.]
 [ 1.  1.]]
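For reference, the same gather can be written in TensorFlow 2.x with eager execution, without placeholders or sessions; a minimal sketch:

import numpy as np
import tensorflow as tf

m = np.array([[1, 2, 3, 4], [5, 6, 7, 8]])
rows = np.array([[0, 1], [0, 1], [1, 0], [0, 0]])
cols = np.array([[2, 1], [1, 2], [0, 2], [0, 0]])

# Stack the row/column indices into a (4, 2, 2) tensor of [row, col] pairs
# and gather the corresponding elements of m.
r = tf.gather_nd(m, tf.stack((rows, cols), axis=-1))
print(r.numpy())
# [[3 6]
#  [2 7]
#  [5 3]
#  [1 1]]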
LDGN's comment is correct. This is not possible at the moment and is a requested feature. If you follow issue #206 on GitHub, you'll be updated if/when this becomes available. Many people would like this feature.
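For the exact case in the question (1-D index vectors a and b), the tf.gather_nd approach shown above applies as well; a minimal sketch, assuming eager TensorFlow 2.x:

import numpy as np
import tensorflow as tf

x = np.random.random((10, 10))
a = np.random.randint(0, 10, 5)
b = np.random.randint(0, 10, 5)

# Pair up the indices into shape (5, 2) and gather; equivalent to x[a, b] in NumPy.
result = tf.gather_nd(x, tf.stack((a, b), axis=-1))
print(np.allclose(result.numpy(), x[a, b]))  # True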