multidimensional boolean array indexing in numpy

I have two 2D arrays, one of numbers and one of boolean values:

x = 
array([[ 0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.,  0.],
       [ 1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.],
       [ 2.,  2.,  2.,  2.,  2.,  2.,  2.,  2.,  2.,  2.],
       [ 3.,  3.,  3.,  3.,  3.,  3.,  3.,  3.,  3.,  3.],
       [ 4.,  4.,  4.,  4.,  4.,  4.,  4.,  4.,  4.,  4.],
       [ 5.,  5.,  5.,  5.,  5.,  5.,  5.,  5.,  5.,  5.],
       [ 6.,  6.,  6.,  6.,  6.,  6.,  6.,  6.,  6.,  6.],
       [ 7.,  7.,  7.,  7.,  7.,  7.,  7.,  7.,  7.,  7.],
       [ 8.,  8.,  8.,  8.,  8.,  8.,  8.,  8.,  8.,  8.],
       [ 9.,  9.,  9.,  9.,  9.,  9.,  9.,  9.,  9.,  9.]])

idx = 
array([[False, False, False, False, False, False, False, False, False, False],
       [False,  True,  True,  True,  True,  True, False, False, False, False],
       [False,  True,  True,  True,  True,  True, False, False, False, False],
       [False,  True,  True,  True,  True,  True, False, False, False, False],
       [False, False, False,  True,  True,  True,  True, False, False, False],
       [False, False, False, False,  True,  True,  True, False, False, False],
       [False, False, False, False, False, False,  True, False, False, False],
       [False, False, False, False, False, False, False,  True, False, False],
       [False, False, False, False, False, False, False, False, False, False],
       [False, False, False, False, False, False, False, False, False, False]], dtype=bool)

When I index the array it returns a 1D array:

x[idx]
array([ 1.,  1.,  1.,  1.,  1.,  2.,  2.,  2.,  2.,  2.,  3.,  3.,  3.,
    3.,  3.,  4.,  4.,  4.,  4.,  5.,  5.,  5.,  6.,  7.])

How can I index the array so that it returns a 2D array, with the following expected output:

x[idx]
array([[ 1.,  1.,  1.,  1.,  1.],
       [ 2.,  2.,  2.,  2.,  2.],
       [ 3.,  3.,  3.,  3.,  3.],
       [ 4.,  4.,  4.,  4.],
       [ 5.,  5.,  5.],
       [ 6.],
       [ 7.]])

asked Sep 29 '22 16:09

camdenl

1 Answer

Your command returns a 1D array because the 2D result you are asking for can't be produced, for two reasons: (a) it would destroy the column structure, which is usually needed. For example, the 7 in your requested output originally belonged to column 7, and would now sit in column 0. (b) numpy does not, afaik, support multidimensional arrays with different sizes along the same dimension. That is, numpy can't hold an array whose first three rows have length 5, whose 4th row has length 4, and so on: all the rows (the same dimension) need to have the same length.
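To make point (a) concrete, here is a minimal sketch. The 3x4 arrays are made up for illustration and are not the arrays from the question. It shows that boolean indexing flattens the selected values in row-major order, and that `np.nonzero` can still tell you which row and column each value came from:

```python
import numpy as np

# Small made-up example, not the arrays from the question.
x = np.arange(12, dtype=float).reshape(3, 4)
idx = np.array([[False, True,  True,  False],
                [False, False, True,  False],
                [False, False, False, False]])

flat = x[idx]                 # selected values, flattened in row-major order
rows, cols = np.nonzero(idx)  # original (row, column) of each selected value

print(flat.ndim)  # the result is always 1D, whatever the shape of the mask
for value, r, c in zip(flat, rows, cols):
    print(value, "came from row", r, "column", c)
```

If the column positions matter, keeping `rows` and `cols` alongside the flat result is usually enough to avoid losing that information.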

I think the best result you could hope for is an array of arrays (and not a 2D array). This is how I would construct it, though there are probably better ways I don't know of:

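In [9]: import numpy as np  # Python 3: the built-in zip replaces itertools.izip
In [11]: np.array([r[ridx] for r, ridx in zip(x, idx) if ridx.any()], dtype=object)  # dtype=object because the rows differ in length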
Out[11]: 
array([array([ 1.,  1.,  1.,  1.,  1.]), array([ 2.,  2.,  2.,  2.,  2.]),
       array([ 3.,  3.,  3.,  3.,  3.]), array([ 4.,  4.,  4.,  4.]),
       array([ 5.,  5.,  5.]), array([ 6.]), array([ 7.])], dtype=object)
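Two other options, sketched here as possibilities rather than as the definitive way to do it. The first splits the flat `x[idx]` result back into per-row pieces with `np.split`, giving a plain Python list of arrays. The second swaps in a different technique: it keeps the full 2D shape and fills the unselected cells with NaN via `np.where`, which preserves the column structure at the cost of padding. The arrays are rebuilt to match the ones in the question; the names `ragged` and `padded` are just illustrative.

```python
import numpy as np

# Rebuild x and idx so that they match the arrays in the question.
x = np.repeat(np.arange(10.0)[:, None], 10, axis=1)
idx = np.zeros((10, 10), dtype=bool)
idx[1:4, 1:6] = True
idx[4, 3:7] = True
idx[5, 4:7] = True
idx[6, 6] = True
idx[7, 7] = True

# Option 1: split the flat result at the row boundaries.
counts = idx.sum(axis=1)                         # number of True values per row
pieces = np.split(x[idx], np.cumsum(counts)[:-1])
ragged = [p for p in pieces if p.size]           # drop rows with nothing selected

# Option 2: keep a rectangular 2D array, with NaN where idx is False.
padded = np.where(idx, x, np.nan)

print([len(p) for p in ragged])
print(padded.shape)
```

A list of arrays is often easier to work with than an object array, since most numpy operations do not vectorize over object arrays anyway.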

answered Oct 05 '22 08:10

Korem

Tags: python, arrays, numpy
