I'm trying to use numpy to remove rows from a two dimensional array where the first value of the row (so the element at index 0) does not match a certain condition. I am able to do this with regular python using two loops, but I would like to do it more efficiently with numpy, e.g. with <code>numpy.where</code> I have been trying various things with <code>numpy.where</code> and <code>numpy.delete</code> but I struggle with the fact that I want to select rows by using a condition that only needs to be verified by the first element, and not the second (I dont care about the value of the second element) Here is an example where I only want to keep the rows where the first value of each row is 6. Input: <pre class="prettyprint"><code>[[0,4], [0,5], [3,5], [6,8], [9,1], [6,1]] </code></pre> Output: <pre class="prettyprint"><code>[[6,8], [6,1]] </code></pre>

Program: <pre class="prettyprint"><code>import numpy as np np_array = np.array([[0,4],[0,5],[3,5],[6,8],[9,1],[6,1]]) rows=np.where(np_array[:,0]==6) print(np_array[rows]) </code></pre> Output: <pre class="prettyprint"><code>[[6 8] [6 1]] </code></pre> And If You Want to Get Into 2d List use <pre class="prettyprint"><code>np_array[rows].tolist() </code></pre> Output of 2d List <pre class="prettyprint"><code>[[6, 8], [6, 1]] </code></pre>

Numpy select rows based on condition

Tags:

python

arrays

matrix

numpy

I'm trying to use numpy to remove rows from a two dimensional array where the first value of the row (so the element at index 0) does not match a certain condition.

I am able to do this with regular python using two loops, but I would like to do it more efficiently with numpy, e.g. with numpy.where

I have been trying various things with numpy.where and numpy.delete but I struggle with the fact that I want to select rows by using a condition that only needs to be verified by the first element, and not the second (I dont care about the value of the second element)

Here is an example where I only want to keep the rows where the first value of each row is 6.

Input:

[[0,4],
[0,5],
[3,5],
[6,8],
[9,1],
[6,1]]

Output:

[[6,8],
[6,1]]

244

asked Sep 24 '19 11:09

charelf

2 Answers

Use a boolean mask:

mask = (z[:, 0] == 6)
z[mask, :]

This is much more efficient than np.where because you can use the boolean mask directly, without having the overhead of converting it to an array of indices first.

One liner:

z[z[:, 0] == 6, :]

126

answered Sep 21 '22 08:09

Mad Physicist

Program:

import numpy as np
np_array = np.array([[0,4],[0,5],[3,5],[6,8],[9,1],[6,1]])
rows=np.where(np_array[:,0]==6)
print(np_array[rows])

Output:

[[6 8]
 [6 1]]

And If You Want to Get Into 2d List use

np_array[rows].tolist()

Output of 2d List

[[6, 8], [6, 1]]

answered Sep 20 '22 08:09

ravishankar chavare

Related questions
                            
                                Row Sum of a dot product for huge matrix in python
                            
                                Replacing a text with \n in it, with a real \n output
                            
                                passing a py.test fixture between test files in a module
                            
                                How to find maximum value of a column in python dataframe
                            
                                How to save a binary image(with dtype=bool) using cv2?
                            
                                Absolute imports in python not working, relative imports work
                            
                                Options for running Python scripts in Azure
                            
                                Why do we need Signatures in Celery?
                            
                                Missing handler error in AWS Lambda
                            
                                What is the space complexity of the python sort?
                            
                                Why is time complexity O(1) for pow(x,y) while it is O(n) for x**y?
                            
                                How to print a numpy.array in one line?
                            
                                Adding callback function on each retry attempt using requests/urllib3
                            
                                How to read the response body on Python urllib when the status is an error like 400 which raises an exception?
                            
                                Getting "ImportError: libXrender.so.1: cannot open shared object file" when importing OpenCV
                            
                                Creating curved edges with NetworkX in Python3
                            
                                When is a class variable initialized in Python?
                            
                                How to prevent user to access login page in django when already logged in?
                            
                                db.create_all() 'NoneType' object has no attribute 'drivername'
                            
                                Type hints for SQLAlchemy engine and session objects

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With