What is the NumPy equivalent of DataFrame.loc in pandas?

Tags: python, pandas, numpy

I have a 120,000 x 4 NumPy array, as shown below. Each row is a sample. The first column is the time in seconds, or the index in pandas terminology.

0.014      14.175  -29.97  -22.68 
0.022      13.905  -29.835 -22.68
0.030      12.257  -29.32  -22.67
... ...
1259.980   -0.405   2.205   3.825
1259.991   -0.495   2.115   3.735

I want to select the rows recorded between 100.000 and 200.000 seconds and save them in a new array. If this were a pandas DataFrame, I would simply write df.loc[100:200]. What is the equivalent operation in NumPy?

This is NOT a question of feasibility. I am simply wondering whether there is a Pythonic one-line solution.

asked Jul 24 '18 23:07

F.S.

2 Answers

If I understand correctly, and assuming the index column is sorted:

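import numpy as np

x = np.array([[1, 2, 3, 4],
              [5, 6, 7, 8],
              [9, 10, 11, 12],
              [13, 14, 15, 16]])

# Keep the rows whose first column lies in [5, 9], both ends inclusive
x[(x[:, 0] >= 5) & (x[:, 0] <= 9)]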

For your data, you would use 100 and 200 instead of 5 and 9.

For a more general solution, see Wen's answer.
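Applied to data shaped like the question's, the mask might look like the minimal sketch below. The array `data` and its values are made up for illustration. `np.searchsorted` is not part of the answer above; it is shown as an alternative that is only valid when the time column is sorted in ascending order.

```python
import numpy as np

# Hypothetical stand-in for the 120,000 x 4 array: column 0 is time in seconds.
t = np.arange(0.0, 1260.0, 0.5)
data = np.column_stack([t, np.sin(t), np.cos(t), np.zeros_like(t)])

# Boolean mask: keeps rows with 100 <= time <= 200 (both ends inclusive, like .loc).
mask = (data[:, 0] >= 100) & (data[:, 0] <= 200)
selected = data[mask]

# Alternative, valid only if the time column is sorted in ascending order:
# binary search for the slice bounds instead of comparing every row.
lo = np.searchsorted(data[:, 0], 100, side="left")
hi = np.searchsorted(data[:, 0], 200, side="right")
selected_sorted = data[lo:hi]
```

The boolean mask returns a copy, while the slice `data[lo:hi]` is a view into `data`; append `.copy()` if an independent array is needed.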

answered Sep 30 '22 04:09

rafaelc

Using the data from rafaelc's answer:

x[np.where(x[:,0]==5)[0][0]:np.where(x[:,0]==9)[0][0]+1,:]
Out[341]: 
array([[ 5,  6,  7,  8],
       [ 9, 10, 11, 12]])

Note that comparing with greater-than and less-than alone cannot fully replace .loc. Behind the scenes, .loc slices by the positions of the labels in the index, not by a range of values.

For example:

df
Out[348]: 
       0   1   2   3
0      1   2   3   4
1      5   6   7   8
4444   9  10  11  12
3     13  14  15  16

df.loc[1:3]
Out[347]: 
       0   1   2   3
1      5   6   7   8
4444   9  10  11  12
3     13  14  15  16
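The difference can be reproduced in plain NumPy. The sketch below stores the index from the example (0, 1, 4444, 3) in column 0 and uses a hypothetical helper, `loc_slice`, that slices from the first occurrence of the start label through the first occurrence of the stop label. It is a simplification for illustration and does not reproduce every rule that .loc applies, for example with duplicate labels.

```python
import numpy as np

def loc_slice(arr, start, stop, col=0):
    """Hypothetical helper: rows from the first occurrence of `start` through
    the first occurrence of `stop` in column `col`, both ends inclusive."""
    labels = arr[:, col]
    start_pos = np.flatnonzero(labels == start)
    stop_pos = np.flatnonzero(labels == stop)
    if start_pos.size == 0 or stop_pos.size == 0:
        raise KeyError("start and stop must both be present in the column")
    return arr[start_pos[0]:stop_pos[0] + 1]

# Column 0 plays the role of the index from the example: 0, 1, 4444, 3.
x = np.array([[0, 1, 2, 3, 4],
              [1, 5, 6, 7, 8],
              [4444, 9, 10, 11, 12],
              [3, 13, 14, 15, 16]])

# Slicing by label position keeps the row labelled 4444 ...
by_position = loc_slice(x, 1, 3)

# ... while a value-range mask drops it, because 4444 > 3.
by_value = x[(x[:, 0] >= 1) & (x[:, 0] <= 3)]
```

Here `by_position` contains the rows labelled 1, 4444 and 3, whereas `by_value` contains only the rows labelled 1 and 3.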

answered Sep 30 '22 04:09

BENY
