I am looking for a way to implement an "as of" operator in <code>numpy</code>. Specifically, if: <ol> <li> <code>t1</code> is an <code>n</code>-vector of timestamps in a strictly increasing order;</li> <li> <code>d1</code> is an <code>n x p</code> matrix of observations, with <code>i</code>-th row corresponding to <code>t1[i]</code>;</li> <li> <code>t2</code> in an <code>m</code>-vector of timestamps, also in a strictly increasing order;</li> </ol> I need to create an <code>m x p</code> matrix <code>d2</code>, where <code>d2[i]</code> is simply <code>d1[j]</code> for the largest value of <code>j</code> such that <code>t1[j] <= t2[i]</code>. In other words, I need to get the rows of <code>d1</code> as of the timestamps in <code>t2</code>. It is easy to write this in pure Python, but I am wondering if there's a way to avoid having interpreted loops (<code>n</code>, <code>m</code> and <code>p</code> are quite large). The timestamps are <code>datetime.datetime</code> objects. The observations are floating-point values. edit: For entries where <code>t1[j] <= t2[i]</code> can't be satisfied (i.e. where a timestamp in <code>t2</code> precedes all timestamps in <code>t1</code>), I would ideally like to get rows of <code>NaN</code>s.

Your best choice is <code>numpy.searchsorted()</code>: <pre class="prettyprint"><code>d1[numpy.searchsorted(t1, t2, side="right") - 1] </code></pre> This will search the indices where the values of <code>t2</code> would have to be inserted into <code>t1</code> to maintain order. The <code>side="right"</code> and <code>- 1</code> bits are to ensure exactly the specified behaviour. Edit: To get rows of NaNs where the condition <code>t1[j] <= t2[i]</code> can't be satisfied, you could use <pre class="prettyprint"><code>nan_row = numpy.repeat(numpy.nan, d1.shape[1]) d1_nan = numpy.vstack((nan_row, d1)) d2 = d1_nan[numpy.searchsorted(t1, t2, side="right")] </code></pre>

"as of" in numpy

1 Answers

Your best choice is numpy.searchsorted():

Click to copy

d1[numpy.searchsorted(t1, t2, side="right") - 1]

This will search the indices where the values of t2 would have to be inserted into t1 to maintain order. The side="right" and - 1 bits are to ensure exactly the specified behaviour.

Edit: To get rows of NaNs where the condition t1[j] <= t2[i] can't be satisfied, you could use

Click to copy

nan_row = numpy.repeat(numpy.nan, d1.shape[1])
d1_nan = numpy.vstack((nan_row, d1))
d2 = d1_nan[numpy.searchsorted(t1, t2, side="right")]

157

answered Oct 08 '22 21:10

Sven Marnach

Related questions
                            
                                Any reason there are no returned value from set.add [closed]
                            
                                Stardand context menu in Python TKinter text widget when mouse right button is pressed
                            
                                Python subprocess.Popen as different user on Windows
                            
                                3D scatterplots in sage
                            
                                How to efficiently add sparse matrices in Python
                            
                                Find speed of vehicle from images
                            
                                virtualenv --no-site-packages is not working for me
                            
                                Best Python clustering library to use for product data analysis [closed]
                            
                                Qt Designer for PyQt on OSX 10.6
                            
                                Namespacing and classes
                            
                                Why does os.path.getsize() return a negative number for a 10gb file?
                            
                                Passing Python array to c++ function with SWIG
                            
                                Complex transforming nested dictionaries into objects in python
                            
                                Dictionary-like efficient storing of scipy/numpy arrays
                            
                                SciPy global minimum curve fit
                            
                                Writing video with OpenCV + Python + Mac
                            
                                Using matplotlib slider widget to change clim in image
                            
                                Python: Problem with raw_input reading a number
                            
                                Completing object with its relations and avoiding unnecessary queries in sqlalchemy
                            
                                How to open excel file fast in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

"as of" in numpy

Tags:

python

loops

numpy

time-series

NPE

People also ask

1 Answers

Sven Marnach

Recent Activity

Donate For Us