For a data analysis task I want to find zero crossings in a numpy array that comes from convolving the data first with a Sobel-like kernel and then with a Mexican hat kernel. The zero crossings allow me to detect edges in the data.
Unfortunately the data is somewhat noisy, and I only want to find zero crossings with a minimal jump size, 20 in the following example:
import numpy as np
arr = np.array([12, 15, 9, 8, -1, 1, -12, -10, 10])
Should result in
>>> array([1, 3, 7])
or
>>> array([3, 7])
where 3 is the index of 8, just before the middle of the first jump, and 7 is the index of -10, just before the middle of the second jump.
I have tried a modification of the following code (source: Efficiently detect sign-changes in python):
zero_crossings = np.where(np.diff(np.sign(np.trunc(arr/10))))[0]
which correctly ignores small jumps, but puts the zero crossings at [1, 5, 7].
What would be an efficient way of doing this?
The definition of the minimal jump is not strict, but the results should be along the lines of my question.
Edit: For Clarification
arr = np.array([12, 15, 9, 8, -1, 1, -12, -10, 10])
arr_floored = np.trunc(arr/10)
>>> array([ 1.,  1.,  0.,  0., -0.,  0., -1., -1.,  1.])
sgn = np.sign(arr_floored)
>>> array([ 1.,  1.,  0.,  0.,  0.,  0., -1., -1.,  1.])
dsgn = np.diff(sgn)
>>> array([ 0., -1.,  0.,  0.,  0., -1.,  0.,  2.])
np.where(dsgn)
>>> (array([1, 5, 7], dtype=int64),)
Further edge cases:
arr = [10,9,8,7,6,5,4,3,2,1,0,-1,-2,-3,-4,-5,-6,-7,-8,-9,-10]
Should result in
>>> np.array([10])
Also just noticed: the problem might be ill-posed (in a mathematical sense). I will clarify it later today.
Find the zero crossing events in a discrete 1-D data set. Linear interpolation is used to determine the actual location of each zero crossing between two data points showing a change in sign. Data points which are zero are counted as zero crossings if a sign change occurs across them.
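That approach could be sketched as follows; the function name `zero_crossings_interp` and the exact handling of zero-valued samples are my own assumptions (this simple version reports every sample that is exactly zero), not code from the original answer:

```python
import numpy as np

def zero_crossings_interp(y):
    # Locate zero crossings in 1-D data.  For a strict sign change between
    # y[i] and y[i+1], linear interpolation gives the fractional position;
    # samples that are exactly zero are reported as crossings themselves.
    y = np.asarray(y, dtype=float)
    idx = np.nonzero(y[:-1] * y[1:] < 0)[0]        # strict sign changes
    frac = idx + y[idx] / (y[idx] - y[idx + 1])    # interpolated positions
    exact = np.nonzero(y == 0)[0].astype(float)    # samples equal to zero
    return np.sort(np.concatenate([frac, exact]))
```

For example, `zero_crossings_interp([1.0, -1.0])` places the crossing at 0.5, halfway between the two samples.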
Another way to count zero crossings, and squeeze just a few more milliseconds out of the code, is to use np.nonzero and compute the signs directly. Assuming you have a one-dimensional array of data:
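A minimal sketch of that idea, under the assumption that "compute the signs directly" means a boolean `> 0` comparison rather than `np.sign` (the helper name `crossing_indices` is mine):

```python
import numpy as np

def crossing_indices(y):
    # Compare an "is positive" mask of neighbouring samples directly;
    # np.nonzero then yields the index just before each sign change.
    # Note that 0 counts as non-positive here.
    pos = np.asarray(y) > 0
    return np.nonzero(pos[:-1] != pos[1:])[0]
```

On the question's array this reports every raw crossing, including the noisy ones around zero, so a jump-size filter would still be needed on top of it.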
As remarked by Jay Borseth, the accepted answer does not handle arrays containing 0 correctly. I suggest using numpy.signbit() instead, since a) it is a little bit quicker than numpy.sign() (its implementation is simpler, I guess) and b) it deals correctly with zeros in the input array.
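A sketch of what that could look like (the function name is mine; the original answer's exact code is not shown here):

```python
import numpy as np

def crossings_signbit(y):
    # np.signbit is True exactly for values with the sign bit set
    # (negatives and -0.0), so comparing neighbouring sign bits flags
    # every positive<->negative transition; +0.0 counts as positive.
    sb = np.signbit(np.asarray(y, dtype=float))
    return np.nonzero(sb[:-1] != sb[1:])[0]
```

Because the comparison is on the sign bit rather than on a three-valued sign, an exact 0.0 in the data does not produce a spurious double crossing.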
Here's a solution that gives the midpoint of each crossing, with a noise threshold to filter out potentially multiple fluctuations around zero spread across several data points. It gives the correct answers for the two examples you supplied. However, I've made a couple of assumptions: abs() of the values at the start and end of each crossing is >= 10, hence I've used the minimum range where this condition holds.
import numpy as np
import pandas as pd

arr = np.array([12, 15, 9, 8, -1, 1, -12, -10, 10])
# map small (|value| < 10) samples to 0, larger ones to +/-1
sgn = pd.Series(np.sign(np.trunc(arr/10)))
# drop the zero (noise) samples, then diff the remaining signs;
# non-zero diffs (and the leading NaN) mark the edges of each crossing
trailingEdge = sgn[sgn != 0].diff()
edgeIndex = np.array(trailingEdge[trailingEdge != 0].index)
# report the midpoint between consecutive edge indices
edgeIndex[:-1] + np.diff(edgeIndex) / 2
gives:
array([3., 7.])
and
arr = [10,9,8,7,6,5,4,3,2,1,0,-1,-2,-3,-4,-5,-6,-7,-8,-9,-10]
gives:
array([10.])
I guess you want
import numpy as np
x = np.array([10, -50, -30, 50, 10, 3, -200, -12, 123])
indices = np.where(np.logical_and(np.abs(np.diff(x)) >= 20, np.diff(np.sign(x)) != 0))[0]
read as: indices where (the absolute difference of x is at least 20) and (the sign flips)
which returns
array([0, 2, 5, 7])
The usual numpy approaches don't cover the circular case, where a crossing occurs between the last and the first element. I would suggest simply appending the first element at the end, via np.pad with 'wrap' mode:
import numpy as np

x = np.array([10, 5, 0, -5, -10])
# append the first element after the last so the wrap-around
# transition (-10 -> 10) is checked as well
x = np.pad(x, (0, 1), 'wrap')
indices = np.where(np.logical_and(np.abs(np.diff(x)) >= 20, np.diff(np.sign(x)) != 0))[0]