In Python, how do I calcuate the peaks of a histogram? I tried this: <pre class="prettyprint"><code>import numpy as np from scipy.signal import argrelextrema data = [0, 1, 2, 3, 4, 0, 1, 2, 3, 4, 0, 1, 2, 3, 4, 1, 2, 3, 4, 5, 6, 7, 8, 9, 5, 6, 7, 8, 9, 5, 6, 7, 8, 9, 12, 15, 16, 17, 18, 19, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24,] h = np.histogram(data, bins=[0, 5, 10, 15, 20, 25]) hData = h[0] peaks = argrelextrema(hData, np.greater) </code></pre> But the result was: <pre class="prettyprint"><code>(array([3]),) </code></pre> I'd expect it to find the peaks in bin 0 and bin 3. Note that the peaks span more than 1 bin. I don't want it to consider the peaks that span more than 1 column as additional peak. I'm open to another way to get the peaks. Note: <pre class="prettyprint"><code>>>> h[0] array([19, 15, 1, 10, 5]) >>> </code></pre>

I wrote an easy function: <pre class="prettyprint"><code>def find_peaks(a): x = np.array(a) max = np.max(x) lenght = len(a) ret = [] for i in range(lenght): ispeak = True if i-1 > 0: ispeak &= (x[i] > 1.8 * x[i-1]) if i+1 < lenght: ispeak &= (x[i] > 1.8 * x[i+1]) ispeak &= (x[i] > 0.05 * max) if ispeak: ret.append(i) return ret </code></pre> I defined a peak as a value bigger than 180% that of the neighbors and bigger than 5% of the max value. Of course you can adapt the values as you prefer in order to find the best set up for your problem.

calculate histogram peaks in python

Tags:

python

In Python, how do I calcuate the peaks of a histogram?

I tried this:

import numpy as np
from scipy.signal import argrelextrema

data = [0, 1, 2, 3, 4, 0, 1, 2, 3, 4, 0, 1, 2, 3, 4, 1, 2, 3, 4,

        5, 6, 7, 8, 9, 5, 6, 7, 8, 9, 5, 6, 7, 8, 9,

        12,

        15, 16, 17, 18, 19, 15, 16, 17, 18, 

        19, 20, 21, 22, 23, 24,]

h = np.histogram(data, bins=[0, 5, 10, 15, 20, 25])
hData = h[0]
peaks = argrelextrema(hData, np.greater)

But the result was:

(array([3]),)

I'd expect it to find the peaks in bin 0 and bin 3.

Note that the peaks span more than 1 bin. I don't want it to consider the peaks that span more than 1 column as additional peak.

I'm open to another way to get the peaks.

Note:

>>> h[0]
array([19, 15,  1, 10,  5])
>>>

250

asked Aug 10 '15 01:08

brian

2 Answers

In computational topology, the formalism of persistent homology provides a definition of "peak" that seems to address your need. In the 1-dimensional case the peaks are illustrated by the blue bars in the following figure:

Most persistent peaks

A description of the algorithm is given in this Stack Overflow answer of a peak detection question.

The nice thing is that this method not only identifies the peaks but it quantifies the "significance" in a natural way.

A simple and efficient implementation (as fast as sorting numbers) and the source material to the above answer given in this blog article: https://www.sthu.org/blog/13-perstopology-peakdetection/index.html

answered Oct 07 '22 14:10

S. Huber

I wrote an easy function:

def find_peaks(a):
  x = np.array(a)
  max = np.max(x)
  lenght = len(a)
  ret = []
  for i in range(lenght):
      ispeak = True
      if i-1 > 0:
          ispeak &= (x[i] > 1.8 * x[i-1])
      if i+1 < lenght:
          ispeak &= (x[i] > 1.8 * x[i+1])

      ispeak &= (x[i] > 0.05 * max)
      if ispeak:
          ret.append(i)
  return ret

I defined a peak as a value bigger than 180% that of the neighbors and bigger than 5% of the max value. Of course you can adapt the values as you prefer in order to find the best set up for your problem.

answered Oct 07 '22 13:10

Andrea Mauro

Related questions
                            
                                import sklearn not working in PyCharm
                            
                                Unable to run hg due to "ImportError: No module named osutil"
                            
                                Popen.communicate() returns (None, None) even if script print results
                            
                                ImportError: No module named 'paramiko'
                            
                                Selenium Chromedriver Hangs?
                            
                                How to change color of QMainWindow borders and title bar?
                            
                                pip: How to install into /usr/local
                            
                                Calculating variance of an image python efficiently
                            
                                How to use a dot in Python format strings?
                            
                                Why is the plot generated from ggplot not showing up?
                            
                                aiohttp - exception ignored message
                            
                                How to run tests django rest framework tests?
                            
                                4 dimensional array of zeros in python
                            
                                How could I get the RAW pixel data out of a .NEF file using python?
                            
                                How to use Ubuntu 14.04 on AWS Elastic Beanstalk for a Python Django app
                            
                                Creating a dark, reversed color palette in Seaborn
                            
                                How to test Retry in Celery application in Python?
                            
                                Iterate over dictionary of objects
                            
                                2D Nearest Neighbor Interpolation in Python
                            
                                Numpy: Average of values corresponding to unique coordinate positions

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With