Ok what im trying to do is a kind of audio processing software that can detect a prevalent frequency an if the frequency is played for long enough (few ms) i know i got a positive match. i know i would need to use FFT or something simiral but in this field of math i suck, i did search the internet but didn not find a code that could do only this. the goal im trying to accieve is to make myself a custom protocol to send data trough sound, need very low bitrate per sec (5-10bps) but im also very limited on the transmiting end so the recieving software will need to be able custom (cant use an actual hardware/software modem) also i want this to be software only (no additional hardware except soundcard) thanks alot for the help.

The aubio libraries have been wrapped with SWIG and can thus be used by Python. Among their many features include several methods for pitch detection/estimation including the YIN algorithm and some harmonic comb algorithms. However, if you want something simpler, I wrote some code for pitch estimation some time ago and you can take it or leave it. It won't be as accurate as using the algorithms in aubio, but it might be good enough for your needs. I basically just took the FFT of the data times a window (a Blackman window in this case), squared the FFT values, found the bin that had the highest value, and used a quadratic interpolation around the peak using the log of the max value and its two neighboring values to find the fundamental frequency. The quadratic interpolation I took from some paper that I found. It works fairly well on test tones, but it will not be as robust or as accurate as the other methods mentioned above. The accuracy can be increased by increasing the chunk size (or reduced by decreasing it). The chunk size should be a multiple of 2 to make full use of the FFT. Also, I am only determining the fundamental pitch for each chunk with no overlap. I used PyAudio to play the sound through while writing out the estimated pitch. Source Code: <pre class="prettyprint"><code># Read in a WAV and find the freq's import pyaudio import wave import numpy as np chunk = 2048 # open up a wave wf = wave.open('test-tones/440hz.wav', 'rb') swidth = wf.getsampwidth() RATE = wf.getframerate() # use a Blackman window window = np.blackman(chunk) # open stream p = pyaudio.PyAudio() stream = p.open(format = p.get_format_from_width(wf.getsampwidth()), channels = wf.getnchannels(), rate = RATE, output = True) # read some data data = wf.readframes(chunk) # play stream and find the frequency of each chunk while len(data) == chunk*swidth: # write data out to the audio stream stream.write(data) # unpack the data and times by the hamming window indata = np.array(wave.struct.unpack("%dh"%(len(data)/swidth),\ data))*window # Take the fft and square each value fftData=abs(np.fft.rfft(indata))**2 # find the maximum which = fftData[1:].argmax() + 1 # use quadratic interpolation around the max if which != len(fftData)-1: y0,y1,y2 = np.log(fftData[which-1:which+2:]) x1 = (y2 - y0) * .5 / (2 * y1 - y2 - y0) # find the frequency and output it thefreq = (which+x1)*RATE/chunk print "The freq is %f Hz." % (thefreq) else: thefreq = which*RATE/chunk print "The freq is %f Hz." % (thefreq) # read some more data data = wf.readframes(chunk) if data: stream.write(data) stream.close() p.terminate() </code></pre>

Python frequency detection

Tags:

python

frequency

audio

fft

Ok what im trying to do is a kind of audio processing software that can detect a prevalent frequency an if the frequency is played for long enough (few ms) i know i got a positive match. i know i would need to use FFT or something simiral but in this field of math i suck, i did search the internet but didn not find a code that could do only this.

the goal im trying to accieve is to make myself a custom protocol to send data trough sound, need very low bitrate per sec (5-10bps) but im also very limited on the transmiting end so the recieving software will need to be able custom (cant use an actual hardware/software modem) also i want this to be software only (no additional hardware except soundcard)

thanks alot for the help.

329

asked Apr 15 '10 18:04

MatijaG

1 Answers

The aubio libraries have been wrapped with SWIG and can thus be used by Python. Among their many features include several methods for pitch detection/estimation including the YIN algorithm and some harmonic comb algorithms.

However, if you want something simpler, I wrote some code for pitch estimation some time ago and you can take it or leave it. It won't be as accurate as using the algorithms in aubio, but it might be good enough for your needs. I basically just took the FFT of the data times a window (a Blackman window in this case), squared the FFT values, found the bin that had the highest value, and used a quadratic interpolation around the peak using the log of the max value and its two neighboring values to find the fundamental frequency. The quadratic interpolation I took from some paper that I found.

It works fairly well on test tones, but it will not be as robust or as accurate as the other methods mentioned above. The accuracy can be increased by increasing the chunk size (or reduced by decreasing it). The chunk size should be a multiple of 2 to make full use of the FFT. Also, I am only determining the fundamental pitch for each chunk with no overlap. I used PyAudio to play the sound through while writing out the estimated pitch.

Source Code:

# Read in a WAV and find the freq's import pyaudio import wave import numpy as np  chunk = 2048  # open up a wave wf = wave.open('test-tones/440hz.wav', 'rb') swidth = wf.getsampwidth() RATE = wf.getframerate() # use a Blackman window window = np.blackman(chunk) # open stream p = pyaudio.PyAudio() stream = p.open(format =                 p.get_format_from_width(wf.getsampwidth()),                 channels = wf.getnchannels(),                 rate = RATE,                 output = True)  # read some data data = wf.readframes(chunk) # play stream and find the frequency of each chunk while len(data) == chunk*swidth:     # write data out to the audio stream     stream.write(data)     # unpack the data and times by the hamming window     indata = np.array(wave.struct.unpack("%dh"%(len(data)/swidth),\                                          data))*window     # Take the fft and square each value     fftData=abs(np.fft.rfft(indata))**2     # find the maximum     which = fftData[1:].argmax() + 1     # use quadratic interpolation around the max     if which != len(fftData)-1:         y0,y1,y2 = np.log(fftData[which-1:which+2:])         x1 = (y2 - y0) * .5 / (2 * y1 - y2 - y0)         # find the frequency and output it         thefreq = (which+x1)*RATE/chunk         print "The freq is %f Hz." % (thefreq)     else:         thefreq = which*RATE/chunk         print "The freq is %f Hz." % (thefreq)     # read some more data     data = wf.readframes(chunk) if data:     stream.write(data) stream.close() p.terminate()

117

answered Sep 25 '22 14:09

Justin Peel

Related questions
                            
                                Python's Logical Operator AND
                            
                                `AttributeError: rint` when using numpy.round
                            
                                Build a URL using Requests module Python
                            
                                Coordinates of the closest points of two geometries in Shapely
                            
                                TypeError: 'type' object is not subscriptable when indexing in to a dictionary
                            
                                Pandas drop_duplicates method not working on dataframe containing lists
                            
                                Why in argparse, a 'True' is always 'True'? [duplicate]
                            
                                After conda update, python kernel crashes when matplotlib is used
                            
                                Why is my instance variable not in __dict__?
                            
                                Multiple keys per value
                            
                                Python numpy 2D array indexing
                            
                                iterrows pandas get next rows value
                            
                                How to generate a random normal distribution of integers
                            
                                How to open a huge excel file efficiently
                            
                                AWS BOTO3 S3 python - An error occurred (404) when calling the HeadObject operation: Not Found
                            
                                Django templates and variable attributes
                            
                                Check Linux distribution name
                            
                                Is this the way to validate Django model fields?
                            
                                Flask confusion with app
                            
                                Python TypeError: unsupported operand type(s) for ^: 'float' and 'int'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With