How do you analyse the fundamental frequency of a PCM or WAV sample? [closed]

People also ask

What is fundamental frequency of song?

The fundamental frequency, often referred to simply as the fundamental, is defined as the lowest frequency of a periodic waveform. In music, the fundamental is the musical pitch of a note that is perceived as the lowest partial present.

The FFT can help you figure out where the frequency is, but it can't tell you exactly what the frequency is. Each point in the FFT is a "bin" of frequencies, so if there's a peak in your FFT, all you know is that the frequency you want is somewhere within that bin, or range of frequencies.

If you want it really accurate, you need a long FFT with a high resolution and lots of bins (= lots of memory and lots of computation). You can also guess the true peak from a low-resolution FFT using quadratic interpolation on the log-scaled spectrum, which works surprisingly well.

If computational cost is most important, you can try to get the signal into a form in which you can count zero crossings, and then the more you count, the more accurate your measurement.

None of these will work if the fundamental is missing, though. :)

I've outlined a few different algorithms here, and the interpolated FFT is usually the most accurate (though this only works when the fundamental is the strongest harmonic - otherwise you need to be smarter about finding it), with zero-crossings a close second (though this only works for waveforms with one crossing per cycle). Neither of these conditions is typical.

Keep in mind that the partials above the fundamental frequency are not perfect harmonics in many instruments, like piano or guitar. Each partial is actually a little bit out of tune, or inharmonic. So the higher-frequency peaks in the FFT will not be exactly on the integer multiples of the fundamental, and the wave shape will change slightly from one cycle to the next, which throws off autocorrelation.

To get a really accurate frequency reading, I'd say to use the autocorrelation to guess the fundamental, then find the true peak using quadratic interpolation. (You can do the autocorrelation in the frequency domain to save CPU cycles.) There are a lot of gotchas, and the right method to use really depends on your application.

There are also other algorithms that are time-based, not frequency based. Autocorrelation is a relatively simple algorithm for pitch detection. Reference: http://cnx.org/content/m11714/latest/

I have written c# implementations of autocorrelation and other algorithms that are readable. Check out http://code.google.com/p/yaalp/.

http://code.google.com/p/yaalp/source/browse/#svn/trunk/csaudio/WaveAudio/WaveAudio Lists the files, and PitchDetection.cs is the one you want.

(The project is GPL; so understand the terms if you use the code).

Guitar tuners don't use FFT's or DFT's. Usually they just count zero crossings. You might not get the fundamental frequency because some waveforms have more zero crossings than others but you can usually get a multiple of the fundamental frequency that way. That's enough to get the note although you might be one or more octaves off.

Low pass filtering before counting zero crossings can usually get rid of the excess zero crossings. Tuning the low pass filter requires some knowlegde of the range of frequency you want to detect though

FFTs (Fast-Fourier Transforms) would indeed be involved. FFTs allow you to approximate any analog signal with a sum of simple sine waves of fixed frequencies and varying amplitudes. What you'll essentially be doing is taking a sample and decomposing it into amplitude->frequency pairs, and then taking the frequency that corresponds to the highest amplitude.

Hopefully another SO reader can fill the gaps I'm leaving between the theory and the code!

A little more specifically:

If you start with the raw PCM in an input array, what you basically have is a graph of wave amplitude vs time.Doing a FFT will transform that to a frequency histogram for frequencies from 0 to 1/2 the input sampling rate. The value of each entry in the result array will be the 'strength' of the corresponding sub-frequency.

So to find the root frequency given an input array of size N sampled at S samples/second:

FFT(N, input, output);
max = max_i = 0;
for(i=0;i<N;i++)
  if (output[i]>max) max_i = i;
root = S/2.0 * max_i/N ;

Retrieval of fundamental frequencies in a PCM audio signal is a difficult task, and there would be a lot to talk about it...

Anyway, usually time-based method are not suitable for polyphonic signals, because a complex wave given by the sum of different harmonic components due to multiple fundamental frequencies has a zero-crossing rate which depends only from the lowest frequency component... Also in the frequency domain the FFT is not the most suitable method, since frequency spacing between notes follow an exponential scale, not linear. This means that a constant frequency resolution, used in the FFT method, may be insufficient to resolve lower frequency notes if the size of the analysis window in the time domain is not large enough.

A more suitable method would be a constant-Q transform, which is DFT applied after a process of low-pass filtering and decimation by 2 (i.e. halving each step the sampling frequency) of the signal, in order to obtain different subbands with different frequency resolution. In this way the calculation of DFT is optimized. The trouble is that also time resolution is variable, and increases for the lower subbands...

Finally, if we are trying to estimate the fundamental frequency of a single note, FFT/DFT methods are ok. Things change for a polyphonic context, in which partials of different sounds overlap and sum/cancel their amplitude depending from their phase difference, and so a single spectral peak could belong to different harmonic contents (belonging to different notes). Correlation in this case don't give good results...

Related questions
                            
                                Convert Midi Note Numbers To Name and Octave
                            
                                How can I detect these audio abnormalities?
                            
                                How do you do bicubic (or other non-linear) interpolation of re-sampled audio data?
                            
                                Best API for low-level audio in Windows?
                            
                                Java Record Mic to Byte Array and Play sound
                            
                                Which algorithm is used for noise canceling in earphones?
                            
                                html5 audio livestreaming
                            
                                Concept about Multimedia Codecs (Container,Format, Codec,Muxer,Demuxer) [closed]
                            
                                How to toggle audio play() pause() with one button or link?
                            
                                How do computers process audio data? [closed]
                            
                                m4a mp4 file format what's the difference or are they the same?
                            
                                Could not find tag for codec pcm_alaw in stream #1, codec not currently supported in container when concatenating 2 files using ffmpeg [closed]
                            
                                Does java have built in libraries for audio _synthesis_?
                            
                                How do I use audio sample data from Java Sound?
                            
                                Why can't I play sounds more than once using HTML5 audio tag?
                            
                                Sprite Kit & playing sound leads to app termination
                            
                                Find if video file has audio present in it [duplicate]
                            
                                Java - reading, manipulating and writing WAV files
                            
                                Custom CSS for <audio> tag?
                            
                                Audio output with video processing with opencv

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do you analyse the fundamental frequency of a PCM or WAV sample? [closed]

Tags:

signal-processing

audio

fft

pitch-tracking

People also ask

Recent Activity

Donate For Us