Extracting precise frequencies from FFT Bins using phase change between frames

Tags:

I have been looking through this fantastic article: http://blogs.zynaptiq.com/bernsee/pitch-shifting-using-the-ft/

While being fantastic, it is extremely hard and heavy going. This material is really stretching me.

I have extracted the maths from Stefan's code module that calculates the exact frequency for a given bin. But I don't understand the last calculation. Can someone explain to me the mathematical construction at the end?

Before digging into the code, let me set the scene:

Let's say we set fftFrameSize = 1024, so we are dealing with 512+1 bins
As an example, Bin[1]'s ideal frequency fits a single wave in the frame. At a sample rate of 40KHz, tOneFrame = 1024/40K seconds = 1/40s, so Bin[1] would ideally be collecting a 40Hz signal.
Setting osamp (overSample) = 4, we progress along our input signal in steps of 256. So the first analysis examines bytes zero through 1023, then 256 through 1279, etc. Note each float gets processed 4 times.

...

void calcBins(                long fftFrameSize,                long osamp,                float sampleRate,                float * floats,                BIN * bins               ) {     /* initialize our static arrays */     static float gFFTworksp[2*MAX_FRAME_LENGTH];     static float gLastPhase[MAX_FRAME_LENGTH/2+1];      static long gInit = 0;     if (! gInit)      {         memset(gFFTworksp, 0, 2*MAX_FRAME_LENGTH*sizeof(float));         memset(gLastPhase, 0, (MAX_FRAME_LENGTH/2+1)*sizeof(float));         gInit = 1;     }      /* do windowing and re,im interleave */     for (long k = 0; k < fftFrameSize; k++)      {         double window = -.5*cos(2.*M_PI*(double)k/(double)fftFrameSize)+.5;         gFFTworksp[2*k] = floats[k] * window;         printf("sinValue: %f", gFFTworksp[2*k]);         gFFTworksp[2*k+1] = 0.;     }      /* do transform */     smbFft(gFFTworksp, fftFrameSize, -1);      printf("\n");      /* this is the analysis step */     for (long k = 0; k <= fftFrameSize/2; k++)      {         /* de-interlace FFT buffer */         double real = gFFTworksp[2*k];         double imag = gFFTworksp[2*k+1];          /* compute magnitude and phase */         double magn = 2.*sqrt(real*real + imag*imag);         double phase = atan2(imag,real);          /* compute phase difference */         double phaseDiff = phase - gLastPhase[k];         gLastPhase[k] = phase;          /* subtract expected phase difference */         double binPhaseOffset = M_TWOPI * (double)k / (double)osamp;         double deltaPhase = phaseDiff - binPhaseOffset;          /* map delta phase into [-Pi, Pi) interval */         // better, but obfuscatory...         //    deltaPhase -= M_TWOPI * floor(deltaPhase / M_TWOPI + .5);          while (deltaPhase >= M_PI)             deltaPhase -= M_TWOPI;         while (deltaPhase < -M_PI)             deltaPhase += M_TWOPI;

(EDIT:) Now the bit I don't get:

        // Get deviation from bin frequency from the +/- Pi interval          // Compute the k-th partials' true frequency              // Start with bin's ideal frequency         double bin0Freq = (double)sampleRate / (double)fftFrameSize;         bins[k].idealFreq = (double)k * bin0Freq;          // Add deltaFreq         double sampleTime = 1. / (double)sampleRate;         double samplesInStep = (double)fftFrameSize / (double)osamp;         double stepTime = sampleTime * samplesInStep;         double deltaTime = stepTime;                  // Definition of frequency is rate of change of phase, i.e. f = dϕ/dt         // double deltaPhaseUnit = deltaPhase / M_TWOPI; // range [-.5, .5)         double freqAdjust = (1. / M_TWOPI) * deltaPhase / deltaTime;           // Actual freq <-- WHY ???         bins[k].freq = bins[k].idealFreq + freqAdjust;     } }

I just can't see it clearly, even though it seems to be staring in the face. Could someone please explain this process from scratch, step by step?

952

asked Jan 08 '11 09:01

P i

2 Answers

The basic principle is very simple. If a given component exactly matches a bin frequency then its phase will not change from one FT to the next. However if the frequency does not correspond exactly with the bin frequency then there will be a phase change between successive FTs. The frequency delta is just:

delta_freq = delta_phase / delta_time

and the refined estimate of the frequency of the component will then be:

freq_est = bin_freq + delta_freq

157

answered Oct 04 '22 22:10

Paul R

I have implemented this algorithm for Performous myself. When you take another FFT at a time offset, you expect the phase to change according to the offset, i.e. two FFTs taken 256 samples apart should have a phase difference of 256 samples for all frequencies present in the signal (this assumes that the signals themselves are steady, which is a good assumption for short periods like 256 samples).

Now, the actual phase values you get from FFT are not in samples but in phase angle, so they will be different depending on the frequency. In the following code the phaseStep value is the conversion factor needed per bin, i.e. for frequency corresponding to bin x, the phase shift will be x * phaseStep. For bin center frequencies x would be an integer (the bin number) but for actual detected frequencies it may be any real number.

const double freqPerBin = SAMPLE_RATE / FFT_N; const double phaseStep = 2.0 * M_PI * FFT_STEP / FFT_N;

The correction works by assuming that the signal in a bin has the bin center frequency and then calculating the expected phase shift for that. This expected shift is substracted from the actual shift, leaving the error. A remainder (modulo 2 pi) is taken (-pi to pi range) and the final frequency is calculated with bin center + correction.

// process phase difference double delta = phase - m_fftLastPhase[k]; m_fftLastPhase[k] = phase; delta -= k * phaseStep;  // subtract expected phase difference delta = remainder(delta, 2.0 * M_PI);  // map delta phase into +/- M_PI interval delta /= phaseStep;  // calculate diff from bin center frequency double freq = (k + delta) * freqPerBin;  // calculate the true frequency

Notice that many adjacent bins often end up corrected to the same frequency because the delta correction can be up to 0.5 * FFT_N / FFT_STEP bins either way so the smaller FFT_STEP you use, the further away will corrections be possible (but this increases the processing power needed as well as imprecision due to inaccuracies).

I hope this helps :)

answered Oct 04 '22 23:10

Tronic

Related questions
                            
                                Access symbols defined in the linker script by application
                            
                                Why type cast a void pointer?
                            
                                Proper Way to Free Memory of a Returned Variable
                            
                                Why are integer types promoted during addition in C?
                            
                                No O_BINARY and O_TEXT flags in Linux?
                            
                                Why can I set an anonymous enum equal to another in C but not C++?
                            
                                Write to memory buffer instead of file with libjpeg?
                            
                                Sharing a global/static variable between a process and DLL
                            
                                Naming convention when using STRUCT in C
                            
                                find ones position in 64 bit number
                            
                                Using Doxygen with C, do you comment the function prototype or the definition? Or both?
                            
                                Is the PHP language resultantly C?
                            
                                What constitutes a "valid" C Identifier?
                            
                                Is there a need to close file descriptors before exit?
                            
                                What c lib to use when I need to parse a simple config file under linux? [closed]
                            
                                Why do I have to explicitly link with libm? [duplicate]
                            
                                How to use list from sys/queue.h?
                            
                                Replacing extrordinarily slow pow() function
                            
                                What does "@(#)" in comments mean?
                            
                                What is the difference between functions in math and functions in programming?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Extracting precise frequencies from FFT Bins using phase change between frames

Tags:

c

math

signal-processing

fft

phase

P i

People also ask

2 Answers

Paul R

Tronic

Recent Activity

Donate For Us