HOW to get MFCC from an FFT on a signal?

Tags:

SHORT AND SIMPLE: Hi all very simply... I just want to know the steps that are involved to get an MFCC from an FFT.

DETAILED:

Hi all. I am working on a drum application where I want to classify sounds. Its just a matching application, it returns the name of the note that you play on the drum.

Its a simple indian loud big drum. There are only a few notes on there that one can play.

I've implemented the fft algorithm and successfully obtain a spectrum. I now want to take it one step further and return the mfcc from the fft.

This is what i understand so far. its based on linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency.

it uses triangulation to filter out the frequencies and get a desired coefficient. http://instruct1.cit.cornell.edu/courses/ece576/FinalProjects/f2008/pae26_jsc59/pae26_jsc59/images/melfilt.png

so if you have around 1000 values returned from the fft algorithm - the spectrum of the sound, then desirably you'll get around 12 elements (i.e., coefficients). This 12-element vector is used to classify the instrument, including the drum played...

this is exactly what i want.

Could someone please help me on how to do something like this? my programming skills are alright. Im currently creating an application for the iphone. with openframeworks.

Any help would be greatly appreciated. Cheers

560

asked Apr 29 '11 17:04

Pavan

1 Answers

First, you have to split the signal in small frames with 10 to 30ms, apply a windowing function (humming is recommended for sound applications), and compute the fourier transform of the signal. With DFT, to compute Mel Frequecy Cepstral Coefficients you have to follow these steps:

Get power spectrum: |DFT|^2
Compute a triangular bank filter to transform hz scale into mel scale
Get log spectrum
Apply discrete cossine transform

A python code example:

import numpy
from scipy.fftpack import dct
from scipy.io import wavfile

sampleRate, signal = wavfile.read("file.wav")
numCoefficients = 13 # choose the sive of mfcc array
minHz = 0
maxHz = 22.000  

complexSpectrum = numpy.fft(signal)
powerSpectrum = abs(complexSpectrum) ** 2
filteredSpectrum = numpy.dot(powerSpectrum, melFilterBank())
logSpectrum = numpy.log(filteredSpectrum)
dctSpectrum = dct(logSpectrum, type=2)  # MFCC :)

def melFilterBank(blockSize):
    numBands = int(numCoefficients)
    maxMel = int(freqToMel(maxHz))
    minMel = int(freqToMel(minHz))

    # Create a matrix for triangular filters, one row per filter
    filterMatrix = numpy.zeros((numBands, blockSize))

    melRange = numpy.array(xrange(numBands + 2))

    melCenterFilters = melRange * (maxMel - minMel) / (numBands + 1) + minMel

    # each array index represent the center of each triangular filter
    aux = numpy.log(1 + 1000.0 / 700.0) / 1000.0
    aux = (numpy.exp(melCenterFilters * aux) - 1) / 22050
    aux = 0.5 + 700 * blockSize * aux
    aux = numpy.floor(aux)  # Arredonda pra baixo
    centerIndex = numpy.array(aux, int)  # Get int values

    for i in xrange(numBands):
        start, centre, end = centerIndex[i:i + 3]
        k1 = numpy.float32(centre - start)
        k2 = numpy.float32(end - centre)
        up = (numpy.array(xrange(start, centre)) - start) / k1
        down = (end - numpy.array(xrange(centre, end))) / k2

        filterMatrix[i][start:centre] = up
        filterMatrix[i][centre:end] = down

    return filterMatrix.transpose()

def freqToMel(freq):
    return 1127.01048 * math.log(1 + freq / 700.0)

def melToFreq(mel):
    return 700 * (math.exp(mel / 1127.01048) - 1)

This code is based on MFCC Vamp example. I hope this help you!

answered Nov 08 '22 09:11

alfakini

Related questions
                            
                                Buffering log messages in NLog and manually flushes them to target
                            
                                Logging in Python?
                            
                                Include header only once at the top of a rolling file
                            
                                Git - List files created by author
                            
                                What is the default MessageFactory for Log4J
                            
                                How to setup event log for .NET Core 3.0 Worker Service
                            
                                Design pattern / C# trick for repeated bit of code
                            
                                Sortable, readable and standard time format for logs
                            
                                Turn off active record logging in production
                            
                                How do I add an arbitrary value to a TensorFlow summary?
                            
                                Getting Heroku logs for past few weeks
                            
                                Why we should consider the «Logger» class as a singleton?
                            
                                java logging API, disable logging to standard output
                            
                                Log4j 2 doesn't write to file
                            
                                EC2 and login logging
                            
                                Gradle stack trace on terminal
                            
                                Serilog: Request Id implementation
                            
                                How can I calculate the median and standard deviation of a bunch stream of numbers in Perl?
                            
                                Why is my program creating empty .lck files?
                            
                                logback - no end of line delimiter

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

HOW to get MFCC from an FFT on a signal?

Tags:

logging

signal-processing

fft

Pavan

People also ask

1 Answers

alfakini

Recent Activity

Donate For Us