I'm using the librosa library to convert music segments into mel-spectrograms to use as inputs for my neural network, as shown in the docs here.
How is this different from MFCCs, if at all? Are there any advantages or disadvantages to using either?
A linear-frequency spectrogram suits applications where all frequencies matter equally, while a mel spectrogram rescales the frequency axis to approximate human auditory perception. That makes log-mel spectrograms a common input representation for audio classification models.
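For reference, a minimal librosa sketch of computing such a log-mel input (the bundled example clip and `n_mels=64` are arbitrary choices, not from the original posts):

```python
import numpy as np
import librosa

# Any audio file works; librosa's bundled example clip is used here
y, sr = librosa.load(librosa.example("trumpet"))

# 64-band mel spectrogram (n_mels is a tunable choice)
S = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=64)

# Log (dB) scaling is the usual form fed to a neural network
S_db = librosa.power_to_db(S, ref=np.max)
print(S_db.shape)  # (64, number_of_frames)
```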
Because the mel-frequency bands used in MFCCs are spaced to mimic the human auditory system, MFCCs work well for characterizing speakers: they have been used, for example, to identify a speaker and even to infer the model of the cell phone that recorded the speech.
The MFCC feature extraction technique consists of windowing the signal, applying the DFT, warping the magnitude spectrum onto a mel scale with a filterbank, taking the log of the filterbank energies, and finally applying the DCT. A sketch of these steps is shown below.
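This is a rough NumPy/librosa sketch of those steps, not librosa's exact implementation; the function name, default parameters, and the small epsilon are illustrative, and it uses a natural log where librosa's `mfcc` uses dB scaling, so the values differ by a constant factor:

```python
import numpy as np
import scipy.fftpack
import librosa

def mfcc_from_scratch(y, sr, n_fft=2048, hop_length=512, n_mels=64, n_mfcc=13):
    # 1. Window the signal and apply the DFT (STFT), take the power spectrum
    power_spec = np.abs(librosa.stft(y, n_fft=n_fft, hop_length=hop_length)) ** 2
    # 2. Warp the spectrum onto the mel scale with a triangular filterbank
    mel_fb = librosa.filters.mel(sr=sr, n_fft=n_fft, n_mels=n_mels)
    mel_spec = mel_fb @ power_spec
    # 3. Take the log of the filterbank energies (epsilon avoids log(0))
    log_mel = np.log(mel_spec + 1e-10)
    # 4. Apply the DCT and keep the first n_mfcc coefficients
    return scipy.fftpack.dct(log_mel, axis=0, type=2, norm="ortho")[:n_mfcc]

y, sr = librosa.load(librosa.example("trumpet"))
print(mfcc_from_scratch(y, sr).shape)  # (13, number_of_frames)
```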
To get MFCCs, compute the DCT of the mel spectrogram. The mel spectrogram is usually log-scaled first.
MFCC is a very compact representation, often using just 13 or 20 coefficients instead of the 32-64 bands of a mel spectrogram. MFCCs are also somewhat decorrelated, which can be beneficial for models that assume independent features, such as Gaussian Mixture Models with diagonal covariances. With lots of data and strong classifiers like Convolutional Neural Networks, the mel spectrogram can often perform better.
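To make the size difference concrete, here is a small comparison (the 64-band/13-coefficient choice mirrors the numbers above; the example clip is arbitrary):

```python
import librosa

y, sr = librosa.load(librosa.example("trumpet"))

mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=64)
mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)

print(mel.shape)   # (64, n_frames): richer, correlated input for a CNN
print(mfcc.shape)  # (13, n_frames): compact, largely decorrelated features
```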
I suppose jonnor's answer is not exactly correct. There are two steps:
1. Take the log of the mel spectrogram.
2. Compute the DCT on the logs.
Moreover, taking logs seems to be "the main part" when training a NN: https://qr.ae/TWtPLD
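A quick sketch of those two steps, checked against `librosa.feature.mfcc`, which applies exactly this log-then-DCT pipeline when given a dB-scaled mel spectrogram (the clip and `n_mfcc=20` are arbitrary):

```python
import numpy as np
import scipy.fftpack
import librosa

y, sr = librosa.load(librosa.example("trumpet"))

# Step 1: take logs of the mel spectrogram (librosa uses dB, i.e. 10*log10)
log_mel = librosa.power_to_db(librosa.feature.melspectrogram(y=y, sr=sr))

# Step 2: compute the DCT on the logs, keep the first 20 coefficients
manual = scipy.fftpack.dct(log_mel, axis=0, type=2, norm="ortho")[:20]

# librosa applies the same DCT when given a precomputed log-mel spectrogram
reference = librosa.feature.mfcc(S=log_mel, n_mfcc=20)
print(np.allclose(manual, reference))  # True
```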