How does music fingerprinting work (for sites such as Shazam and Lala.com)?

Tags:

My large (120gb) music collection contains many duplicate songs, and I've been trying to fingerprint tracks in the hopes of detecting duplicates. And since I'm a CS Major I'm very curious as to what is done out there? Nothing I do has nearly the accuracy of something like Shazam or Lala.com. How do they "hash" tracks? I have run a standard MD5 hash on all my files (26,000 files) and I found hundreds of equal hashes on different tracks, so that doesn't work.

I'm more interested in Lala.com since they work with full files, unlike Shazam, but I'm assuming both use a similar technique. Can anyone explain how to generate unique identifiers for music?

836

asked Jan 12 '10 04:01

Niels Joubert

2 Answers

The seminal paper on audio fingerprinting is the work by Haitsma and Kalker in 2002-03. For each frame of audio, it preprocesses (differences across time frames and frequency bands) and then stores a binarized version of the frame's spectrum.

This procedure adds robustness. If the entire signal is shifted in time, it still works (at least, one can derive a lower bound on performance degradation). It is pretty robust to environmental noise. Since its inception, there have been many papers on low-level music similarity, so there is no single answer.

Do you have absolutely identical files, i.e., the signals are time aligned, bit depth is the same, sampling rate is the same? Then I would think a hash like MD5 should work. But if any of those parameters are changed, so will the hashes. In such an event, a procedure like the one mentioned earlier would work better.

Take a look at the ISMIR proceedings available free online. Fun stuff. http://www.ismir.net/

169

answered Oct 27 '22 17:10

Steve Tjoa

There are a lot of algorithms for acoustic fingerprinting. Some of the more popular ones are:

AMG LASSO
AudioID
LibFooID

In fact libfooId is opensource , so you can check out its code in google-code!!

answered Oct 27 '22 17:10

Suraj Chandran

Related questions
                            
                                Why Does WPF Swallow Databinding Exceptions?
                            
                                what is the difference between synchronous apis and asynchronous apis?
                            
                                Close all HTML unclosed IMG tags
                            
                                Are all CSS font-weight property's values useful?
                            
                                How to disable editing my history in bash
                            
                                What is the complexity of OrderedDictionary?
                            
                                Event Capturing vs Event Bubbling [duplicate]
                            
                                jQuery ajax upload with progress bar - no flash
                            
                                'schema' design for a social network
                            
                                Redirecting before POST upload has been completed
                            
                                Open local KML File in Google Maps on Android
                            
                                Shift count negative or too big error - correct solution?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With