Any idea how i can go about implementing an algorithm for identifying a song, by matching it with a clip of the song or human voice or tune. I dont expect anyone to write out the algo here :), i know it involves a lot of things together, but i would like to know how i can get started. What are the technologies that i will need to consider for doing something like it. I am looking at something similar to Shazaam or Midomi. Also, is there an online service that does the same and provides API for doing it.
Spectogram is the very basis of Shazam's audio fingerprint algorithm. We can think of it as a condensed digital summary of a song. Just like human fingerprints, every song's acoustic fingerprint is unique, and can be easily identified even if there are small variations in data.
Shazam identifies the title of the song you recorded (an unlabeled audio content) by matching the song's fingerprint with the fingerprint of songs in its database. Shazam creates unique fingerprints for songs on its database by using certain data points identified with a spectrogram's help.
Available to users even before Apple's App Store existed, Shazam was one of the first consumer-used AI services. Now a part of the Apple family, Shazam uses intelligent technology to hear and identify songs in just a few seconds.
I saw a presentation at the devoxx conference about this, have a look at it here. And here is a blog post, by the speaker, about the same subject.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With