wav-to-midi conversion

Tags: midi, wav, file-conversion, pitch-tracking

I'm new to this field, but I need to perform a WAV-to-MIDI conversion in Java. What exactly are the steps involved? I have a very rough idea: sample the WAV file, filter it, use an FFT for spectral analysis, extract features, and then write the extracted features to MIDI. But I cannot find solid sources or papers on how to do all of that. Can someone give me pointers on how and where to start? Are there any open-source APIs available for WAV-to-MIDI conversion?

Thanks in advance.


asked Jan 24 '10 06:01

Dolphin

2 Answers

This field is still under active development, but some (experimental) algorithms are available.

You can install Sonic Annotator and run a few Vamp plugins with it.

For example:


./sonic-annotator file.wav -d vamp:qm-vamp-plugins:qm-transcription:transcription -w midi

./sonic-annotator file.wav -d vamp:silvet:silvet:notes -w midi

./sonic-annotator file.wav -d vamp:ua-vamp-plugins:mf0ua:mf0ua -w midi
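Since the question asks about Java, the MIDI files written by commands like these could then be inspected with the standard javax.sound.midi package. The following is only a minimal sketch: the class name MidiNoteLister and the method noteOns are made up for illustration, and it assumes the output is a standard MIDI file containing ordinary note-on messages.

```java
import javax.sound.midi.*;
import java.io.*;
import java.util.*;

public class MidiNoteLister {
    /**
     * Returns the MIDI note numbers of all note-on events with velocity > 0,
     * track by track. The stream must support mark/reset (for example a
     * BufferedInputStream), which MidiSystem.getSequence requires.
     */
    public static List<Integer> noteOns(InputStream in) throws Exception {
        Sequence seq = MidiSystem.getSequence(in);
        List<Integer> notes = new ArrayList<>();
        for (Track track : seq.getTracks()) {
            for (int i = 0; i < track.size(); i++) {
                MidiMessage msg = track.get(i).getMessage();
                if (msg instanceof ShortMessage) {
                    ShortMessage sm = (ShortMessage) msg;
                    // A note-on with velocity 0 conventionally means note-off, so skip it.
                    if (sm.getCommand() == ShortMessage.NOTE_ON && sm.getData2() > 0) {
                        notes.add(sm.getData1());
                    }
                }
            }
        }
        return notes;
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the path of a MIDI file, e.g. one produced by a transcription tool.
        try (InputStream in = new BufferedInputStream(new FileInputStream(args[0]))) {
            System.out.println(noteOns(in));
        }
    }
}
```

Note number 60 is middle C and 69 is the A at 440 Hz, so printing the list gives a quick sanity check of what a plugin actually transcribed.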


answered Oct 24 '22 09:10

dorien

It's a more involved process than you might imagine.

This research problem is often referred to as music transcription: the act of converting a low-level representation of music (e.g., a waveform) into a higher-level representation such as MIDI or even sheet music.

The sophistication of your solution will depend upon the complexity of your input data. Tons of research papers address music transcription only on monophonic piano or drums... because they are easy to transcribe. (Relatively.) Violin is harder. Voice is even harder. Violin plus voice plus piano is much harder. A symphony is nearly impossible. You get the picture.

The basic elements of music transcription involve any of the following overlapping areas:

(multi)pitch estimation
instrument recognition, timbral modeling
rhythm detection
note onset/offset detection
form/structure modeling
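To make the first item concrete, here is a rough Java sketch of single-pitch estimation on one frame of a monophonic signal, followed by the standard frequency-to-MIDI-note mapping (A4 = 440 Hz = note 69). It uses plain autocorrelation, which is only one of many possible estimators and is not taken from any particular paper or library; the names PitchSketch, estimatePitch, and frequencyToMidi are hypothetical. It will not handle polyphony.

```java
public class PitchSketch {
    /** Unnormalized autocorrelation of x at the given lag. */
    static double autocorr(double[] x, int lag) {
        double sum = 0;
        for (int i = 0; i + lag < x.length; i++) {
            sum += x[i] * x[i + lag];
        }
        return sum;
    }

    /**
     * Estimates the fundamental frequency in Hz of one monophonic frame,
     * searching between minHz and maxHz. Returns -1 if no peak is found.
     */
    public static double estimatePitch(double[] x, double sampleRate,
                                       double minHz, double maxHz) {
        int minLag = Math.max(1, (int) Math.floor(sampleRate / maxHz));
        int maxLag = Math.min((int) Math.ceil(sampleRate / minHz), x.length - 1);

        // Skip the lobe around lag 0, where any signal correlates with itself.
        int lag = 1;
        while (lag <= maxLag && autocorr(x, lag) > 0) {
            lag++;
        }

        // The strongest remaining peak is taken as one period of the signal.
        int bestLag = -1;
        double best = 0;
        for (lag = Math.max(lag, minLag); lag <= maxLag; lag++) {
            double r = autocorr(x, lag);
            if (r > best) {
                best = r;
                bestLag = lag;
            }
        }
        return bestLag > 0 ? sampleRate / bestLag : -1;
    }

    /** Maps a frequency in Hz to the nearest MIDI note number (A4 = 440 Hz = 69). */
    public static int frequencyToMidi(double hz) {
        return (int) Math.round(69 + 12 * Math.log(hz / 440.0) / Math.log(2));
    }
}
```

Because the lag is an integer number of samples, the estimate is quantized (at 44.1 kHz, a 440 Hz tone comes out near 441 Hz), which is usually close enough to round to the right MIDI note. A real transcriber would also need onset/offset detection to decide when each note starts and stops.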

Search for papers on "music transcription" on Google Scholar or from the ISMIR proceedings: http://www.ismir.net. If you are more interested in one of the above subtopics, I can point you further. Good luck.

EDIT: That being said, there are existing solutions that we can all find on the web. Feel free to try them. But as you do, evaluate them with a critical eye and ear. What types of audio signals would cause transcription to fail?

EDIT 2: Ah, you are only doing this for piano. Okay, this is doable. Music transcription has advanced to the point where it can transcribe monophonic piano pretty well. A Rachmaninov concerto will still pose problems.

Our recommendations depend upon your end goal. You state "need to perform... in Java." So it sounds like you just want something to work regardless of how it gets you there. In that case, I agree 100% with others: use something that exists.

That's actually an interesting question; all of the MIR libraries I know are typically C/C++/Python/Matlab. But not Java. The EchoNest has a Java API, but I don't think it does note-level transcription. http://developer.echonest.com. (Edit: It does note-level transcription. The returned data includes pitch, timbre, beat, tatum, and more. But I find polyphony is still a problem.)

Oh, Marsyas is Java-based. Cool. I thought it was just C++. http://marsyas.info/ I recommend this. It's developed by George Tzanetakis, a professor in MIR. It does signal-level analysis and should be a good option.

Now, if this is for a fun learning experience, I think you can use the sound manipulation utilities in Java to experiment with the WAV signal and see what comes out.
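As a starting point for that kind of experiment, the sketch below loads a WAV stream into an array of samples using the standard javax.sound.sampled package. It is a simplified illustration under stated assumptions: it only handles 16-bit signed PCM, it averages all channels down to mono, and the names WavSamples and readMono are invented for this example.

```java
import javax.sound.sampled.*;
import java.io.*;

public class WavSamples {
    /**
     * Decodes 16-bit signed PCM audio into doubles in [-1, 1), averaging
     * channels to mono. The stream must support mark/reset (for example a
     * BufferedInputStream), which AudioSystem.getAudioInputStream requires.
     */
    public static double[] readMono(InputStream in) throws Exception {
        try (AudioInputStream ais = AudioSystem.getAudioInputStream(in)) {
            AudioFormat f = ais.getFormat();
            if (!AudioFormat.Encoding.PCM_SIGNED.equals(f.getEncoding())
                    || f.getSampleSizeInBits() != 16) {
                throw new UnsupportedAudioFileException(
                        "this sketch handles 16-bit signed PCM only");
            }

            // Read whole frames into memory.
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            byte[] chunk = new byte[f.getFrameSize() * 1024];
            int n;
            while ((n = ais.read(chunk)) > 0) {
                buf.write(chunk, 0, n);
            }
            byte[] bytes = buf.toByteArray();

            int channels = f.getChannels();
            boolean bigEndian = f.isBigEndian();
            int frames = bytes.length / (2 * channels);
            double[] out = new double[frames];
            for (int i = 0; i < frames; i++) {
                double sum = 0;
                for (int c = 0; c < channels; c++) {
                    int p = 2 * (i * channels + c);
                    int hi = bigEndian ? bytes[p] : bytes[p + 1];           // keeps the sign
                    int lo = (bigEndian ? bytes[p + 1] : bytes[p]) & 0xFF;  // unsigned low byte
                    sum += ((hi << 8) | lo) / 32768.0;
                }
                out[i] = sum / channels;
            }
            return out;
        }
    }
}
```

A typical call would be readMono(new BufferedInputStream(new FileInputStream("file.wav"))). Any later analysis also needs the sample rate, which is available from AudioFormat.getSampleRate() on the same stream.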

EDIT: This page describes MIR software better than I can: The Tools We Use

For Matlab, you may be interested in the MIR Toolbox

Here is a nice page of common datasets: MIR Datasets

answered Oct 24 '22 09:10

Steve Tjoa
