What I need is an API/Library that will allow me to convert .wav files (or other media files is necessary) to their text equivalent. Does a library/api like this exist?
Speech recognition can be activated when typing on your Android device. If this facility is available in the app you are using, a microphone icon will appear on the keypad. Pressing this activates the speech recognition. Android does have offline speech recognition capabilities.
If your Google Docs voice typing not working on Mac or Windows PC, it may be caused by the following reasons: Google Docs microphone access is not enabled. The microphone settings of Google Docs are incorrect. Your browser has not been updated to the latest version.
Google. Google Speech-to-Text is a well known speech transcription API.
If you'd have searched for java speech recognition, you would've found the Java Speech API or short JSAPI
This is rather typical Question. Anyhow depending on the language you are using there may be many different choices.
Java http://voce.sourceforge.net/
PHP http://www.speechapi.com/ and http://cmusphinx.sourceforge.net/
Basically, the best option for you is to use some online cloud-based API, that will take your .wav input and return you the response in the text.
In this way, your API will be accessible from any language and will take a lot of pain out of your code.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With