Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Is there a speech to text API or some "hack" by which I can use Google's speech to text facilities?

I think Google's speech to text facilities (Google Voice automatic transcription of voicemail, automatic captioning of videos on YouTube etc) are quite impressive.

I did look to see whether Google has made it available through an API and it seems they haven't (not that I blame them!). A cloud computing service providing speech to text functionality would be pretty cool though.

Is there some sort of "hack" that I can use to access the speech to text. My architecture basically comes down to this - a short 15-20 second wav/mp3/other clip as the input, output is plaintext.

Any ideas people?

like image 972
user245120 Avatar asked Jan 23 '23 02:01

user245120


2 Answers

There are a lot of speech to text APIs. Just because Google doesn't make theirs available, it doesn't mean you're out of luck.

Here is a good one for C#. You can search for others for your platform if it's not .NET.

http://cmusphinx.sourceforge.net/

like image 140
Samuel Neff Avatar answered Jan 25 '23 16:01

Samuel Neff


Check this out: http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/

I am currently trying to implement the API in PHP.

--Seth

like image 26
Westy92 Avatar answered Jan 25 '23 14:01

Westy92