
Is there an API (or any hacks) to access Enhanced Dictation in Mac OS X Mavericks?

I am trying to find an easy way to transcribe an audio file to text (CMU Sphinx, Julius, etc. are difficult for someone not knowledgeable about voice recognition, configuring language models, acoustic models, etc.).

I wondered if there was a way to pipe my audio file into the "Enhanced Dictation" feature of Mac OS X 10.9 Mavericks, which allows for local, offline voice dictation.

I thought I was being clever when I ran a patch cord from my headphone jack to my line in, but unfortunately, when you start dictating, it mutes all other audio that is playing (any suggestion on how to disable this muting will get an accepted answer from me).

winduptoy asked Nov 02 '22 12:11



1 Answer

I haven't found a direct way of doing this. However, you could use Soundflower, a virtual audio device that routes one application's output to another application's input, as a workaround.

For example, in VLC choose Audio->Audio Device->Soundflower (2ch) as the output. Then, in System Preferences->Dictation & Speech->Dictation, select Soundflower (2ch) from the drop-down under the microphone icon as the input. Start playback in VLC, then start dictation listening (for example in TextEdit), and you should see the transcription appear. The downside to this approach is that it is slow (limited to roughly real-time playback of the audio) and not very conducive to an automated workflow.

Note: you have to start audio playback before switching to TextEdit and initiating Enhanced Dictation.
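Because the audio has to play through at normal speed, it may help to know up front how long a batch of files will take. Here is a minimal sketch, assuming the files are uncompressed WAV (Python's standard `wave` module does not read MP3 or AAC). The function names and the `setup_seconds` allowance for switching windows and starting dictation are my own illustrative choices, not part of any dictation API.

```python
import wave


def wav_duration_seconds(path):
    """Return the playing time of a WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        # Duration is the number of audio frames divided by frames per second.
        return wav.getnframes() / wav.getframerate()


def estimated_pass_seconds(paths, setup_seconds=5.0):
    """Estimate the total time for a real-time playback pass over several files.

    setup_seconds is a hypothetical per-file allowance for starting playback
    and switching to the dictation window; adjust it to your own workflow.
    """
    return sum(wav_duration_seconds(p) + setup_seconds for p in paths)
```

For instance, `estimated_pass_seconds(["a.wav", "b.wav"])` gives the combined playing time of both files plus ten seconds of assumed overhead, which is roughly the minimum you would spend on the Soundflower approach.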

user2934244 answered Nov 08 '22 22:11
