Voice Recognition Software For Developers [closed]

Tags:

Well the docs finally said it, I need to take it easy on my wrist for a few months. Being that I'm a .NET Developer this could end my livelihood for a little while, something I'm not anxious to do. That said, are there any good handsfree options for developers? Anyone had success using any of the speech recognition software out there?

POSTSCRIPT: I've recovered my arm again to the point where two-handed programming isn't a problem. Dragon Naturally speaking worked well enough, but was slower, not like the keyboard where I was programming faster than I thought.

240

asked Sep 17 '08 21:09

tekiegreg

2 Answers

It's out there, and it works...

There are quite a few speech recognition programs out there, of which Dragon NaturallySpeaking is, I think, one of the most widely used ones. I've used it myself, and have been impressed with its quality. That being a couple of years ago, I guess things have improved even further by now.

...but it ain't easy...

Even though it works amazingly well, I won't say it's an easy solution. It takes time to train the program, and even then, it'll make mistakes. It's painstakingly slow compared to typing, so I had to keep saying to myself "Don't grab the keyboard, don't grab the keyboard, ..." (after which I'd grab the keyboard anyway). I myself tend to mumble a bit, which didn't make things much better, either ;-). Especially the first weeks can be frustrating. You can even get voice-related problems if you strain your voice too much.

...especially for programmers!

All in all, it's certainly a workable solution for people writing normal text/prose. As a programmer, you're in a completely different realm, for which there are no real solutions. Things might have changed by now, but I'd be surprised if they have.

What's the problem? Most SR software is built to recognize normal language. Programmers write very cryptic stuff, and it's hard, if not impossible, to find software that does the conversion between normal language and code. For example, how would you dictate:

if (somevar == 'a') {    print('You pressed a!'); }

Using the commands in your average SR program, this is a huge pain: "if space left bracket equal sign equal sign apostrophe spell a apostrophe ...". And I'm not even talking about navigating your code. Ever noticed how much you're using the keyboard while programming, and how different that usage is from how a 'normal' user uses the keyboard?

How to make the best of it

Thus far, I've only worked with Dragon NaturallySpeaking (DNS), so I can only speak for that product. There are some interesting add-ons and websites targeted for people like programmers:

Vocola is an unofficial plugin that allows you to easily add your own commands to DNS. I found it essential, basically. You'll also be able to find command sets written by other programmers, for e.g. navigating code. It's based on a software package written in Python, so there are also some more advanced and fancy packages around. Also check out Vocola's Resources page. (Warning: when I used it, there were some problems with installing Vocola; check out the newsgroup below for info!)
SpeechComputing.com is a forum/newsgroup with lots of interesting discussions. A good place to start.

Closing remarks

It seems that the best solution to this problem is, really:

Find ways around actual coding.
Try to recover. I'm somewhat reluctant to recommend this book, but it seems to work amazingly well for people with RSI/carpal tunnel and other chronic pain issues: J.E. Sarno, Mindbody prescription. I'm working with it right now, and I think it's definitely worth reading.

113

answered Oct 07 '22 02:10

onnodb

I dictate VB.net and TSQL using Dragon NaturallySpeaking 10 Professional. VB.net is inherently closer to a "spoken" language, but I don't see any reason why it couldn't work for C# or others. I start with a completely empty vocabulary, and build it from scratch to suit my needs (which is why I use the professional version).

Here's the basic steps (this assumes you have already created and trained a user):

Create a new vocabulary based on "Base General - Empty Dictation".
Don't have it scan your documents or email.
Add lists of keywords with pronunciation specific to your programming language (Dim, ByVal\by-val, etc.).
Create a .txt document that contains all of your code minus comments.
Harvest words from this document and add them with pronunciations.
Use the document to train the vocabulary's language model.

I'll write up something with more detail when I get a chance if anyone is interested.

Edit:

Here's how to dictate SQL code. The word list created here can be included in other vocabularies if you are a database developer.

answered Oct 07 '22 01:10

Keith Walton

Related questions
                            
                                Does iOS provide built in text to speech support or any class like NSSpeechRecognizer?
                            
                                Fastest Speech recognition library C++ [closed]
                            
                                iPhone: Speech Recognition is in IOS SDK available?
                            
                                Using the Android RecognizerIntent with a bluetooth headset
                            
                                good Speech recognition API
                            
                                record/save audio from voice recognition intent
                            
                                portaudio.h: No such file or directory
                            
                                Speech Recognition & Programming [closed]
                            
                                Google Speech Recognition timeout
                            
                                How do you enable a microphone input in the android emulator
                            
                                Voice recognition on android with recorded sound clip?
                            
                                example of AlwaysOnHotwordDetector in Android
                            
                                Split speech audio file on words in python
                            
                                onServiceConnected never called after bindService method
                            
                                Can I write SQL using speech recognition?
                            
                                Saving audio input of Android Stock speech recognition engine
                            
                                Voice Recognition stops listening after a few seconds
                            
                                What are language codes in Chrome's implementation of the HTML5 speech recognition API?
                            
                                How to use Speech Recognition inside the iOS SDK? [closed]
                            
                                What does gs protocol mean?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Voice Recognition Software For Developers [closed]

Tags:

ergonomics

speech-recognition

speech

voice

code-by-voice