<h3>Introduction</h3> Android provides two ways for me to use speech recognition. The first way is by an <code>Intent</code>, as in this question: Intent example. A new <code>Activity</code> is pushed onto the top of the stack which listens to the user, hears some speech, attempts to transcribes it (normally via the cloud) then returns the result to my app, via an <code>onActivityResult</code> call. The second is by getting a <code>SpeechRecognizer</code>, like the code here: SpeechRecognizer example. Here, it looks like the speech is recorded and transcribed on some other thread, then callbacks bring me the results. And this is done without leaving my <code>Activity</code>. I would like to understand the pros and cons of these two ways of doing speech recognition. <h3>What I've got so far</h3> Using the <code>Intent</code>: <ul> <li>is simple to code</li> <li>avoids reinventing the wheel</li> <li>gives consistent user experience of speech recognition across the device</li> </ul> but <ul> <li>might be slow for the creation of a new activity with it's own window</li> </ul> Using the <code>SpeechRecognizer</code>: <ul> <li>lets me retain control of UI in my app</li> <li>gives me extra possibilities of things to respond to (documentation)</li> </ul> but <ul> <li>is limited to be called from the main thread</li> <li>more control requires more error-checking.</li> </ul>

In addition to all this, I'd add at least this point: <code>SpeechRecognizer</code> is better for hands-free user interfaces, since your app actually gets to respond to error conditions like "No matches" and perhaps restart itself. When you use the <code>Intent</code>, the app beeps and shows a dialog that the user must press to continue. My summary is as follows: SpeechRecognizer <ul> <li>Show different UI or no UI at all. Do you really want your app's UI to beep? Do you really want your UI to show a dialog when there is an error and wait for user to click?</li> <li>App can do something else while speech recognition is happening</li> <li>Can recognize speech while running in the background or from a service</li> <li>Can Handle errors better</li> <li>Can access low level speech stuff like the raw audio or the RMS. Analyze that audio or use the loudness to make some kind of flashing light to indicate the app is listening</li> </ul> Intent <ul> <li>Consistent, and easy to use UI for users</li> <li>Easy to program</li> </ul>

The main difference is UI. <code>SpeechRecognizer</code> doesn't have any so you are responsible for creating one. I use to wrote a prototype where I've have receiver for listening headset button, then activating speech recognition to listen for some commands. Screen was not activated so I had to use <code>SpeechRecognizer</code> (my UI was some prerecorded sounds and Text To Speech). Second difference is that <code>SpeechRecognizer</code> has ability for constant listening. Intent version will always end exaction after some period. For example <code>SpeechRecognizer</code> is used by speech recognition "keyboard" so you can dictate a SMS. In such case you will receive partial results only (in normal mode <code>SpeechRecognizer</code> gives only final results).

Comparison of Speech Recognition use in Android: by Intent or on-thread?

Introduction

Android provides two ways for me to use speech recognition.

The first way is by an Intent, as in this question: Intent example. A new Activity is pushed onto the top of the stack which listens to the user, hears some speech, attempts to transcribes it (normally via the cloud) then returns the result to my app, via an onActivityResult call.

The second is by getting a SpeechRecognizer, like the code here: SpeechRecognizer example. Here, it looks like the speech is recorded and transcribed on some other thread, then callbacks bring me the results. And this is done without leaving my Activity.

I would like to understand the pros and cons of these two ways of doing speech recognition.

What I've got so far

Using the Intent:

is simple to code
avoids reinventing the wheel
gives consistent user experience of speech recognition across the device

but

might be slow for the creation of a new activity with it's own window

Using the SpeechRecognizer:

lets me retain control of UI in my app
gives me extra possibilities of things to respond to (documentation)

but

is limited to be called from the main thread
more control requires more error-checking.

905

asked Aug 11 '12 10:08

hcarver

2 Answers

In addition to all this, I'd add at least this point:

SpeechRecognizer is better for hands-free user interfaces, since your app actually gets to respond to error conditions like "No matches" and perhaps restart itself. When you use the Intent, the app beeps and shows a dialog that the user must press to continue.

My summary is as follows:

SpeechRecognizer

Show different UI or no UI at all. Do you really want your app's UI to beep? Do you really want your UI to show a dialog when there is an error and wait for user to click?
App can do something else while speech recognition is happening
Can recognize speech while running in the background or from a service
Can Handle errors better
Can access low level speech stuff like the raw audio or the RMS. Analyze that audio or use the loudness to make some kind of flashing light to indicate the app is listening

Intent

Consistent, and easy to use UI for users
Easy to program

171

answered Oct 21 '22 12:10

gregm

The main difference is UI. SpeechRecognizer doesn't have any so you are responsible for creating one.
I use to wrote a prototype where I've have receiver for listening headset button, then activating speech recognition to listen for some commands. Screen was not activated so I had to use SpeechRecognizer (my UI was some prerecorded sounds and Text To Speech).

Second difference is that SpeechRecognizer has ability for constant listening. Intent version will always end exaction after some period. For example SpeechRecognizer is used by speech recognition "keyboard" so you can dictate a SMS.
In such case you will receive partial results only (in normal mode SpeechRecognizer gives only final results).

answered Oct 21 '22 12:10

Marek R

Related questions
                            
                                Android: Force data to be sent over radio vs WiFi
                            
                                R.string; get string from dynamic key name [duplicate]
                            
                                how to scale an image in an ImageView so that it "fits"
                            
                                Android fill percent of layout
                            
                                How to implement XMPP to send push notifications
                            
                                how to use Digest authentication in android?
                            
                                Android Market - developer console: marketing enabled / disabled?
                            
                                Good software engineering vs. Security
                            
                                ZTE V9 not detected by ADB
                            
                                Keeping the same SQLite database when upgrading and Android app from Lite to Pro version
                            
                                access (faster polling) accelerometer via NativeActivity NDK
                            
                                When does the probe function for a Linux kernel driver gets called?
                            
                                How to send a JSON object over HttpClient Request with Android?
                            
                                How does the Apple color emoji font work, and is there an Android version?
                            
                                PopupWindow z ordering
                            
                                How can Android service update the UI of the activity that started it?
                            
                                How do I use android libraries (apklibs) with maven and eclipse?
                            
                                The best way to create drop down menu in android 2.x like in ICS
                            
                                Increase the Android API level during app update
                            
                                Why is LogCat showing all items as warnings (orange)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Comparison of Speech Recognition use in Android: by Intent or on-thread?

Tags:

android

android-intent

speech-recognition

speech-to-text