Is there a way to use the SpeechRecognizer API directly for speech input?

Tags:

The Android Dev website provides an example of doing speech input using the built-in Google Speech Input Activity. The activity displays a pre-configured pop-up with the mic and passes its results using onActivityResult()

My question: Is there a way to use the SpeechRecognizer class directly to do speech input without displaying the canned activity? This would let me build my own Activity for voice input.

740

asked Feb 12 '11 00:02

vladimir.vivien

1 Answers

Here is the code using SpeechRecognizer class (sourced from here and here):

import android.app.Activity; import android.content.Intent; import android.os.Bundle; import android.view.View; import android.view.View.OnClickListener; import android.speech.RecognitionListener; import android.speech.RecognizerIntent; import android.speech.SpeechRecognizer; import android.widget.Button; import android.widget.TextView; import java.util.ArrayList; import android.util.Log;    public class VoiceRecognitionTest extends Activity implements OnClickListener  {     private TextView mText;    private SpeechRecognizer sr;    private static final String TAG = "MyStt3Activity";    @Override    public void onCreate(Bundle savedInstanceState)     {             super.onCreate(savedInstanceState);             setContentView(R.layout.main);             Button speakButton = (Button) findViewById(R.id.btn_speak);                  mText = (TextView) findViewById(R.id.textView1);                  speakButton.setOnClickListener(this);             sr = SpeechRecognizer.createSpeechRecognizer(this);                    sr.setRecognitionListener(new listener());            }     class listener implements RecognitionListener              {             public void onReadyForSpeech(Bundle params)             {                      Log.d(TAG, "onReadyForSpeech");             }             public void onBeginningOfSpeech()             {                      Log.d(TAG, "onBeginningOfSpeech");             }             public void onRmsChanged(float rmsdB)             {                      Log.d(TAG, "onRmsChanged");             }             public void onBufferReceived(byte[] buffer)             {                      Log.d(TAG, "onBufferReceived");             }             public void onEndOfSpeech()             {                      Log.d(TAG, "onEndofSpeech");             }             public void onError(int error)             {                      Log.d(TAG,  "error " +  error);                      mText.setText("error " + error);             }             public void onResults(Bundle results)                                {                      String str = new String();                      Log.d(TAG, "onResults " + results);                      ArrayList data = results.getStringArrayList(SpeechRecognizer.RESULTS_RECOGNITION);                      for (int i = 0; i < data.size(); i++)                      {                                Log.d(TAG, "result " + data.get(i));                                str += data.get(i);                      }                      mText.setText("results: "+String.valueOf(data.size()));                     }             public void onPartialResults(Bundle partialResults)             {                      Log.d(TAG, "onPartialResults");             }             public void onEvent(int eventType, Bundle params)             {                      Log.d(TAG, "onEvent " + eventType);             }    }    public void onClick(View v) {             if (v.getId() == R.id.btn_speak)              {                 Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);                         intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL,RecognizerIntent.LANGUAGE_MODEL_FREE_FORM);                 intent.putExtra(RecognizerIntent.EXTRA_CALLING_PACKAGE,"voice.recognition.test");                  intent.putExtra(RecognizerIntent.EXTRA_MAX_RESULTS,5);                  sr.startListening(intent);                 Log.i("111111","11111111");             }    } }

Define main.xml with a button and give RECORD_AUDIO permission in manifest

answered Oct 22 '22 12:10

png

Related questions
                            
                                in_array - 'in_object' equivalent?
                            
                                How to replace underscores with spaces using a regex in Javascript
                            
                                Drag the Range of a UI Input Range Slider
                            
                                How can I handle ESC key in Cocoa app?
                            
                                Javascript self executing function "is not a function"
                            
                                Can't Install Curb - Having problems with native extensions.
                            
                                Django: how to do get_or_create() in a threadsafe way?
                            
                                Make google maps plugin black/white or with sepia filter
                            
                                How to use of the Maven Enforcer plugin?
                            
                                execute two shell commands in single exec php statement
                            
                                How can I tell if the image returned from didFinishPickingMediaWithInfo was from the camera or the photoalbum?
                            
                                Checksum on string

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there a way to use the SpeechRecognizer API directly for speech input?

Tags:

vladimir.vivien

People also ask

1 Answers

png

Recent Activity

Donate For Us