Is there any way to send an audio file to the speech-to-text recognition?

2 Answers

I suppose it works in a similar way to the Chrome API: http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/

As the author of that post mentions, you can convert the recorded microphone audio into a .flac file and send it to the speech API, and you will get the same result. So you can use SoX to do the conversion yourself.
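A minimal sketch of that conversion step, assuming the `sox` binary is installed and on the PATH of the machine doing the conversion (more likely a desktop or server than the phone itself). The class name `FlacConverter`, the 16 kHz mono target, and the file names are illustrative assumptions, not something the linked post prescribes.

```java
import java.io.IOException;
import java.util.List;

/** Hypothetical helper that wraps the SoX command line for WAV-to-FLAC conversion. */
final class FlacConverter {

    /** Builds the argument list: sox <input> -r <rate> -c 1 <output>. */
    static List<String> soxCommand(String input, String output, int sampleRate) {
        return List.of("sox", input,
                "-r", Integer.toString(sampleRate), // output sample rate in Hz
                "-c", "1",                          // mix down to a single channel
                output);                            // SoX infers FLAC from the .flac extension
    }

    /** Runs SoX and throws if it exits with a non-zero status. */
    static void convert(String input, String output, int sampleRate)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(soxCommand(input, output, sampleRate))
                .inheritIO() // show SoX warnings and errors on our own console
                .start();
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("sox exited with status " + exitCode);
        }
    }

    public static void main(String[] args) {
        // Prints: sox speech.wav -r 16000 -c 1 speech.flac
        System.out.println(String.join(" ", soxCommand("speech.wav", "speech.flac", 16000)));
    }
}
```

The resulting .flac file is what would then be sent to the speech API; the sample rate chosen here should match whatever rate the request declares.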

Hope it helps. Dias

answered Oct 21 '22 11:10

wizgot

cmusphinx.sourceforge.net/wiki/tutorialandroid Just found that link; it sounds like someone has created an Android version of Sphinx.

Looking at the Android API, this doesn't seem to be supported (http://developer.android.com/reference/android/speech/package-summary.html).

You might be able to do it using another API.

I know that Microsoft's C# API allows this, but for that to be useful you would probably need to set up a server running a program you wrote, record the sound file on the phone, and then send the file to the server.
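The phone-side half of that idea could look roughly like the sketch below: POST the recorded file to a server you run yourself and read back whatever text it returns. The class name `AudioUploader`, the endpoint URL, and the assumption that the server answers with plain UTF-8 text are all hypothetical; the server program itself still has to be written. It uses only `HttpURLConnection`, which exists on both Android and desktop Java. On Android the call would have to run off the main thread.

```java
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

/** Hypothetical client for a self-hosted recognition server. */
final class AudioUploader {

    /** Copies all bytes from in to out and returns how many were copied. */
    static long copy(InputStream in, OutputStream out) throws IOException {
        byte[] buffer = new byte[8192];
        long total = 0;
        int read;
        while ((read = in.read(buffer)) != -1) {
            out.write(buffer, 0, read);
            total += read;
        }
        return total;
    }

    /** POSTs the audio file as the request body and returns the response body as text. */
    static String upload(URL endpoint, File audio, String contentType) throws IOException {
        HttpURLConnection connection = (HttpURLConnection) endpoint.openConnection();
        try {
            connection.setRequestMethod("POST");
            connection.setDoOutput(true); // we are going to write a request body
            connection.setRequestProperty("Content-Type", contentType);
            try (InputStream in = new FileInputStream(audio);
                 OutputStream out = connection.getOutputStream()) {
                copy(in, out);
            }
            ByteArrayOutputStream body = new ByteArrayOutputStream();
            try (InputStream response = connection.getInputStream()) {
                copy(response, body);
            }
            return new String(body.toByteArray(), StandardCharsets.UTF_8);
        } finally {
            connection.disconnect();
        }
    }
}
```

A call might look like `AudioUploader.upload(new URL("https://example.com/recognize"), new File("speech.wav"), "audio/wav")`, where the URL is a placeholder for your own server.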

CMUSphinx (http://cmusphinx.sourceforge.net/wiki/) has a Java implementation, Sphinx4, so it might be possible to get it running on an Android device. With that API you create a StreamSpeechRecognizer:

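// Needs sphinx4 (edu.cmu.sphinx.api.*) and java.io.FileInputStream; `configuration` is an already populated Configuration.
StreamSpeechRecognizer recognizer = new StreamSpeechRecognizer(configuration);
// Current sphinx4 releases take an InputStream here; older tutorial versions passed a URL.
recognizer.startRecognition(new FileInputStream("speech.wav"));
SpeechResult result = recognizer.getResult(); // next recognized utterance, or null at end of stream
recognizer.stopRecognition();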
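Before handing a file to the recognizer, it may be worth checking its format. The sketch below assumes the commonly documented requirement of the default Sphinx4 models (16 kHz, 16-bit, mono, little-endian signed PCM); if your models differ, adjust the constants. The class name `WavFormatCheck` is made up for illustration, and `javax.sound.sampled` is a desktop Java API that is not available on Android, so this applies to the server or desktop case only.

```java
import java.io.File;
import java.io.IOException;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioSystem;
import javax.sound.sampled.UnsupportedAudioFileException;

/** Hypothetical pre-flight check for audio passed to the recognizer. */
final class WavFormatCheck {

    /** True if the format is 16 kHz, 16-bit, mono, little-endian signed PCM (assumed requirement). */
    static boolean isSphinxFriendly(AudioFormat format) {
        return AudioFormat.Encoding.PCM_SIGNED.equals(format.getEncoding())
                && format.getSampleRate() == 16000f
                && format.getSampleSizeInBits() == 16
                && format.getChannels() == 1
                && !format.isBigEndian();
    }

    /** Reads the header of an audio file and returns its format. */
    static AudioFormat readFormat(File wav) throws IOException, UnsupportedAudioFileException {
        return AudioSystem.getAudioFileFormat(wav).getFormat();
    }
}
```

For example, `WavFormatCheck.isSphinxFriendly(WavFormatCheck.readFormat(new File("speech.wav")))` would tell you whether the file needs to be resampled (for instance with SoX) first.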

I found this https://gist.github.com/alotaiba/1730160 with a quick web search (Google "speech recognition api accepts file"), so there may be other web services that accept an uploaded audio file.

answered Oct 21 '22 12:10

Travis



Tags: file, android, speech-recognition, wav

nonozor
