
Google Cloud Speech API returning empty result

I was using the Chromium Google Speech API and recently switched to the Google Cloud Speech API. Since the Google Cloud Speech API was announced, recognition accuracy seems to have degraded. I am also seeing more and more empty results come back for streamed audio.

I stream audio simultaneously to several services, and the Google Cloud Speech API returns an empty result while some of the other services return transcribed text. This makes me wonder whether something has changed in how the Chromium Speech API and the Google Cloud Speech API work.

I verified that the audio has proper headers and that it is actually being streamed to Google.

Is anyone else seeing Google return an empty result sometimes (in my case, most of the time)?

rajanb asked Sep 26 '16 21:09



2 Answers

This type of question is better suited to the Public Issue Tracker, since reproducing your exact errors would require further details. Make sure to fill in this form with the required information, or at least with a minimal working example of your code that clearly highlights the problem. For an accurate reproduction, it would be important to provide the sample code or commands that you ran and that returned the error, along with the configuration files and the URIs (or files) of the audio you streamed that returned empty results.

In fact, there are known issues with the Speech API, which is currently in Beta, that may prevent transcription from working correctly. In the meantime, you may refer to the following documentation to see whether any of the best practices apply to your case.

Alex answered Sep 29 '22 09:09



I was also receiving empty responses, but eventually got results by re-encoding the audio as raw (headerless) mono PCM: 16-bit signed integer samples, little-endian, at 16 kHz.

sox async.wav -t raw --channels=1 --bits=16 --rate=16000 --encoding=signed-integer --endian=little async.raw
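
If you want to check what format a WAV file is in before sending it, here is a minimal Python sketch using only the standard-library wave module. The target values (1 channel, 16 bits, 16000 Hz) are taken from the sox command above; the function names describe_wav, matches_target and wav_to_raw are made up for this illustration. Note the assumption: this only strips the header from a file that already matches the target format. It does not resample or downmix, so for anything else you still need sox.

```python
import wave

# Target format taken from the sox command above: mono, 16-bit, 16 kHz.
TARGET_CHANNELS = 1
TARGET_SAMPLE_WIDTH = 2   # bytes per sample (16 bits)
TARGET_RATE = 16000       # Hz


def describe_wav(path):
    """Return (channels, sample_width_bytes, frame_rate) read from the WAV header."""
    with wave.open(path, "rb") as wav:
        return wav.getnchannels(), wav.getsampwidth(), wav.getframerate()


def matches_target(path):
    """True if the WAV file is already mono, 16-bit, 16 kHz PCM."""
    return describe_wav(path) == (TARGET_CHANNELS, TARGET_SAMPLE_WIDTH, TARGET_RATE)


def wav_to_raw(wav_path, raw_path):
    """Write the PCM frames without the WAV header; no resampling is done.

    Returns the number of bytes written. Raises ValueError if the input is
    not already in the target format (hypothetical policy for this sketch).
    """
    if not matches_target(wav_path):
        raise ValueError(
            "unexpected format %r; convert with sox first" % (describe_wav(wav_path),)
        )
    with wave.open(wav_path, "rb") as wav:
        # 16-bit WAV samples are stored as signed little-endian integers,
        # so the frames can be written out unchanged.
        frames = wav.readframes(wav.getnframes())
    with open(raw_path, "wb") as out:
        out.write(frames)
    return len(frames)
```

Calling describe_wav("async.wav") on the file you are streaming is a quick way to see whether the sample rate or channel count differs from what you declared in the request. Whether a mismatch is the cause of the empty results in your case is something you would still have to confirm.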

Nat Taylor answered Sep 29 '22 09:09
