What audio file types does Google Cloud Speech API recognize?

I'm trying to use Google's Cloud Speech API. The documentation and code examples are here:

https://cloud.google.com/speech/docs/basics
https://cloud.google.com/speech/docs/rest-tutorial

I can get the sample code to run just fine if I point it at the included file, audio.raw, but not with a short .wav file of my own.

I have no idea what format the audio sample file is:

$ file audio.raw 
audio.raw: data

With my .wav file that has maybe 10 seconds of audio I get an empty result.

I'm aware of this answer.

google cloud speech api returning empty result

My question has been asked before, but it never received an answer.

What types of audio are supported by Cloud Speech API?

I can't imagine that I would have to get the properties of the audio file exactly right for this to work. I assume a common use case (mine, for example) is that someone records a meeting, has no idea of the recording's parameters, and just wants a text file.
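For reference, one possible workaround (just a sketch, not from the official docs; it assumes sox is installed and uses made-up file names like meeting.wav) is to check the recording's parameters and convert it to one of the encodings the API lists, e.g. 16 kHz mono 16-bit FLAC or raw LINEAR16:

$ soxi meeting.wav    # print the sample rate, bit depth and channel count
$ sox meeting.wav --rate 16000 --bits 16 --channels 1 meeting.flac    # lossless FLAC, one of the documented encodings
$ sox meeting.wav --rate 16000 --bits 16 --channels 1 --encoding signed-integer --type raw meeting.raw    # headerless LINEAR16, like the bundled audio.raw

ffmpeg can do the same conversion if sox is not available.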

Asked by Sol, Oct 15 '16 14:10


1 Answer

EDIT May 2020: things seem to have improved and this answer is no longer accurate; see the current docs for details about supported formats (including WAV).


As of 2016, the WAV format does not seem to be supported. The following encodings are documented as supported:

  • LINEAR16 Uncompressed 16-bit signed little-endian samples. This is the only encoding that may be used by speech.asyncrecognize.
  • FLAC This is the recommended encoding for speech.syncrecognize and StreamingRecognize because it uses lossless compression; therefore recognition accuracy is not compromised by a lossy codec. Only 16-bit samples are supported. Not all fields in STREAMINFO are supported.
  • MULAW 8-bit samples that compand 14-bit audio samples using G.711 PCMU/mu-law.
  • AMR Adaptive Multi-Rate Narrowband codec. sampleRate must be 8000 Hz.
  • AMR_WB Adaptive Multi-Rate Wideband codec. sampleRate must be 16000 Hz.

https://cloud.google.com/speech/reference/rest/v1beta1/RecognitionConfig#AudioEncoding
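To make that concrete, here is a rough sketch of a request along the lines of the REST tutorial linked in the question (the API key, bucket path and file name are placeholders, and the field names follow the v1beta1 RecognitionConfig above, so treat it as an illustration rather than a verified snippet). Convert the recording to 16 kHz FLAC first, upload it to Cloud Storage, then:

request.json:
{
  "config": {
    "encoding": "FLAC",
    "sampleRate": 16000,
    "languageCode": "en-US"
  },
  "audio": {
    "uri": "gs://your-bucket/meeting.flac"
  }
}

$ curl -s -X POST \
    -H "Content-Type: application/json" \
    --data @request.json \
    "https://speech.googleapis.com/v1beta1/speech:syncrecognize?key=${API_KEY}"

If the encoding or sampleRate in the config does not match the actual file, you tend to get an empty result rather than an error, which seems to match the symptom described in the question.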

Answered by Marcin Orlowski, Nov 15 '22 15:11