Recently Google released a new "Read it" function of Google Assistant.
How can I try it with Python? I enabled the Google Assistant API in the console and created service account credentials. The documentation does not provide a Python implementation.
I'd like to run a simple "Hello world" example: send a text and get a wav file.
The Google Assistant Service exposes a low level API that lets you directly manipulate the audio bytes of an Assistant request and response. Bindings for this API can be generated for languages like Node.js, Go, C++, Java for all platforms that support gRPC.
Google uses the term "Assistant" somewhat loosely, and a feature that is available on the Assistant for one platform isn't always available on others, even if there is no obvious reason for it not to be.
It may seem that reading long-form content aloud should be available on every Assistant platform that supports audio, but it isn't. At least as of this writing, it seems to be available only on the Assistant for Android. This suggests that it is a feature of that implementation specifically, and not a general Assistant feature.
Also note that the Assistant SDK is officially listed for hobbyist use only. Production hardware with the Assistant seems to use a different development platform that handles other features including streaming audio.
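For the "get a wav file" half of the question, here is a minimal sketch that uses only the standard library. It does not call the Assistant at all: it only shows how raw audio bytes, such as the response bytes the quoted documentation mentions, could be wrapped in a WAV container. The helper name `pcm_chunks_to_wav` is hypothetical, and the defaults (16-bit linear PCM, mono, 16000 Hz) are assumptions that must match whatever output format you actually request.

```python
import wave


def pcm_chunks_to_wav(chunks, path, sample_rate_hz=16000, channels=1, sample_width=2):
    """Write an iterable of raw PCM byte chunks to a WAV file.

    Hypothetical helper: the defaults assume 16-bit (2-byte) linear PCM,
    mono, 16000 Hz. Returns the number of audio bytes written.
    """
    total_bytes = 0
    with wave.open(path, "wb") as wav_file:
        # The WAV header must describe the raw data exactly,
        # otherwise playback is distorted or at the wrong speed.
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)
        wav_file.setframerate(sample_rate_hz)
        for chunk in chunks:
            wav_file.writeframes(chunk)
            total_bytes += len(chunk)
    return total_bytes
```

Usage would look like `pcm_chunks_to_wav(audio_chunks, "reply.wav")`, where `audio_chunks` is whatever iterable of `bytes` you collect from the streamed response. Taking an iterable rather than one big buffer means the chunks can be written as they arrive.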