I need an API or library (preferably free) that will convert speech captured from a microphone into text (a string).
Additionally, I will need an API or library that can do text-to-speech.
I'd like to use C# and .NET, but other languages will suffice.
Thanks.
In Windows 11, dictation has been updated and renamed as voice typing. Like dictation, voice typing uses online speech recognition technologies to power its speech-to-text transcription service. You no longer need to turn on the Online Speech recognition setting to use voice typing.
You can use CMU Sphinx. It is an open-source and fairly scalable solution, and I think it can be used on both the client and the server side:
http://cmusphinx.sourceforge.net/
If you are looking for a Microsoft desktop solution, you can use SAPI (the Speech API), which covers both speech recognition and text-to-speech:
http://msdn.microsoft.com/en-us/magazine/cc163663.aspx
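
To give a rough idea of what the SAPI route looks like from C#, here is a minimal sketch using the managed System.Speech classes. It assumes Windows, the .NET Framework with a reference to the System.Speech assembly, a working default microphone and speakers, and an installed recognizer for your language. The class name SpeechDemo, the prompt text, and the 10-second timeout are only illustrative choices.

```csharp
using System;
using System.Speech.Recognition;
using System.Speech.Synthesis;

class SpeechDemo
{
    static void Main()
    {
        // Text-to-speech: speak a prompt through the default audio device.
        using (var synth = new SpeechSynthesizer())
        {
            synth.SetOutputToDefaultAudioDevice();
            synth.Speak("Please say something now.");
        }

        // Speech-to-text: free-form dictation from the default microphone.
        using (var recognizer = new SpeechRecognitionEngine())
        {
            recognizer.LoadGrammar(new DictationGrammar());
            recognizer.SetInputToDefaultAudioDevice();

            // Blocks until a phrase is recognized; returns null if no speech
            // starts within the initial silence timeout (10 s, chosen arbitrarily).
            RecognitionResult result = recognizer.Recognize(TimeSpan.FromSeconds(10));

            string text = result != null ? result.Text : string.Empty;
            Console.WriteLine("Recognized: " + text);
        }
    }
}
```

Dictation accuracy with the built-in recognizer varies a lot by microphone and speaker. If you only need a fixed set of commands, loading a Grammar built from a Choices list instead of DictationGrammar usually gives more reliable results.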
On the server side, you can use Microsoft Unified Communications, but take the licensing costs into account as well:
http://www.microsoft.com/uc/en/gb/default.aspx
Update:
This thread also has some good references:
C# Speech Recognition - Is this what the user said?