NaturalReader, Speechify, and Amazon Polly have the most lifelike human-sounding voices of all text-to-speech applications. Polly's Neural Text-to-Speech (NTTS) makes it a leading choice, with Speechify coming in close behind.
Google today open-sourced the speech engine that powers its Android speech recognition transcription tool Live Transcribe. The company hopes doing so will let any developer deliver captions for long-form conversations. The source code is available now on GitHub. Google released Live Transcribe in February.
Text-to-speech (TTS) refers to the ability of computers to read text aloud. A TTS engine converts written text to a phonemic representation, then converts the phonemic representation to waveforms that can be output as sound.
I think you'd find Flite an excellent project. It is widely used, open source, and multi platform. Developed at CMU, it has more rigor than a random developer's project.
However, it is written entirely in C for portability to mobile objects, perhaps reducing its academic value to you, if that is what you are after (looking for C++)
If you are not interested in a C library, that same site has a link to Festival, a C++ library from University of Edinburgh that is much more robust. However, it is much less portable and you'll have to work at it to get it working in your environment I'm sure.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With