Looking for a library (with Java or Python APIs) that converts speech to text. 100% accuracy is not an absolute requirement because I just need to run some experiments for a prototype. Ideally it should accept an input file (e.g., .wav) and return the output as text.
Thanks.
FLAC (for some systems) If using Windows (x86 or x86-64), OS X (Intel Macs only, OS X 10.6 or higher), or Linux (x86 or x86-64), this is already bundled with this library - you do not need to install anything.
Translation of Speech to Text: First, we need to import the library and then initialize it using init() function. This function may take 2 arguments. After initialization, we will make the program speak the text using say() function. This method may also take 2 arguments.
The speech recognition software breaks the speech down into bits it can interpret, converts it into a digital format, and analyzes the pieces of content. It then makes determinations based on previous data and common speech patterns, making hypotheses about what the user is saying.
You can use the Sphinx like kdazzle has suggested for you or you can also check out other java implementation here.
For python library, check out pyspeech or dragonfly. If the library can output the text, I think the library should be possible to print out the text into a file.
Sphinx is pretty good. It's made by the folks at Carnegie Mellon.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With