Is there any software out there capable of taking audio files and outputting phonological (IPA) text?
I understand that most software transcribes straight into the written form of a specific language, but is there any that is 'teachable'?
IBM Watson Speech to Text is a cloud-based speech-to-text service. It can transcribe in real time, and it can also take multiple uploaded audio files and then transcribe and translate them collectively.
With voice typing, you can enter text on your PC by speaking. Voice typing uses online speech recognition, which is powered by Azure Speech services.
There are two main ways to convert voice recordings to text documents: AI transcription or human transcription services. AI transcription is generally quicker and more affordable, but less accurate than human transcription.
Speech recognition technology allows computers to take spoken audio, interpret it and generate text from it.
CMU Sphinx might be able to do what you want. There are a few different versions, but the one I'm familiar with is Sphinx 3. In the FAQ it says you can get phone segmentations by making your "words" be individual phones (they're not IPA, though).
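Since the phone labels that come out aren't IPA, you would need a mapping step afterwards. Below is a minimal sketch of that step, assuming the labels are the ARPAbet-style symbols used by the CMU Pronouncing Dictionary (check the phone set of whatever acoustic model you actually use). The function name `arpabet_to_ipa`, the silence labels in `NON_SPEECH`, and the exact IPA symbol chosen for each phone are my own choices for illustration, not something Sphinx provides.

```python
# Assumed phone set: the 39 ARPAbet-style symbols of the CMU Pronouncing Dictionary.
ARPABET_TO_IPA = {
    "AA": "ɑ", "AE": "æ", "AH": "ʌ", "AO": "ɔ", "AW": "aʊ", "AY": "aɪ",
    "B": "b", "CH": "tʃ", "D": "d", "DH": "ð", "EH": "ɛ", "ER": "ɝ",
    "EY": "eɪ", "F": "f", "G": "ɡ", "HH": "h", "IH": "ɪ", "IY": "i",
    "JH": "dʒ", "K": "k", "L": "l", "M": "m", "N": "n", "NG": "ŋ",
    "OW": "oʊ", "OY": "ɔɪ", "P": "p", "R": "ɹ", "S": "s", "SH": "ʃ",
    "T": "t", "TH": "θ", "UH": "ʊ", "UW": "u", "V": "v", "W": "w",
    "Y": "j", "Z": "z", "ZH": "ʒ",
}

# Hypothetical silence/filler labels; adjust to what your recognizer emits.
NON_SPEECH = {"SIL", "<SIL>", "<S>", "</S>"}


def arpabet_to_ipa(phones):
    """Convert a sequence of ARPAbet-style phone labels to an IPA string."""
    ipa = []
    for phone in phones:
        label = phone.upper()
        if label in NON_SPEECH:
            continue  # drop silence and sentence markers
        if label == "AH0":
            ipa.append("ə")  # unstressed AH is conventionally written as schwa
            continue
        base = label.rstrip("012")  # strip CMUdict stress digits, if present
        # Unknown labels are kept visibly in brackets rather than guessed.
        ipa.append(ARPABET_TO_IPA.get(base, f"[{phone}]"))
    return "".join(ipa)


if __name__ == "__main__":
    print(arpabet_to_ipa(["SIL", "HH", "AH0", "L", "OW1", "SIL"]))
```

This only relabels what the recognizer already decided; it is a broad phonemic mapping for English, so it won't recover any phonetic detail the acoustic model doesn't distinguish.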