Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to implement speech recognition and text-to-speech in C++?

I want to know about various techniques to do speech recognition and text to speech conversion. Also please let me know about any resources like links, tutorials ,ebooks etc. on it.

Which is the most efficient technique to achieve it ?

like image 422
Ashish Avatar asked Dec 22 '09 17:12

Ashish


2 Answers

I'm going to answer the part about speech recognition (since I don't know much about text-to-speech):

http://ecx.images-amazon.com/images/I/4190SZC61CL._BO2,204,203,200_PIsitb-sticker-arrow-click,TopRight,35,-76_AA240_SH20_OU01_.jpg

This book, "Statistical Methods for Speech Recognition" is a classic that explains the mathematical foundations of statistical speech recognition, written by the founder of that area, Frederick Jelinek.

The most important concept you have to know is Hidden Markov Models. People have been using them in speech recognition for decades. A recent approach uses Conditional Random Fields, see the paper (PDF) and the associated software toolkit SCARF.

It is fairly hard to write your own speech recognizer. It's an active research area with several scientific conferences, e.g. ASRU, Interspeech, ICASSP.

like image 109
Frank Avatar answered Oct 12 '22 12:10

Frank


Both are very wide areas. About recognition: In this this schema you will find how to build a basic automatic speech recognition system. It isn't by any means close to the start of the art, but it is something achievable and it works. If you want to do something more advanced, read about cepstral coefficients and Hidden Markov Models. Have a look into HTK, it is a widely used toolkit for Hidden Markov Models.

About text to speech: I'd have a look at Festival.

like image 37
nacmartin Avatar answered Oct 12 '22 12:10

nacmartin