Need text to speech and speech recognition tools for Linux

Tags:

I'm planning on writing a program for Linux that uses text to speech and speech recognition. What are the best tools/libraries for this? Should I use Windows instead to be able to use better tools? The tools need to be easily callable from a console or C program.

287

asked May 18 '09 12:05

Cory Walker

2 Answers

For speech recognition there are the various Sphinxes. The different variants have different pros and cons, there is a comparison here Comparison of Sphinx versions. Sphinx 4 is Java, but the others are C, I believe.

199

answered Sep 20 '22 15:09

Matt G

It depends quite a bit on what speech you are trying to recognize.

This is an article from 2005 that explains some of the difficulties in creating a dictation program: http://www.cs.cmu.edu/~archan/personal/whyNoOpenSourceDictationDraft4.html . If you want that, the Julius speech recognition engine seems promising, but you will need to add your own acoustic and language models. You might be able to use the voxforge acoustic model.

If you are not trying to write a dictation program then you have a much easier task. Command programs have limited vocabularies, for example 'If you would like to continue in English, say "English"'.

I was able to get pretty good results using pocketsphinx and gstreamer to make a program that automatically edits most occurrences of the word "twitter" out of the TWiT podcast. It didn't work at all until I used my own language model based on transcripts of the podcast; the machine transcriptions from the speech recognizer are useless/hilarious but they do an okay job of finding the keyword.

answered Sep 16 '22 15:09

joeforker

Related questions
                            
                                Xkb: How to convert a keycode to keysym
                            
                                How to add a cron job in linux [closed]
                            
                                Connecting to a protected WiFi from Python on Linux
                            
                                How to do password authentication for a user using LDAP?
                            
                                How to release hugepages from the crashed application
                            
                                JavaFX missing from JDK 1.7/1.8 in Linux?
                            
                                What are the bounds of the heap?
                            
                                What is the maximum number of subdirectories allowed in Ext4? [closed]
                            
                                linux GNU getopt: ignore unknown optional arguments?
                            
                                Socat terminates after connection close
                            
                                maven-resources-plugin:2.6 - Cannot create resource output directory
                            
                                How do I change users in FileZilla?
                            
                                Linux: Cannot allocate more than 32 GB/64 GB of memory in a single process due to virtual memory limit
                            
                                Why is _init from glibc's csu/init-first.c called before _start even if _start is the ELF entry point?
                            
                                Understanding the msghdr structure from sys/socket.h
                            
                                Get the pid of a running playbook for use within the playbook
                            
                                customizing completion of GtkComboBoxText
                            
                                Why doesn't time() from time.h have a syscall to sys_time?
                            
                                Tips for optimizing an sqlite database with over a gig of data in it? [closed]
                            
                                What's best way to secure a database connection string?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Need text to speech and speech recognition tools for Linux

Tags:

linux

speech-recognition

text-to-speech

Cory Walker

People also ask

2 Answers

Matt G

joeforker

Recent Activity

Donate For Us