I've managed to finally build and run pocketsphinx (pocketsphinx_continuous). The problem I'm running into, is how to a improve accuracy. From what I understand, you can specify a dictionary file (-dict test.dic). So I took the default dictionary file and added some more pronunciations of the same words, for example: <pre class="prettyprint"><code>pencil P EH N S AH L pencil(2) P EH N S IH L spaghetti S P AH G EH T IY spaghetti(2) S P UH G EH T IY </code></pre> Yet pocketsphinx still does not recognize either word at all. I know there is a jsgf file you can specify as well , but that seems more for phrases and grammar. How can I get pocketsphinx to recognize common words such as pencil and spaghetti? thanks -Mike

With something like this, you can't be certain, but I can offer the following suggestions: <ol> <li>Perhaps the language model somehow has low probabilities for "spaghetti" and "pencil". As you suggested, you could use a JSGF to test out how it does for recognition if it doesn't use the N-gram models, but instead does a simple grammar (give it like twenty words, including spaghetti and pencil). This way you can see if it is perhaps the language model which makes it difficult to recognize these words, and it can do okay if it considers all the words to have equal probability.</li> <li>Perhaps you simply pronounce these words poorly, even with the alternative dictionary entries. Try either A. Testing other peoples' voices, or B. Adapting the acoustic model to your voice (see http://cmusphinx.sourceforge.net/wiki/tutorialam)</li> <li>Also, what is it recognizing them as when it is failing? If possible, remove the words it misrecognizes as from the dictionary.</li> </ol> Again, for overall accuracy, only three things are going to really help you: restricting the grammar, adapting the accoustic model, and perhaps getting higher quality recording input.

To improve accuracy you may want to try adapting the acoustic model to your voice. http://cmusphinx.sourceforge.net/wiki/tutorialadapt To learn how to add new words: http://ghatage.com/tech/2012/12/13/Make-Pocketsphinx-recognize-new-words/

Pocketsphinx - Adding words and Improving accuracy

Tags:

sphinx

speech-recognition

speech-to-text

I've managed to finally build and run pocketsphinx (pocketsphinx_continuous). The problem I'm running into, is how to a improve accuracy. From what I understand, you can specify a dictionary file (-dict test.dic). So I took the default dictionary file and added some more pronunciations of the same words, for example:

pencil P EH N S AH L
pencil(2) P EH N S IH L

spaghetti S P AH G EH T IY
spaghetti(2) S P UH G EH T IY

Yet pocketsphinx still does not recognize either word at all. I know there is a jsgf file you can specify as well , but that seems more for phrases and grammar. How can I get pocketsphinx to recognize common words such as pencil and spaghetti?

thanks

-Mike

511

asked Dec 26 '10 20:12

Mike6679

2 Answers

With something like this, you can't be certain, but I can offer the following suggestions:

Perhaps the language model somehow has low probabilities for "spaghetti" and "pencil". As you suggested, you could use a JSGF to test out how it does for recognition if it doesn't use the N-gram models, but instead does a simple grammar (give it like twenty words, including spaghetti and pencil). This way you can see if it is perhaps the language model which makes it difficult to recognize these words, and it can do okay if it considers all the words to have equal probability.
Perhaps you simply pronounce these words poorly, even with the alternative dictionary entries. Try either A. Testing other peoples' voices, or B. Adapting the acoustic model to your voice (see http://cmusphinx.sourceforge.net/wiki/tutorialam)
Also, what is it recognizing them as when it is failing? If possible, remove the words it misrecognizes as from the dictionary.

Again, for overall accuracy, only three things are going to really help you: restricting the grammar, adapting the accoustic model, and perhaps getting higher quality recording input.

answered Sep 17 '22 13:09

Jeremy Salwen

To improve accuracy you may want to try adapting the acoustic model to your voice. http://cmusphinx.sourceforge.net/wiki/tutorialadapt

To learn how to add new words: http://ghatage.com/tech/2012/12/13/Make-Pocketsphinx-recognize-new-words/

answered Sep 19 '22 13:09

Anup

Related questions
                            
                                undefined method `next_result' for Mysql2 (rails 3)
                            
                                How to query Sphinx for an exact matching phrase?
                            
                                How to erase a realtime index in Sphinx?
                            
                                Create a filter in Sphinx with text/string value
                            
                                Does Heroku support Thinking Sphinx?
                            
                                SQL - Give me 3 hits for each type only
                            
                                Rake task aborted, undefined method 'indexes' for Thinking Sphinx?
                            
                                Thinking sphinx doesn't start - "Failed to start searchd daemon"
                            
                                Any ideas why Thinking Sphinx Rake tasks are not running?
                            
                                Sphinx Search Engine & Python API
                            
                                How do I add the condition "IS NOT NULL" to a Thinking Sphinx search
                            
                                Popularity decay algorithm for popular website posts
                            
                                Problem in implementing Sphinx API along with Cake php
                            
                                Error with configuring thinking sphinx and flying sphinx
                            
                                Ordering items with matching tags by number of tags that match
                            
                                php mysql fulltext search: lucene, sphinx, or?
                            
                                Sphinx PHP search
                            
                                Sphinx vs. MySql - Search through list of friends (efficiency/speed)
                            
                                How gracefully restart Sphinx search daemon after reindexing

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With