I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.net/sphinx4/doc/UsingSphinxTrainModels.html ) but it does not help me. What I want is an executable where I can specify which model to use and which audio file to use as source and have the executable print out it's best guess about what the voice on the recording says. I hade some luck with: pocketsphinx_continuous -infile recording.wav 2> /dev/null But it aborts before the complete audio file is transcribed and the default model has waay to few words to create a readable text from the audio. I have compiled and tested the demos in sphinx4 source package but all the examples seem to have to few words and needs a model loke the voxforge one to be useful to me. How can I set this up?

It's very simple to plug in Voxforge acoustic model. The main document covering the API is cmusphinx tutorial: http://cmusphinx.sourceforge.net/wiki/tutorialsphinx4 It's recommended to read it before you start. Please also note that it is recommended to use En_US English Generic acoustic model, it is more accurate than Voxforge. Step by step you need to do the following: <ul> <li>Download voxforge model from sourceforge and unpack it to a folder</li> <li>Checkout sphinx4 from github and build it with gradle</li> <li>Run TranscriberDemo</li> <li>Go to sphinx4-samples/src/main/java/edu/cmu/sphinx/demo/transcriber folder, open Transcriber demo and edit the acoustic model path as below.</li> <li>Edit the location of the audio file in sources if you need another audio file</li> <li>Run demo again and enjoy</li> </ul> That would be it <pre class="prettyprint"><code> // Load model from the folder in your project configuration.setAcousticModelPath("file:voxforge-en-0.4/model_parameters/voxforge_en_sphinx.cd_cont_5000"); </code></pre>

How to use CMU Sphinx 4 for speech to text with english voxforge models

Tags:

java

speech-to-text

cmusphinx

I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.net/sphinx4/doc/UsingSphinxTrainModels.html ) but it does not help me.

What I want is an executable where I can specify which model to use and which audio file to use as source and have the executable print out it's best guess about what the voice on the recording says.

I hade some luck with: pocketsphinx_continuous -infile recording.wav 2> /dev/null

But it aborts before the complete audio file is transcribed and the default model has waay to few words to create a readable text from the audio.

I have compiled and tested the demos in sphinx4 source package but all the examples seem to have to few words and needs a model loke the voxforge one to be useful to me.

How can I set this up?

959

asked Dec 31 '11 00:12

tirithen

1 Answers

It's very simple to plug in Voxforge acoustic model. The main document covering the API is cmusphinx tutorial:

http://cmusphinx.sourceforge.net/wiki/tutorialsphinx4

It's recommended to read it before you start. Please also note that it is recommended to use En_US English Generic acoustic model, it is more accurate than Voxforge.

Step by step you need to do the following:

Download voxforge model from sourceforge and unpack it to a folder
Checkout sphinx4 from github and build it with gradle
Run TranscriberDemo
Go to sphinx4-samples/src/main/java/edu/cmu/sphinx/demo/transcriber folder, open Transcriber demo and edit the acoustic model path as below.
Edit the location of the audio file in sources if you need another audio file
Run demo again and enjoy

That would be it

   // Load model from the folder in your project
   configuration.setAcousticModelPath("file:voxforge-en-0.4/model_parameters/voxforge_en_sphinx.cd_cont_5000");

146

answered Sep 17 '22 01:09

Nikolay Shmyrev

Related questions
                            
                                Getting device/driver information related to a COM port?
                            
                                Keyboard shortcut to run maven goals in Intellij IDEA?
                            
                                Capturing speaker output in Java
                            
                                What is the difference between Java's equals() and C++'s operator ==?
                            
                                Jackson JSON do not wrap attributes of nested object
                            
                                How can I dynamically change the email subject using Log4J SMTPAppender?
                            
                                Online Java GUI Builder? At least Layout Manager
                            
                                Request params and PUT method
                            
                                Overriding a method using type erasure
                            
                                How to make pixel perfect Line2D in - Graphics2D
                            
                                FileChannel.transferTo for large file in windows
                            
                                When using Hibernate ORM should i model first a class diagram or DB diagram?
                            
                                Error symbol not shown in Navigator view
                            
                                How to assemble multimodule maven project into one WAR?
                            
                                When Shutdown Hooks Break Bad
                            
                                Java: pick several different random numbers from array in one time
                            
                                SSHD Java example
                            
                                When to choose JMS API over UDP socket API or vice versa?
                            
                                Java visitor pattern instead of instanceof switch
                            
                                Mac selenium webdriver chrome window always starts with a small window

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With