Live recognition with Python and Pocketsphinx

Tags:

python

cmusphinx

I have recently been working with Pocketsphinx in Python. I have successfully got the example below to work, recognising a recorded WAV file.

#!/usr/bin/env python

import sys, os


def decodeSpeech(hmmd, lmdir, dictp, wavfile):
    """
    Decodes a speech file
    """
    try:
        import pocketsphinx as ps
        import sphinxbase
    except ImportError:
        print("""Pocketsphinx and sphinxbase are not installed
        in your system. Please install them with your package manager.
        """)
        # Re-raise: nothing below can work without the bindings.
        raise

    speechRec = ps.Decoder(hmm=hmmd, lm=lmdir, dict=dictp)
    wavFile = open(wavfile, 'rb')
    wavFile.seek(44)  # skip the canonical 44-byte WAV header
    speechRec.decode_raw(wavFile)
    result = speechRec.get_hyp()

    return result[0]


if __name__ == "__main__":
    hmdir = "/home/jaganadhg/Desktop/Docs_New/kgisl/model/hmm/wsj1"
    lmd = "/home/jaganadhg/Desktop/Docs_New/kgisl/model/lm/wsj/wlist5o.3e-7.vp.tg.lm.DMP"
    dictd = "/home/jaganadhg/Desktop/Docs_New/kgisl/model/lm/wsj/wlist5o.dic"
    wavfile = "/home/jaganadhg/Desktop/Docs_New/kgisl/sa1.wav"
    recognised = decodeSpeech(hmdir, lmd, dictd, wavfile)
    print("%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%")
    print(recognised)
    print("%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%")
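A side note on the hard-coded seek(44): it assumes the file has the canonical 44-byte PCM WAV header, which is not guaranteed for every WAV file. As a rough sketch only, the standard-library wave module can read the audio frames without relying on that offset. The helper name read_pcm_frames is hypothetical and is not part of pocketsphinx; how the returned bytes are then passed to the decoder depends on the pocketsphinx version in use.

```python
import wave


def read_pcm_frames(wav_path):
    """Return (sample_rate, channels, sample_width, raw_bytes) for a PCM WAV file.

    Hypothetical helper for illustration; it only uses the stdlib wave module.
    """
    wav = wave.open(wav_path, 'rb')
    try:
        rate = wav.getframerate()      # samples per second, e.g. 16000
        channels = wav.getnchannels()  # 1 for mono
        width = wav.getsampwidth()     # bytes per sample, 2 for 16-bit audio
        frames = wav.readframes(wav.getnframes())  # audio data without the header
    finally:
        wav.close()
    return rate, channels, width, frames
```

Checking the returned rate and channel count against what the acoustic model expects (16 kHz mono is common) is a cheap way to catch mismatched recordings before decoding.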

The problem is: how can I do real-time speech recognition from a microphone? Ideally it would run in a while loop with an if statement, so that a function is called whenever a set word is recognised from the microphone.


asked Jan 13 '13 20:01

Tonderai Ratisai

1 Answer

The code for realtime recognition looks like this:

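from os import path

import pyaudio
# Assumption: the older pocketsphinx-python bindings, which provide get_in_speech().
from pocketsphinx.pocketsphinx import Decoder

# Assumption: MODELDIR is the directory holding the pocketsphinx models;
# adjust it for your installation.
MODELDIR = 'pocketsphinx/model'

config = Decoder.default_config()
config.set_string('-hmm', path.join(MODELDIR, 'en-us/en-us'))
config.set_string('-lm', path.join(MODELDIR, 'en-us/en-us.lm.bin'))
config.set_string('-dict', path.join(MODELDIR, 'en-us/cmudict-en-us.dict'))
config.set_string('-logfn', '/dev/null')
decoder = Decoder(config)

p = pyaudio.PyAudio()
stream = p.open(format=pyaudio.paInt16, channels=1, rate=16000, input=True, frames_per_buffer=1024)
stream.start_stream()

in_speech_bf = False
decoder.start_utt()
while True:
    buf = stream.read(1024)
    if buf:
        decoder.process_raw(buf, False, False)
        # React only when the voice-activity state changes.
        if decoder.get_in_speech() != in_speech_bf:
            in_speech_bf = decoder.get_in_speech()
            if not in_speech_bf:
                # Speech has just ended: close the utterance and report it.
                decoder.end_utt()
                hyp = decoder.hyp()
                if hyp is not None:  # hyp() returns None when nothing was recognised
                    print('Result: ' + hyp.hypstr)
                decoder.start_utt()
    else:
        break
decoder.end_utt()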

You can also use the GStreamer Python bindings for pocketsphinx; see livedemo.py.
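The question also asks how to call a function when a set word is recognised. One possible way, sketched here under assumptions, is to pass each hypothesis string through a small dispatcher instead of only printing it. The name dispatch_keyword and the handlers mapping are hypothetical and not part of pocketsphinx; the sketch matches single words only, not multi-word phrases.

```python
def dispatch_keyword(hypothesis, handlers):
    """Call the handler of the first keyword in `handlers` that occurs as a
    whole word in `hypothesis`.

    `handlers` maps lowercase single words to zero-argument callables.
    Returns the matched keyword, or None if nothing matched.
    """
    if not hypothesis:
        return None
    words = hypothesis.lower().split()
    for keyword, handler in handlers.items():
        if keyword in words:
            handler()
            return keyword
    return None
```

In the recognition loop, the line that prints the result could then be replaced by a call such as dispatch_keyword(hyp.hypstr, {'lights': toggle_lights}), where toggle_lights stands for whatever function you want to trigger.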

answered Sep 29 '22 02:09

Nikolay Shmyrev
