Accuracy of MS System.Speech.Recognizer and the SpeechRecognitionEngine

Tags:

I am currently testing the SpeechRecognitionEngine by loading from an xml file a pretty simple rule. In fact it is a simple between ("decrypt the email", "remove encryption") or ("encrypt the email", "add encryption").

I have trained my Windows 7 PC and additionally added the words encrypt and decrypt as I realize they are very similar. The recognizer already has a problem with making a difference between these two.

The issue I am having is that it recognizes things too often. I have set the confidence to 0.93 because with my voice in a quiet room when saying the exact words sometimes only gets to 0.93. But then if I turn on the radio the voice of the announcer or a song can mean that this recognizer thinks it has heard with over 0.93 confidence with words "decrpyt the email".

Maybe Lady Gaga is backmasking Applause to secretly decrypt emails :-)

Can anyone help in working out how to do something to make this recognizer workable.

In fact the recognizer is also picking up keyboard noise as "decrypt the email". I don't understand how this is possible.

Further to my editing buddy there are at least two managed namespaces for MS Speech Microsoft.Speech and System.Speech - It is important for this question that it be know that it is System.Speech.

799

asked Sep 16 '13 06:09

darbid

1 Answers

If the only thing the System.Speech recognizer is listening for is "encrypt the email", then the recognizer will generate lots of false positives. (Particularly in a noisy environment.) If you add a DictationGrammar (particularly a pronunciation grammar) in parallel, the DictationGrammar will pick up the noise, and you can check the (e.g.) name of the grammar in the event handler to discard the bogus recognitions.

A (subset) example:

    static void Main(string[] args)
    {
        Choices gb = new Choices();
        gb.Add("encrypt the document");
        gb.Add("decrypt the document");
        Grammar commands = new Grammar(gb);
        commands.Name = "commands";
        DictationGrammar dg = new DictationGrammar("grammar:dictation#pronunciation");
        dg.Name = "Random";
        using (SpeechRecognitionEngine recoEngine = new SpeechRecognitionEngine(new CultureInfo("en-US")))
        {
        recoEngine.SetInputToDefaultAudioDevice();
        recoEngine.LoadGrammar(commands);
        recoEngine.LoadGrammar(dg);
        recoEngine.RecognizeCompleted += recoEngine_RecognizeCompleted;
        recoEngine.RecognizeAsync();

        System.Console.ReadKey(true);
        recoEngine.RecognizeAsyncStop();
        }
    }

    static void recoEngine_RecognizeCompleted(object sender, RecognizeCompletedEventArgs e)
    {
        if (e.Result.Grammar.Name != "Random")
        {
            System.Console.WriteLine(e.Result.Text);
        }
    }

129

answered Sep 30 '22 19:09

Eric Brown

Related questions
                            
                                Constructor chaining passing computed values for parameters
                            
                                How can disable redirection on win64
                            
                                Cast List<object> to AnonymousTypes list
                            
                                Automatically resize TableLayoutPanel row when window is resized
                            
                                Type 'T' is not awaitable
                            
                                C#, JSON Parsing, dynamic variable. How to check type?
                            
                                SqlParameter does not allows Table name - other options without sql injection attack?
                            
                                Simulate a keypress for X seconds
                            
                                Invalid index 6 for this SqlParameterCollection with Count= 6
                            
                                Using String.Replace() with an index instead of a string for the argument?
                            
                                which method is first called when a xaml page is loaded in windows phone by default?
                            
                                Hide Taskbar in Windows 8
                            
                                How to ignore a class property using Dapper.net Extensions?
                            
                                Uri(Uri, String) constructor does not works properly?
                            
                                Webdeploy permission issue
                            
                                C# Tuple versus List Considerations
                            
                                Why "Equals" method resolution with generics differs from explicit calls
                            
                                Creating a dynamic query using IQueryable
                            
                                Is it possible to catch exception thrown from base class constructor inside derived class constructor
                            
                                Regex Match Collection multiple matches

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Accuracy of MS System.Speech.Recognizer and the SpeechRecognitionEngine

Tags:

c#

.net

vb.net

speech-recognition

darbid

People also ask

1 Answers

Eric Brown

Recent Activity

Donate For Us