Speech recognition API duplicated phrases on Android

Tags:

I found, that speech recognition API duplicates result phrases on my Android (and does not duplicate on desktop).

For each phrase said, it returns two results. First one is

enter image description here

and the second one is

enter image description here

As you see, in the second return, phrase is duplicated, each copy is marked as final and second one is beyond resultIndex. In first return there is only one copy, it is final and it is beyond resultIndex.

I would take only second return, but the problem is that it happens on mobile Chrome, but does not happen on desktop Chrome. Desktop Chrome returns only first return.

So, the question is: is this by design behavior? Then how to distinguish single final phrase then commonly for all computers?

Or may be this is some error like sound echo, then the question is how to avoid/check echo?

UPDATE

Html is follows:

<input id="recbutton" type="button" value="Recognize">
<div id="output">

  <div>
    Initial text
  </div>

</div>

Code is follows:

var recognition = null;
var recognitionStarted = false;
var printcount = 1;
var lastPhrase = null;

$(function() {
  attachRecognition();
});

$('#recbutton').click( function() {
    if( !recognitionStarted ) {
    recognition.start();
  }
  else {
    recognition.stop();
  }
});

function printOut(text) {
    var id = 'printcount' + printcount;
  printcount++;

    $('#output').append(
    "<div id='" + printcount + "'>" + text + "</div>"
  );

    $("#output").animate({ scrollTop: $("#output").prop('scrollHeight')});

  return printcount;

}


function attachRecognition() {

  if (!('webkitSpeechRecognition' in window)) {

    $('button').prop('disabled', true);

    recognition = null;

  } else {
    $('button').prop('disabled', false);

    recognition = new webkitSpeechRecognition();

    recognition.continuous = true;
    recognition.interimResults = true;
    recognition.lang = "en-US";

    recognition.onstart = function(event) {
      recognitionStarted = true;
      printOut("speech recognition started");
    };

    recognition.onend = function(event) {
            recognitionStarted = false;
            printOut("speech recognition stopped");
    };

    recognition.onresult = function(event) {

      var finalPhrase = '';
      var interimPhrase = '';
      var result;
      var printcount;

      for(var i=0; i<event.results.length; ++i) {
        result = event.results[i];
        if( result.isFinal ) {
          finalPhrase = finalPhrase.trim() + ' ' + result[0].transcript;
        }
        else {
          interimPhrase = interimPhrase.trim() + ' ' + result[0].transcript;
        }
      }

      if( !lastPhrase ) {
        printcount = printOut('');
        lastPhrase = $('#' + printcount);
      }

      lastPhrase.html(finalPhrase.trim() + ' ' + interimPhrase.trim());

      if( finalPhrase.trim() ) {
        lastPhrase = null;
      }


    };
  }
}

JsFiddle: https://jsfiddle.net/dimskraft/envwao8o/1/

460

asked Jan 31 '16 10:01

Dims

1 Answers

The results provided on Chrome mobile regarding the result.isFinal property seem to have a bug or in any case to differ from the ones on Chrome desktop. A possible workaround is to check the confidence attribute of the (first) alternative:

onResultHandler(event) {
    let i = event.resultIndex;
    let result = event.results[i];
    let isFinal = result.isFinal && (result[0].confidence > 0);
}

It also looks like that sometimes the final result is emitted twice (with the same confidence value), in that case you may want to debounce it or just process the first event, like this:

if (isFinal) {
    transcript = result[0].transcript;

    if(transcript == lastDebounceTranscript) {
        return;
    }

    lastDebounceTranscript = transcript;

}

where lastDebounceTranscript is a variable that you initialize outside of the scope of the event handler

142

answered Sep 23 '22 18:09

u.dev

Related questions
                            
                                how to install node js canvas on windows
                            
                                How do you run an xPath query in IE11?
                            
                                Get original transcluded content in Angular directive
                            
                                How to set cookies for two-letter domains in IE8?
                            
                                Show a pdf stream in a new window
                            
                                How to implement JavaScriptCore debugger?
                            
                                Line number of SyntaxError in Node.js
                            
                                Idiomatic way to cache computed values based on the state in React?
                            
                                Large "idle" bars in Chrome dev tools Frames Timeline
                            
                                Meaning of HTML <a> element with href="javascript:;" [duplicate]
                            
                                In Firefox and IE how can change the cursor while dragging over different targets?
                            
                                Jasmine spies callThrough and callFake
                            
                                Intercept browser requests for resources
                            
                                How to detect if a mobile device is emulated by Google Chrome? [closed]
                            
                                OPTIONS 405 (Method Not Allowed)
                            
                                Building array of objects from parsed csv files in node
                            
                                Creating a Set With Soundcloud's API
                            
                                Chrome extension permission for "about:blank" page
                            
                                Can I use triple equals for JavaScript string comparison?
                            
                                Babel v6: How/Can I write a plugin that adds a new syntax (ie a new operator)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Speech recognition API duplicated phrases on Android

Tags:

javascript

android

google-chrome

speech-recognition

webkitspeechrecognition

Dims

People also ask

1 Answers

u.dev

Recent Activity

Donate For Us