The Web Speech API specification says:
text attribute
This attribute specifies the text to be synthesized and spoken for this utterance. This may be either plain text or a complete, well-formed SSML document. For speech synthesis engines that do not support SSML, or only support certain tags, the user agent or speech engine must strip away the tags they do not support and speak the text.
It does not provide an example of using text with an SSML document.
I tried the following in Chrome 33:
var msg = new SpeechSynthesisUtterance();
msg.text = '<?xml version="1.0"?>\r\n<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">ABCD</speak>';
speechSynthesis.speak(msg);
It did not work -- the voice attempted to narrate the XML tags. Is this code valid?
Do I have to provide an XMLDocument object instead?
I am trying to understand whether Chrome violates the specification (which should be reported as a bug), or whether my code is invalid.
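(Aside: since the spec says engines that don't support SSML should strip the tags, one workaround I have considered is stripping them myself before speaking. The sketch below is only an illustration of that idea; the ssmlToPlainText helper is my own name, not part of any API, and it simply discards the markup rather than honouring it.)
// Hypothetical workaround (not part of the Web Speech API): strip the SSML
// markup with DOMParser so engines that would otherwise narrate the tags
// only receive plain text. This loses the SSML semantics entirely.
function ssmlToPlainText(ssml) {
  var doc = new DOMParser().parseFromString(ssml, 'application/xml');
  // textContent concatenates the character data and drops the element tags.
  return doc.documentElement ? doc.documentElement.textContent : ssml;
}

var msg = new SpeechSynthesisUtterance();
msg.text = ssmlToPlainText('<?xml version="1.0"?>\r\n<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">ABCD</speak>');
speechSynthesis.speak(msg); // speaks "ABCD" instead of narrating the tags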
In Chrome 46, the XML is interpreted properly as an XML document, on Windows, when the language is set to en; however, I see no evidence that the tags are actually doing anything. I heard no difference between the <emphasis> and non-<emphasis> versions of this SSML:
var msg = new SpeechSynthesisUtterance();
msg.text = '<?xml version="1.0"?>\r\n<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US"><emphasis>Welcome</emphasis> to the Bird Seed Emporium. Welcome to the Bird Seed Emporium.</speak>';
msg.lang = 'en';
speechSynthesis.speak(msg);
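(Since the tags are ignored, the only knobs Chrome actually exposes are the utterance-level rate, pitch and volume properties. As a rough, purely illustrative approximation of <emphasis>, you can split the text into separate utterances and queue them with different settings; the splitting below is hand-rolled, not anything the API does for you.)
// Rough approximation only: emulate <emphasis>Welcome</emphasis> by queuing
// the emphasized word as its own utterance with a slower rate and higher pitch.
var parts = [
  { text: 'Welcome', rate: 0.8, pitch: 1.4 },                       // "emphasized"
  { text: 'to the Bird Seed Emporium.', rate: 1.0, pitch: 1.0 },
  { text: 'Welcome to the Bird Seed Emporium.', rate: 1.0, pitch: 1.0 }
];
parts.forEach(function (part) {
  var u = new SpeechSynthesisUtterance(part.text);
  u.lang = 'en';
  u.rate = part.rate;   // allowed range 0.1–10, default 1
  u.pitch = part.pitch; // allowed range 0–2, default 1
  speechSynthesis.speak(u); // utterances are queued and spoken in order
});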
The <phoneme> tag was also completely ignored, which made my attempt to speak IPA fail.
var msg = new SpeechSynthesisUtterance();
msg.text='<?xml version="1.0" encoding="ISO-8859-1"?> <speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.w3.org/2001/10/synthesis http://www.w3.org/TR/speech-synthesis/synthesis.xsd" xml:lang="en-US"> Pavlova is a meringue-based dessert named after the Russian ballerina Anna Pavlova. It is a meringue cake with a crisp crust and soft, light inside, usually topped with fruit and, optionally, whipped cream. The name is pronounced <phoneme alphabet="ipa" ph="pævˈloʊvə">...</phoneme> or <phoneme alphabet="ipa" ph="pɑːvˈloʊvə">...</phoneme>, unlike the name of the dancer, which was <phoneme alphabet="ipa" ph="ˈpɑːvləvə">...</phoneme> </speak>';
msg.lang = 'en';
speechSynthesis.speak(msg);
This is despite the fact that the Microsoft speech API does handle SSML correctly. Here is a C# snippet, suitable for use in LINQPad:
// LINQPad: reference the System.Speech assembly and import the
// System.Speech.Synthesis and System.Text.RegularExpressions namespaces
// (F4 → query properties). .Dump() is LINQPad's output helper.
var str = "Pavlova is a meringue-based dessert named after the Russian ballerina Anna Pavlova. It is a meringue cake with a crisp crust and soft, light inside, usually topped with fruit and, optionally, whipped cream. The name is pronounced /pævˈloʊvə/ or /pɑːvˈloʊvə/, unlike the name of the dancer, which was /ˈpɑːvləvə/.";
var regex = new Regex("/([^/]+)/");
if (regex.IsMatch(str))
{
    // Wrap each /.../ IPA transcription in an SSML <phoneme> tag.
    str = regex.Replace(str, "<phoneme alphabet=\"ipa\" ph=\"$1\">word</phoneme>");
    str.Dump();
}
SpeechSynthesizer synth = new SpeechSynthesizer();
PromptBuilder pb = new PromptBuilder();
pb.AppendSsmlMarkup(str);
synth.Speak(pb);
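If building the SSML string by hand feels fragile, PromptBuilder also exposes an AppendTextWithPronunciation method that takes the display text and an IPA string directly, which should achieve the same <phoneme> effect without the regex substitution; the approach above just keeps the original prose intact.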