I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make Purchase order", "Open END-OF-DAY reports", etc. My Question : I want to know if we can we use Mozilla DeepSpeech to take .wav audio from a Firefox browser and return speech to text. If yes, what will be the flow from recording voice from Firefox using mic TO convert text using the DeepSpeech engine? How to make wakeup/launch call similar to OK-GOOGLE that will be ready to listen for commands?

You can achieve that by creating a server and sending requests back and forth using assinchronious requests/AJAX or web sockets. You can find Server installation instructions using the link below: https://pypi.org/project/deepspeech-server/ After you have installed the server you can start making requests from any browser that supports "WebRTC API: getUserMedia()". Generate audio Blob data and send it in base64 format to the backend server. On the backend, save the blob to a temporary audio file: <pre class="prettyprint"><code>$encodedData = base64_decode($data); // write the data out to the file $fp = fopen($full_file_path, 'wb'); fwrite($fp, $encodedData); fclose($fp); </code></pre> Then convert audio file to text by making CURL request to your own Mozzila DeepSpeech Node.js server: <pre class="prettyprint"><code>curl -X POST --data-binary @testfile.wav http://localhost:8080/stt </code></pre> Create methods on your backend to loop through generated text and try to identify keywords/commands. If triggered send it back to the front end. Perhaps you just want to grant users ability to write long messages using their speech? - Return the whole text back - every time. You do however still want to "listen" to the keywords, in order to give users ability to set punctuation, start and finish writing. Happy coding everyone ;)

How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?

Tags:

php

speech-recognition

speech-to-text

webspeech-api

mozilla-deepspeech

I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make Purchase order", "Open END-OF-DAY reports", etc.

My Question :

I want to know if we can we use Mozilla DeepSpeech to take .wav audio from a Firefox browser and return speech to text. If yes, what will be the flow from recording voice from Firefox using mic TO convert text using the DeepSpeech engine?

How to make wakeup/launch call similar to OK-GOOGLE that will be ready to listen for commands?

221

asked May 29 '18 10:05

Priyesh

1 Answers

You can achieve that by creating a server and sending requests back and forth using assinchronious requests/AJAX or web sockets.

You can find Server installation instructions using the link below:

https://pypi.org/project/deepspeech-server/

After you have installed the server you can start making requests from any browser that supports "WebRTC API: getUserMedia()". Generate audio Blob data and send it in base64 format to the backend server. On the backend, save the blob to a temporary audio file:

$encodedData = base64_decode($data); 

// write the data out to the file
$fp = fopen($full_file_path, 'wb');
      fwrite($fp, $encodedData);
      fclose($fp);

Then convert audio file to text by making CURL request to your own Mozzila DeepSpeech Node.js server:

curl -X POST --data-binary @testfile.wav http://localhost:8080/stt

Create methods on your backend to loop through generated text and try to identify keywords/commands. If triggered send it back to the front end. Perhaps you just want to grant users ability to write long messages using their speech? - Return the whole text back - every time. You do however still want to "listen" to the keywords, in order to give users ability to set punctuation, start and finish writing.

Happy coding everyone ;)

155

answered Oct 20 '22 00:10

SergeDirect

Related questions
                            
                                PHP to EasyPHP MySQL server 1 second connection delay
                            
                                Convert String To date in PHP
                            
                                Convert Number to Words in Indian currency format with paise value [duplicate]
                            
                                How do you register a namespace with Silex autoloader
                            
                                Find last character in a string in PHP
                            
                                Emptying a file with php [duplicate]
                            
                                Custom headers for WebSocket JS
                            
                                4GB HTTP File Uploads Using jQuery-File-Upload, Apache and PHP
                            
                                Does PHP set memory limits on arrays?
                            
                                No query results for model [App\Models\Match]
                            
                                Hindi language not displaying correctly on tcpdf
                            
                                php-resque job perform function not executed?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With