Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to identify speaker from voice pattern using Microsoft Speech?

Tags:

c#

speech

I'm using Microsoft Speech C# API for Home Automation commands

I'd like to know if there is a way or built-in C# method to hash Voice Input and recognize who's speaking. If it is Alice or Bob to say "Hello Alice" or "Hello Bob".

EDIT:

Microsoft Speech API can provides a .wav of the recording. It might be able to hash, process, ... to understand who's speaking:

  • Loud voice, slow modulation, ... => Bob
  • High voice, fast modulation, ... => Alice
like image 326
Jean-Philippe Encausse Avatar asked Aug 03 '12 12:08

Jean-Philippe Encausse


2 Answers

Speaker recognition is a hard problem and is still an active research area. I don't think Microsoft speech api has any speaker recognition support, but not 100% sure.

I found the following article really helpful while researching the topic. It introduces the subject and also provides a very crude implementation. Probably a good place to start.

http://www.ibm.com/developerworks/opensource/library/os-sndpeek/index.html

like image 131
Faisal Mansoor Avatar answered Sep 19 '22 17:09

Faisal Mansoor


You can use Microsoft Speaker Recognition APIs for doing this task: https://www.microsoft.com/cognitive-services/en-us/speaker-recognition-api

Microsoft is providing two APIs for this purpose: Speaker Verification & Speaker Identification.

You can find their C# & Python SDKs here: https://github.com/Microsoft/ProjectOxford-ClientSDK/tree/master/SpeakerRecognition

like image 32
Mohamed Sherief Avatar answered Sep 21 '22 17:09

Mohamed Sherief