Any simple VAD implementation?

2 Answers

Google's open-source WebRTC code has a VAD module written in C. It uses a Gaussian Mixture Model (GMM), which is typically much more effective than a simple energy-threshold detector, especially in a situation with dynamic levels and types of background noise. In my experience it's also much more effective than the Moattar-Homayounpour VAD that Gilad mentions in their comment.

The VAD code is part of the much, much larger WebRTC repository, but it's very easy to pull it out and compile it on its own. E.g. the webrtcvad Python wrapper includes just the VAD C source.

The WebRTC VAD API is very easy to use. First, the audio must be mono 16 bit PCM, with either a 8 KHz, 16 KHz or 32 KHz sample rate. Each frame of audio that you send to the VAD must be 10, 20 or 30 milliseconds long.

Here's an outline of an example that assumes audio_frame is 10 ms (320 bytes) of audio at 16000 Hz:

#include "webrtc/common_audio/vad/include/webrtc_vad.h"
// ...
VadInst *vad;
WebRtcVad_Create(&vad);
WebRtcVad_Init(vad);
int is_voiced = WebRtcVad_Process(vad, 16000, audio_frame, 160);

159

answered Sep 17 '22 15:09

John Wiseman

There are open source implementations in the Sphinx and Freeswitch projects. I think they are all energy based detectors do won't need any kind model.

Sphinx 4 (Java but it should be easy to port to C/C++)

PocketSphinx

Freeswitch

answered Sep 20 '22 15:09

Paul Dixon

Related questions
                            
                                open google maps app from a browser with default start location on android and iphone
                            
                                Objective-C, cancel a dispatch queue using UI event
                            
                                NSURLIsExcludedFromBackupKey can not be set correctly
                            
                                OpenGL ES 2.0 Rendering with a Texture
                            
                                Distinguish between iPhone web browser and iPhone app user agent
                            
                                What are the supported image file formats for display on the iPhone?
                            
                                URLForUbiquityContainerIdentifier returns nil even if configured correctly
                            
                                Title footer for Group in Setting bundle
                            
                                add button to uitextfield
                            
                                MFMailComposeViewController does not dismiss
                            
                                How to delete the last row of a section?
                            
                                Getting the 'scale' from a CATransform3D
                            
                                Getting XCode to include, compile and link existing (C++) codebase in XCode 4.3(.1)
                            
                                iPhone 5 TabBar not functioning in proper position
                            
                                How do I update the StatusBar Style as part of a custom transition
                            
                                Do you tag your UIViews or retain them as properties?
                            
                                Is there any danger in leaving NSLog statements when building an app for distribution?
                            
                                UISegmentedControl delegate/Touch Events
                            
                                iPhone (iOS): copying files from main bundle to documents folder causes crash
                            
                                Drawing app on iPad using OpenGL

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Any simple VAD implementation?

Tags:

c++

c

iphone

audio

voice

Gilad Novik

People also ask

2 Answers

John Wiseman

Paul Dixon

Recent Activity

Donate For Us