Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How utterance length affect neural network in speaker recognition?

I'm learning neural networks and trying to create speaker recognition system with tensorflow. I wanted to know how utterance length affect neural network. For example I have 1000 different sound recordings with the same lengths and 1000 different sound recordings with different lenghts. So how theoretically will work neural network with these kind of datas? Will neural network with database of same length recordings will do better or worse? Why?

like image 205
Nikas Žalias Avatar asked Oct 18 '22 19:10

Nikas Žalias


1 Answers

I assume your question can be reformulated as How a neural network can process audio of different length?

The trick is that the signal of an arbitrary size is converted into a sequence of fixed-size feature vectors. See my answers here and here.

like image 103
Dmytro Prylipko Avatar answered Dec 13 '22 12:12

Dmytro Prylipko