I'm learning neural networks and trying to build a speaker recognition system with TensorFlow. I want to understand how utterance length affects a neural network. For example, suppose I have 1000 different sound recordings that all have the same length and 1000 different sound recordings with varying lengths. How, theoretically, would a neural network work with each kind of data? Would a neural network trained on a database of same-length recordings do better or worse? Why?
I assume your question can be reformulated as: how can a neural network process audio of different lengths?
The trick is that a signal of arbitrary length is converted into a sequence of fixed-size feature vectors. See my answers here and here.
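To make that concrete, here is a minimal sketch (not part of the original answer) of frame-level feature extraction using librosa's MFCC function; librosa itself and the file names are assumptions, and any frame-level feature (log filterbanks, spectrogram columns) would work the same way:

```python
# Sketch: turning variable-length audio into fixed-size feature vectors.
# Assumes librosa is installed; file paths below are hypothetical.
import librosa

def extract_frames(path, n_mfcc=20):
    # Load the recording; its duration can be anything.
    signal, sr = librosa.load(path, sr=16000)
    # Each column is one feature vector computed over a short window,
    # so a longer recording yields more frames, not larger vectors.
    mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=n_mfcc)
    return mfcc.T  # shape: (num_frames, n_mfcc)

# Two recordings of different lengths give sequences of the same vector size:
# frames_a = extract_frames("utterance_a.wav")  # e.g. (312, 20)
# frames_b = extract_frames("utterance_b.wav")  # e.g. (987, 20)
```

The network then operates on these fixed-size vectors (per frame, or pooled/averaged over frames), so the original recording length only changes how many frames there are, not the input dimensionality.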