What's so difficult about the subject that algorithm designers are having a hard time tackling it?
Is it really that complex?
I'm having a hard time grasping why this topic is so problematic. Can anyone give me an example as to why this is the case?
ASR systems often fail to accurately process and understand human speech because of background noise, multiple people talking at once, signal disruption, and distance from the microphone.

There are limitations to speech recognition software. It does not always work across all operating systems. Noisy environments, accents, and multiple speakers may degrade results. Also, regular voice recognition software often lacks integration with other key services.
Key features of the Google Speech-to-Text API include high accuracy (roughly 80-85%) and broad transcription capabilities: it can transcribe both pre-recorded and real-time audio in 125+ languages and variants.
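For illustration, a pre-recorded transcription request with the official google-cloud-speech Python client might look like the rough sketch below. The file name audio.wav and the 16 kHz LINEAR16 settings are assumptions for the example, not anything required by the service; check the current API documentation before relying on it.

```python
# Rough sketch: transcribe a short pre-recorded file with Google Speech-to-Text.
# "audio.wav", LINEAR16, and 16 kHz are illustrative assumptions.
from google.cloud import speech

client = speech.SpeechClient()

with open("audio.wav", "rb") as f:
    audio = speech.RecognitionAudio(content=f.read())

config = speech.RecognitionConfig(
    encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
    sample_rate_hertz=16000,
    language_code="en-US",  # one of the 125+ supported languages and variants
)

response = client.recognize(config=config, audio=audio)
for result in response.results:
    # Print the top transcription hypothesis for each recognized segment.
    print(result.alternatives[0].transcript)
```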
Auditory processing is a very complex task. Human evolution has produced a system so good that we don't realize how good it is. If three people are talking to you at the same time, you will be able to focus on one signal and discard the others, even if they are louder. Noise is discarded very well too. In fact, if you hear a human voice played backwards, the first stages of the auditory system will send this signal to a different processing area than if it were a real speech signal, because the system will regard it as "non-voice". This is an example of the outstanding abilities humans have.
Speech recognition advanced quickly from the 70s because researchers were studying the production of voice. This is a simpler system: vocal cords excited or not, resonance of the vocal tract... it is a mechanical system that is easy to understand. The main product of this approach is cepstral analysis, which allowed automatic speech recognition (ASR) to achieve acceptable results. But it is a sub-optimal approach. Noise separation is quite bad: even though it works more or less in clean environments, it is not going to work with loud music in the background the way humans can.
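For the curious, cepstral analysis itself only takes a few lines. Here is a minimal NumPy/SciPy sketch, assuming a hypothetical mono recording speech.wav and a 25 ms frame length chosen just for illustration: the real cepstrum of a frame is the inverse FFT of the log magnitude of its FFT.

```python
# Minimal sketch of cepstral analysis on one frame of speech.
# Assumptions: "speech.wav" is a mono PCM file; 25 ms frame length is arbitrary.
import numpy as np
from scipy.io import wavfile

rate, signal = wavfile.read("speech.wav")
frame = signal[: int(0.025 * rate)].astype(float)  # one 25 ms analysis frame
frame *= np.hamming(len(frame))                    # taper to reduce spectral leakage

spectrum = np.fft.rfft(frame)
log_magnitude = np.log(np.abs(spectrum) + 1e-10)   # small offset avoids log(0)
cepstrum = np.fft.irfft(log_magnitude)             # real cepstrum: IFFT of log |FFT|

# Low "quefrency" coefficients describe the vocal-tract envelope,
# while a peak at higher quefrency reveals the pitch period of the vocal cords.
print(cepstrum[:13])
```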
The optimal approach depends on understanding the auditory system: its first stages in the cochlea, the inferior colliculus... but the brain is also involved, and we do not know that much about it yet. It is proving to be a difficult paradigm shift.
In a paper, Professor Hynek Hermansky compared the current state of the research with the era when humans wanted to fly: we did not know what the secret was (the feathers? the flapping of the wings?) until we discovered Bernoulli's principle.