What are the different techniques of speech recognition?
Several speech recognition techniques are used to capture spoken words and convert them to data that can be used by a software program. There are three wide ways to analyze speech in an effort to determine what they say. The first is called discrete speech, which means that only one word is spoken suddenly. The latter is known as interconnected speech and words must be spoken in some way to understand. Finally, there is a continuous speech, which is how most people normally speak. This system includes large data trees of phonemas or basic sounds and syllables that are divided by the statistical probability of one sound after another. By comparing each phonema with a node in the sounds of the sounds, the actual completed word can be determined with a high degree of accuracy in a relatively short period of time.
One problem that is difficult overomes with some techniques of speech recognition is insulating where the word begins and ends. This task is complicated by noise in the background in the room and the fact that some SLABikes have a sound signature that resembles a break between words. For this reason, the most accurate techniques are discreet and interconnected.
Another factor that separates different techniques of speech recognition is the problem of software vocabulary. The software that interprets speech can either have a very limited vocabulary with a high accuracy or a large vocabulary that must be adapted to individual patterns of a particular user speech. When the program uses the assembly method of HMM words, the less the number of words that are understood, the program can be more accurate. This is the method most automated telephone systems use to decrypt numbers or answers to questions.
Recotechnics Gniti, which understands large vocabulary, are usually designed to interact with very little or only one user. This is because the program must be trained to understand speechto the patterns of a speaking person. Training includes reading pre -created text paragraphs to software. Readed words are known, so the program is able to create a statistical model of phonemas specific to users. This gives the program a much greater chance of understanding the user, but it could also prevent the understanding of people with whom it did not practice.
The most difficult of speech recognition techniques is an interpretation of continuous or natural speech. Many people tend to control words together and speak at different speeds, so the accuracy of programs that translate continuous speech is lower than other methods. However, there are still programs that can translate this type of speech, some of them use fuzzy logic and neural networks to help recognize and isolate words.