INTRODUCTION
Voice recognition uses the acoustic features of speech that have been found to differ between individuals. It makes it possible to use the speaker's voice to verify their identity and control access to services. Voice recognition systems employ two styles of spoken input.
VOICE RECOGNITION
Voice recognition encompasses verification and identification.
Voice Verification:
Verifies a claimed identity from the person's voice.
Voice Identification:
There is no identity claim. The system decides who the person is, what group the person is a member of, or that the person is unknown.
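The difference between the two tasks can be sketched in a few lines. This is a minimal illustration, not a description of any particular system: it assumes each speaker is already represented by a fixed-length feature vector, and the names `verify`, `identify`, `enrolled` and the threshold of 0.8 are hypothetical choices made for the example.

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def verify(claimed_id, features, enrolled, threshold=0.8):
    """Verification: compare against the one claimed identity (1:1)."""
    return cosine_similarity(features, enrolled[claimed_id]) >= threshold

def identify(features, enrolled, threshold=0.8):
    """Identification: no claim, so search all enrolled speakers (1:N)."""
    scores = {name: cosine_similarity(features, ref)
              for name, ref in enrolled.items()}
    best = max(scores, key=scores.get)
    # Report "unknown" when even the best match is too weak.
    return best if scores[best] >= threshold else "unknown"

# Toy enrolment data (made-up vectors, not real speech features).
enrolled = {"alice": [1.0, 0.0, 0.2], "bob": [0.0, 1.0, 0.1]}
sample = [0.9, 0.1, 0.2]

print(verify("alice", sample, enrolled))
print(identify(sample, enrolled))
```

Verification needs one comparison per attempt, while identification compares the sample against every enrolled speaker and must also allow for the "unknown" outcome.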
SPEECH PROCESSING
Speech signal acquisition
Speech production
The vocal tract includes:
- Laryngeal pharynx
- Oral pharynx
- Oral cavity
- Nasal pharynx
- Nasal cavity
Sources of recognition error include:
- Misspoken or misread prompted phrases
- Extreme emotional states
- Time-varying microphone placement
- Poor room acoustics
- Channel mismatch
- Sickness
- Ageing
Characteristics of neural networks:
- Massive parallelism
- Learning ability
- Robustness
- Inherent contextual information processing
- Adaptability
- Generalization ability
- Fault tolerance capability
- Low energy consumption
Regularity detection
Based on the learning process, neural networks are classified as fixed networks and adaptive networks.
Supervised Learning
Incorporates an external teacher so that each output unit is told what its desired response ought to be.
o Back propagation
Unsupervised Learning
Uses no external teacher and is based on only local information.
o Hebbian Learning
o Competitive Learning
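As an illustration of learning from local information only, the sketch below applies the plain Hebbian rule to a single linear unit: each weight changes in proportion to the product of its own input and the unit's output, with no teacher signal. The function name `hebbian_update`, the learning rate and the toy input are assumptions made for this example.

```python
import numpy as np

def hebbian_update(weights, x, learning_rate=0.1):
    """One plain Hebbian step for a linear unit: dw = eta * y * x."""
    y = float(np.dot(weights, x))  # the unit's own output, no target needed
    return weights + learning_rate * y * x

w = np.array([0.1, 0.1])
x = np.array([1.0, 0.0])  # only the first input is active
for _ in range(3):
    w = hebbian_update(w, x)

print(w)
```

Only the weight on the active input grows. The plain rule is unbounded, so practical variants add normalisation or decay.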
The backpropagation algorithm proceeds as follows:
1. Initialise the weights to small random values.
2. Randomly choose an input pattern.
3. Propagate the signal forward through the network.
4. Compute the error in the output layer.
5. Compute the errors for the preceding layers by propagating the errors backwards.
6. Update the weights.
7. Go to step 2 and repeat for the next pattern until the error in the output layer is below a pre-specified threshold or a maximum number of iterations is reached.
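The training steps above can be sketched for a network with one hidden layer. This is a minimal sketch under stated assumptions rather than a reference implementation: sigmoid units, squared error, pattern-by-pattern updates, and a stopping test on the mean squared error over all patterns are choices made here, and `train_backprop`, its parameters and the toy logical-OR data (a stand-in for real speech features) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_backprop(X, T, n_hidden=4, lr=0.5, threshold=0.01,
                   max_iters=20000, seed=0):
    rng = np.random.default_rng(seed)
    n_in, n_out = X.shape[1], T.shape[1]
    # Step 1: initialise the weights to small random values.
    W1 = rng.uniform(-0.5, 0.5, (n_in, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.uniform(-0.5, 0.5, (n_hidden, n_out))
    b2 = np.zeros(n_out)
    mse = float("inf")
    for it in range(max_iters):
        # Step 2: randomly choose an input pattern.
        i = rng.integers(len(X))
        x, t = X[i], T[i]
        # Step 3: propagate the signal forward through the network.
        h = sigmoid(x @ W1 + b1)
        y = sigmoid(h @ W2 + b2)
        # Step 4: error term in the output layer (squared error, sigmoid).
        delta_out = (y - t) * y * (1.0 - y)
        # Step 5: propagate the error backwards to the hidden layer.
        delta_hid = (delta_out @ W2.T) * h * (1.0 - h)
        # Step 6: update the weights by gradient descent.
        W2 -= lr * np.outer(h, delta_out)
        b2 -= lr * delta_out
        W1 -= lr * np.outer(x, delta_hid)
        b1 -= lr * delta_hid
        # Step 7: stop once the error over all patterns is small enough.
        Y = sigmoid(sigmoid(X @ W1 + b1) @ W2 + b2)
        mse = float(np.mean((Y - T) ** 2))
        if mse < threshold:
            break
    return (W1, b1, W2, b2), mse, it + 1

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
T = np.array([[0], [1], [1], [1]], dtype=float)  # logical OR
params, mse, iters = train_backprop(X, T)
print(f"stopped after {iters} iterations, mean squared error {mse:.4f}")
```

Checking the error over every pattern at each step is wasteful for large training sets. A real system would more likely evaluate it once per pass through the data.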
Application areas:
- Communication
- Corporate customer relations
- Banking transactions
- Education
- Military
- Policing
- Security systems
systems
engineering
machine interface
FUTURE SCOPE FOR VOICE RECOGNITION TECHNOLOGY
- Flight controls
- Interactive internet searching
- Interactive shopping carts
- Robotic space arm control
- Self-driving car interface
- Death of the remote control
CONCLUSION
If the 21st century is to be the age of intelligent machines, artificial neural networks will become an integral part of our lives. For software engineers to lead us to this promised life, they must begin by utilising the emerging technology of artificial neural networks. As users begin to take advantage of the technology and demand grows for better software, manual interfaces of all kinds will become a thing of the past. Interaction through voice recognition is well on its way to becoming a reality. But even this is a temporary stage in the evolution of man-machine interaction. One day there will be a symbiosis between the two.