Вы находитесь на странице: 1из 25

VOICE RECOGNITION USING ARTIFICIAL NEURAL NETWORKS

BY Ms. SONA PREM BHASIN E-305

UNDER THE GUIDANCE OF MRS. KIRAN DANGE

INTRODUCTION
Voice recognition uses the acoustic features of speech that have been found to differ between individuals. It makes possible to use the speaker's voice to verify their identity and control access to services. Voice recognition systems employ two styles of spoken input:

Text-dependent : require the speaker to say key words or sentences.

Text-independent: do not rely on a specific text being spoken.

VOICE RECOGNITION
Voice recognition encompasses Verification & Identification.

Voice Verification:
Verifies the identity claimed from persons voice.

Voice Identification:
There is no identity claim The system decides who the person is, What Group the person is a member of, Or that the person is unknown.

SPEECH PROCESSING
Speech signal acquisition Speech production

Vocal tract is speech production organ

Includes

- Laryngal pharynx - Oral pharynx - Oral cavity - Nasal pharynx - Nasal cavity

VOICE RECOGNITION STEPS INVOLVED


Feature Selection Pattern Matching Template models DTW VQ source modeling Nearest neighbors Stochastic Model HMM Hypothesis Testing

VOICE RECOGNITION SYSTEM

SOURCES OF VERIFICATION ERROR IN VOICE RECOGNITION

Misspoken or misread prompted phrases Extreme emotional states Time varying microphone placement Poor room acoustics Channel mismatch Sickness Ageing

WHY USE ARTIFICIAL NEURAL NETWORKS ?


Massive parallelism Learning ability Robustness Inherent contextual information processing Adaptability Generalization ability Fault tolerance capability Low energy consumption

ARCHITECTURE OF NEURAL NETWORKS Network Layers - Single layer - Multilayer

Feed Forward Networks Signals travel in one way only

Feedback Networks Signals travel in both directions

Perceptrons Calculates weighted sum of inputs and compares it to a threshold

ANN LEARNING PROCESS

Associative mapping - Auto association - Hetero association


Nearest neighbor recall Interpolative recall

Regularity detection
Based on the learning process, Neural networks are classified as Fixed Networks and Adaptive Networks

LEARNING ALGORITHMS OF NEURAL NETWORKS

Supervised Learning
Incorporates an external teacher so that each output unit is told what its desired response ought to be.
o Back propagation

Unsupervised Learning
Uses no external teacher and is based on only local information.
o Hebbian Learning o Competitive Learning

BACK PROPAGATION NETWORK

BACK PROPAGATION ALGORITHM


Initialise the weights to small random values. Randomly choose an input pattern. Propagate the signal forward through the network. Compute error in the output layer. Compute the errors for the preceding layers by propagating the errors backwards. Update weights. Go to step 2 and repeat for the next pattern until the error in the output layer is below a pre-specified threshold or a maximum

Implementation of applications involve:


Application Problem Formulation Algorithm Analysis Neural Model Architecture Implementation

APPLICATIONS OF NEURAL NETWORKS

Neural Networks in practice:


Sales forecasting Industrial process control Customer research Target marketing

Neural Networks in Medicine:


Modeling and diagnosing cardiovascular system Instant physician

LIMITATIONS OF ARTIFICIAL NEURAL NETWORKS

Operational problems encountered while simulating parallelism of Neural Networks.

Verification and integration of ANNs into real life.


Inability to obtained. explain certain results

APPLICATIONS OF VOICE RECOGNITION

Communication Corporate customer relations Banking transactions Education Military Policing Security Systems

FUTURE SCOPE FOR ANNs


User-specific Genetic Man

systems

engineering

machine interface

FUTURE SCOPE FOR VOICE RECOGNITION TECHNOLOGY Flight controls Interactive internet searching Interactive shopping carts Robotic space arm control Self driving car interface Death of remote control

CONCLUSION
If the 21st century is to be the age of intelligent machines, artificial neural networks will become an integral part of our lives. In order that software engineers can lead us to this promised life they must begin by utilising the emerging technology of artificial neural networks. As users begin to take advantage of the technology and demand grows for better software, manual interfaces of all kinds will become a thing of the past. This method of using Voice Recognition for interaction is well on its way to becoming a reality. But even this is a temporary stage in the evolution of manmachine interaction. One day there will be a symbiosis between the two.

REFERENCES
[1] J.P. Campbell, JR, Speaker Recognition: A Tutorial, Proceedings of the IEEE, Vol. 85, No. 9, September 1997. [2] Patricia Melin, Jerica Urias, Daniel Solano, Miguel Soto, Voice Recognition with Neural Networks, Type-2 Fuzzy Logic and Genetic Algorithms [3] R.L. Kashyap, Speaker Recognition from a Unknown Utterance and Speaker-Speech Interaction, IEEE Tran on Acoustics, Speech and Signal Processing, vol. assp-24, no. 6, pp. 481-488, December 1976. [4] H.Gish and M.Schmidt,Text-independent speaker identification, IEEE Signal Process. Mag., vol.18, pp.18-32, Oct, 2002 [5] Morgan, D., Scofield,C., and Adcock,J. (1991). Multiple Neural Network Topologies Applied to Keyword Spotting, In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1991.

Вам также может понравиться