Академический Документы
Профессиональный Документы
Культура Документы
College of Engineering,
Kankavali, Sindhudurg, Maharashtra
Certificate
This is to certify that, the synopsis report of the project entitled “Handwritten
Character Recognition to obtain Editable Text ” is successfully submitted by
for the partial fulfillment of the project stage-I of the degree of Bachelor of Engineering
in Electronics and Telecommunication Engineering.
Principal
A Synopsis Report On
Bachelor of Engineering
in
Electronics and Telecommunication Engineering
University of Mumbai
Academic Year 2019-2020
ACKNOWLEDGEMENT
We have taken efforts in this project synopsis work. We are highly indebted to Prof.
V.V.Mainkar for his guidance and constant supervision for providing necessary information
regarding the project synopsis work. This project synopsis work has been carried out under the
direct supervision and leadership of Prof. S.S.Velling, Head of the Electronics and
Telecommunication Engineering Department, without whose supervision and support it was
merely impossible to accomplish the task. We are very greatful for many discussions we had,
especially on analysis of theory.
We offer our humble and sincere thanks to Dr.A.C.Gangal ,Principal ,S.S.P.M’s College Of
Engineering,Kankavli for his all possible cooperation.We express our sincere thanks to all staff
members of Electronics And Telecommunication Department of S.S.P.M’s College Of
Engineering,Kankavli for their keen interest and an encouragement during the project synopsis
work.
This is to declare that this report has been written by us. No part of the
report is plagiarized from other sources. All information included from
other sources has been duly acknowledged. We aware that if any part of the
report is found to be plagiarized, we are shall taken full responsibility for
it.
Character Recognition for read the text from image which is the Huge Area for research to
Develop Computer Based Application. Nowadays, there is a storing of information from
handwritten documents to computer readable format for future use. One of the simple way to
store the information from paper document is to first capture or scan the paper document and
save them as an image. ‘Optical Character Recognition’ it is the method to transform
handwritten data into electronic format. The main challenge is to recognize the character of
different people having different style of handwriting. Thus we will design a system that
recognize the handwritten character from old documents.
The main problem is the handwriting style of every different people has its own approach to
handwriting in different languages .This problem motivated us to build a system that will
recognize character (English)given as an input image
1.2 Objective
The main requirement for this project is to design a module that can recognize character
using the neural network method. Therefore, the following objectives need to be achieved
to satisfy the development of the project.
To study Neural Network algorithm and develop a system that is able to recognize
characters
To detect, extract and recognize characters using Neural Network.
To reduce noise from handwritten documents.
HCR is used the stages like preprocessing, segmentation, feature extraction and
recognition using neural network. In Preprocessing image document to make use for
segmentation. In segmentation the image is segmented into individual character then
feature extraction technique is apply on character image.
1.4 Artificial Neural Network (ANN)
Artificial Neural Network (ANN) is a computing model of brain, having paralleled distributed
processing elements. It can be used for computational processors for different tasks like data
compression, classification, combinatorial optimization problem solving, pattern recognition
etc. ANN has many benefits over the other classical methods. These methods include Artificial
Neural Networks (ANNs), Kernel Methods including Support Vector Machines (SVM) and
multiple classifier combination.
2. LITERATURE REVIEW
2.3 Image preprocessing for optical character recognition using neural networks
In this paper forward-feed neural networks is used to processing of text for optical character.
Application was developed and its characteristics were set according to results of practical
experiments.[3]
2.5 Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models
In this paper Hybrid Hidden Markov Model (HMM) is used for recognizing offline handwritten
texts. In this paper, different techniques are applied to remove slope and slant from handwritten
text and to normalize the size of text images with supervised learning methods. The key features
of this recognition system were to develop a system having high accuracy in preprocessing and
recognition, which are both based on ANNs.[5]
3. METHODOLOGY
A character recognition system receives an input in the form of image which contains some
text information. The output of this system is in electronic format. There are three modules:
(A) pre-processing (B) text recognition (C) post-processing. Each module is further described
in detail as bellow:
Scan Image(input image)
Noise Removal
Pre-Processing
Module
Normalization
Filtered image
Segmentation
Classification
Post processing
Store Text data in
module
proper format
The document is captured by the camera and is converted in the form of a picture. It is the
combinations of pixels. At this stage we have the data in the form of image and this image so
that’s the important information can be retrieved. So to improve quality of the input image,
few operation are performed for enhancement of image such as noise removal, normalization,
binarization etc.
Due to this quality of the image will increase and it will effect recognition process for better
text recognition in images. And it results in generation of more accurate output at the end of
character recognition processing. There are many methods for image noise removal such as
mean filter, min-max filter, Gaussian filter etc.
3.1.2 Normalization:
The process for which the data need to be organized in the database where range of pixel
intensity values changes.
3.1.3 Binarization:
A handwritten document is first scanned and is converted into a gray scale image. Gray scale
images are converted to binary images by using binarization.
This module can be used for text recognition in output image of pre-processing model and give
output data which are in computer understandable form. Hence in this module following
techniques are used
3.2.1 Segmentation:
In recognition module, the segmentation is the most important process. Segmentation is done
to make the separation between the individual characters of an image. A user can write text in
the form of lines. Thus the image is first segmented into line. Then each individual line is
segmented into word. Finally each word is segmented into individual character.
3.2.2 Feature Extraction:
Feature extraction is the process to separate the most important data from the raw data. There
are different classes are made to store the different features of a character. There are many
technique used for feature extraction like Principle Component Analysis (PCA), Linear
Discriminate Analysis (LDA), Independent Component Analysis (ICA), Chain Code (CC),
Gradient Based features, Histogram etc.
3.2.3 Classification:
Input to this stage is output of the feature extraction process. The input feature with stored
pattern is compared and find out best matching class for input. There are many technique used
for classification such as Artificial Neural Network (ANN), Template Matching, Support
Vector Matching (SVM) etc.
The output of recognition module is in the form text data which is understand by computer,
So there need to store it in to some proper format( i.e. text or MS-Word )for farther use such
as editing or searching in that data.
3.3.1 Block Diagram of The Work
HCR System
Recognition
Image pre- Segmentation Feature
using Neural
processing Extraction
Network
Divide words
Binarization into letters
4. Application
Character recognition technology is apply the entire spectrum of industries. This technology
need to scan documents to recognize the text content by computers. With the help of this
technology, no need to manually retype important documents when convert them into
electronic format. For e.g. Banking, Healthcare, Government offices .
5. Schedule
2 Project Analysis 2
7 Installation of software 1
7. Coding 1
8. Software testing 1
9. Implementation 1