4/11/12
Curtain Raiser
OCR: a software tool of great utility.
full-text document from an image without actually typing it.
according to requirement.
long time to reach the homes of common people; record maintenance has always been done on paper.
Such a software tool has now arisen, one that can help with the gruelling task of converting written documents into electronic or digital format.
document digitizing and preservation to handwritten text recognition.
Practical applications
One can always write a document in a place where no computer is available and rest assured that the document can be digitized without any further time investment.
converted to the digital format without a manual operator actually typing the whole text.
more than just a single character, thereby preventing the user from recognizing the text.
the task of Optical Character Recognition.
They require users to buy the complete hardware.
software that gives the user the liberty of doing this mammoth task of digitizing a paper-based document in a very friendly and easy manner.
recognition, which makes it even more desirable when the user wants a digitized document of his or her own handwritten text.
Construction Process
Information gathering
For this project, we collected information mainly from the internet.
We searched for existing software on the internet.
We discussed the project scope with regular users of word processors and with people involved in programming, especially in the field of artificial neural networks.
we develop the project.
Technologies used
We developed the tool mainly using:
Core Java concepts and Java Swing.
NetBeans as the IDE for Java programming.
MySQL for back-end database connectivity.
Activity diagram
Class diagram
Sequence diagram
Sequence diagram
Architecture Design
from where it reaches the pixel extractor module in its original form, i.e. in image format.
equivalent array form of the image pixels.
The image is converted into a greyscale version of the input image.
A corresponding array of the greyscale image.
where the system evaluates each pixel of the input image and separates the pixels forming the text from those forming the background.
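The greyscale conversion and the text/background separation described above could be sketched roughly as follows. This is only an illustration: the class name `PixelExtractorSketch`, the method names, the channel-averaging formula and the fixed threshold are all assumptions, not the project's actual code.

```java
import java.awt.image.BufferedImage;

// Hypothetical sketch of the pixel extractor step; names are not from the project.
final class PixelExtractorSketch {

    /** Converts an image into a 2D array of grey levels in the range 0..255. */
    static int[][] toGreyArray(BufferedImage image) {
        int[][] grey = new int[image.getHeight()][image.getWidth()];
        for (int y = 0; y < image.getHeight(); y++) {
            for (int x = 0; x < image.getWidth(); x++) {
                int rgb = image.getRGB(x, y);
                int r = (rgb >> 16) & 0xFF;
                int g = (rgb >> 8) & 0xFF;
                int b = rgb & 0xFF;
                // Assumption: a plain average of the three channels.
                grey[y][x] = (r + g + b) / 3;
            }
        }
        return grey;
    }

    /** Marks a pixel as text (true) when it is darker than the threshold. */
    static boolean[][] separateText(int[][] grey, int threshold) {
        boolean[][] isText = new boolean[grey.length][];
        for (int y = 0; y < grey.length; y++) {
            isText[y] = new boolean[grey[y].length];
            for (int x = 0; x < grey[y].length; x++) {
                // Assumption: dark ink on a light background.
                isText[y][x] = grey[y][x] < threshold;
            }
        }
        return isText;
    }
}
```

A threshold of 128 is a reasonable starting point for dark text on a light page; a real scan would probably need a threshold chosen per image.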
Continued
Each of the separate arrays so formed is fed to the neural network, where each pixel value forms an input node and the output nodes are those obtained from the database.
The SOM then identifies the character and suggests the winning neuron from the output nodes.
This output neuron is called the winning neuron.
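In a self-organizing map, the winning neuron is commonly taken to be the output node whose weight vector lies closest to the input. A minimal sketch of that selection is shown below. It assumes Euclidean distance and that the weight vectors (for example, stored character templates) have already been loaded from the database; the class and method names are hypothetical.

```java
// Hypothetical sketch of winning-neuron selection; not the project's actual code.
final class SomWinnerSketch {

    /**
     * Returns the index of the output node whose weight vector has the
     * smallest squared Euclidean distance to the input, or -1 if there
     * are no output nodes.
     */
    static int findWinningNeuron(double[] input, double[][] weights) {
        int winner = -1;
        double bestDistance = Double.POSITIVE_INFINITY;
        for (int i = 0; i < weights.length; i++) {
            double distance = 0.0;
            for (int j = 0; j < input.length; j++) {
                double diff = input[j] - weights[i][j];
                distance += diff * diff;
            }
            if (distance < bestDistance) {
                bestDistance = distance;
                winner = i;
            }
        }
        return winner;
    }
}
```

Each row of `weights` must have the same length as `input`, since every pixel value corresponds to one input node.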
Modular approach
Modules used
a) Image loading/processing module
b) Pixel extractor
c) Segmentation module
d) Scanning
e) Self-organizing map
f) Conversion to text
g) Spell checker
h) Saving
Testing
Purpose of Testing
Software testing is an unavoidable step in software
intent of finding an error, eliminating errors to produce error-free software that meets the specification.
possible after they occur and identify the cause of the fault so that remedial steps can be taken.
recognition. As input we gave a username and the corresponding password. she holds a valid account and must be denied entry if they don't have an account. the picture upload module. else
The input given in the text pane was a misspelled word such as "brwn".
Expected outcome was the suggestion of the word
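One way a spell checker can produce such a suggestion is to pick the dictionary word with the smallest edit distance to the input. The sketch below assumes this approach; the slides do not say which method the project used, and the class name, method names and sample dictionary are hypothetical.

```java
import java.util.List;

// Hypothetical sketch of a spelling suggestion; not the project's actual code.
final class SpellSuggestSketch {

    /** Levenshtein distance: minimum number of single-character edits. */
    static int editDistance(String a, String b) {
        int[] previous = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            previous[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            int[] current = new int[b.length() + 1];
            current[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                current[j] = Math.min(
                        Math.min(current[j - 1] + 1, previous[j] + 1),
                        previous[j - 1] + cost);
            }
            previous = current;
        }
        return previous[b.length()];
    }

    /** Returns the closest dictionary word, or null for an empty dictionary. */
    static String suggest(String word, List<String> dictionary) {
        String best = null;
        int bestDistance = Integer.MAX_VALUE;
        for (String candidate : dictionary) {
            int distance = editDistance(word, candidate);
            if (distance < bestDistance) {
                bestDistance = distance;
                best = candidate;
            }
        }
        return best;
    }
}
```

With an assumed dictionary containing "bread", "brown" and "crown", the input "brwn" is one insertion away from "brown", so that word would be suggested.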
Validation: unless an image has been taken as input, no button should work.
Expected: a message should be displayed asking the user to load an image first.
Output: an alert appears asking the user to open an image first.
Limitations
The image should be of identifiable quality.
The image should be in a valid image file format, i.e. file
trained first on images of the user's handwritten documents. Only then will the system be able to recognize the input image.
The image should not contain text written in cursive handwriting.
The input image file should not be aligned upside down.
Future Scope
Things that could be added at a later point in time to enhance the functionality of the project