4.0

4.0 | 1 rating Rate this file 119 Downloads (last 30 days) File Size: 2.44 MB File ID: #40499

Optical Character Recognition 2.0 (OCR 2.0)

by Martin PIEGAY

 

25 Feb 2013

improvements of Optical Character Recognition (OCR)by Diego Orlando

| Watch this File

File Information
Description

We are engineering students at the school of Telecom Saint-Etienne in France, we are specialized in information technologies. For a school project we chose to work on computer vision and especially on the OCR.
Please find enclosed our script files much comment in English.
The main improvement we have made is machine learning characters.
For the segmentation of characters we stop the method of labeling to select the same method used for segmentation lines.
Finally we tried a method of recognizing spaces, but the results are not very conclusive. However, I strongly feel that we are not far from successful.
We would be very happy to receive your feedback on our changes and even your help to find a method of recognition of functional spaces.
Best regards

PS:
First run create_templates_perso2.m and brows the picture Trebuchet_MS.png

After that you can run OCR_perso2.m and try to OCR a picture written with the same police as the templates (Trebuchet MS)

Required Products Image Processing Toolbox
MATLAB release MATLAB 8.0 (R2012b)
Tags for This File  
Everyone's Tags
communications, data import, image processing, optimization, system identification
Tags I've Applied
Add New Tags Please login to tag files.
Please login to add a comment or rating.
Comments and Ratings (1)
24 Mar 2013 Chandra Shekhar

For italic text try to use connected component based extraction instead of vertical segmenting individual character, because in your code Harlow_Solid.jpg image
will not work your method.

Contact us