Optical Character Recognition 2.0 (OCR 2.0)

version (2.44 MB) by Martin PIEGAY
Improvements to Optical Character Recognition (OCR) by Diego Orlando


Updated 25 Feb 2013

We are engineering students at Telecom Saint-Etienne in France, specializing in information technologies. For a school project we chose to work on computer vision, and on OCR in particular.
Please find enclosed our script files, which are heavily commented in English.
The main improvement we have made is machine learning of the characters.
For character segmentation, we dropped the labeling method and instead use the same method that is used for line segmentation.
Finally, we tried a method for recognizing spaces, but the results are not very conclusive. However, I strongly feel that we are not far from success.
We would be very happy to receive your feedback on our changes, and even your help in finding a working method for recognizing spaces.
Best regards
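
The segmentation idea described above can be illustrated with a short sketch. It is written in Python rather than MATLAB and is not taken from the submitted scripts. It assumes that the line-segmentation method is a projection-profile scan (summing ink pixels per row, then per column inside each line), which the description does not state explicitly. The function names and the `gap_factor` heuristic for spaces are hypothetical and only show one possible approach.

```python
import statistics


def runs(profile):
    """Return (start, end) pairs of consecutive non-zero entries; end is exclusive."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i                      # a run of ink begins
        elif not value and start is not None:
            spans.append((start, i))       # the run ended at the previous index
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def segment_lines(img):
    """Split a binary image (list of rows, 1 = ink) into text lines by row sums."""
    return runs([sum(row) for row in img])


def segment_chars(img, top, bottom):
    """Split one text line (rows top..bottom-1) into characters by column sums."""
    width = len(img[0])
    profile = [sum(img[r][c] for r in range(top, bottom)) for c in range(width)]
    return runs(profile)


def find_spaces(char_spans, gap_factor=2.0):
    """Flag each gap between neighbouring characters that looks like a word space.

    Assumption: a gap wider than gap_factor times the median gap is a space.
    """
    gaps = [nxt[0] - cur[1] for cur, nxt in zip(char_spans, char_spans[1:])]
    if not gaps:
        return []
    threshold = gap_factor * statistics.median(gaps)
    return [gap > threshold for gap in gaps]


image = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 0, 1, 1, 0, 0, 0, 1],
    [1, 0, 0, 1, 1, 0, 0, 1, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]

for top, bottom in segment_lines(image):
    chars = segment_chars(image, top, bottom)
    print((top, bottom), chars, find_spaces(chars))
```

A fixed multiple of the median gap is fragile when a line contains very few characters or when the font has wide natural letter spacing, which may be one reason space detection is hard to get right.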

First, run create_templates_perso2.m and browse to the picture Trebuchet_MS.png.

After that, you can run OCR_perso2.m and try to OCR a picture written in the same font as the templates (Trebuchet MS).
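
The two-step workflow above (build templates from one font, then recognize text in that font) can be illustrated with a minimal template-matching sketch. It is written in Python, not MATLAB, and is not the code from create_templates_perso2.m or OCR_perso2.m. It assumes that each segmented glyph is compared against every stored template using a 2-D correlation coefficient and that the best score wins; whether the scripts use exactly this score is an assumption. The names `corr2` and `recognize` and the tiny 3x3 templates are hypothetical.

```python
import math


def corr2(a, b):
    """2-D correlation coefficient of two equal-size grids (Pearson over all pixels)."""
    xs = [v for row in a for v in row]
    ys = [v for row in b for v in row]
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = math.sqrt(sum((x - mean_x) ** 2 for x in xs) *
                    sum((y - mean_y) ** 2 for y in ys))
    return num / den if den else 0.0   # a flat image correlates with nothing


def recognize(glyph, templates):
    """Return the label of the template that correlates best with the glyph."""
    return max(templates, key=lambda label: corr2(glyph, templates[label]))


# Toy templates standing in for glyphs cut from a font sample image.
templates = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
    "T": [[1, 1, 1], [0, 1, 0], [0, 1, 0]],
}

# A "T" with one pixel missing from its stem.
noisy_t = [[1, 1, 1], [0, 1, 0], [0, 0, 0]]
print(recognize(noisy_t, templates))  # prints "T"
```

Because the score compares pixels position by position, the glyph and the templates must be resized to the same dimensions first, and a glyph from a different font may correlate poorly with every template. That is consistent with the advice to test on pictures written in the same font as the templates.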

Cite As

Martin PIEGAY (2020). Optical Character Recognition 2.0 (OCR 2.0) (, MATLAB Central File Exchange. Retrieved .

Comments and Ratings (3)


Please, could you tell me what the entry function of this algorithm is?

For italic text, try connected-component-based extraction instead of vertically segmenting individual characters, because your method will not work on the Harlow_Solid.jpg image in your code.
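
The connected-component suggestion in the comment above can be sketched as follows. This is a Python illustration, not part of the submission, and the function name `connected_components` is hypothetical. It uses a breadth-first flood fill with 8-connectivity and returns one bounding box per component. The toy image holds two slanted strokes whose columns overlap, so a vertical projection would see a single unbroken run, while labeling separates them.

```python
from collections import deque


def connected_components(img):
    """Return bounding boxes (top, left, bottom, right) of 8-connected ink regions.

    img is a list of rows with 1 = ink; bottom and right are exclusive.
    Boxes are sorted left to right.
    """
    height, width = len(img), len(img[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if not img[r][c] or seen[r][c]:
                continue
            # Flood-fill a new component starting from this unvisited ink pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < height and 0 <= nx < width
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom + 1, right + 1))
    return sorted(boxes, key=lambda box: box[1])


# Two slanted strokes: every column contains ink, so column sums never drop to zero.
italic = [
    [0, 0, 0, 1, 0, 0, 1],
    [0, 0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0, 0],
    [1, 0, 0, 1, 0, 0, 0],
]
print(connected_components(italic))  # [(0, 0, 4, 4), (0, 3, 4, 7)]
```

One trade-off is that characters made of several disconnected parts, such as "i" or "j", come back as separate components and need to be merged afterwards.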

MATLAB Release Compatibility
Created with R2012b
Compatible with any release
Platform Compatibility
Windows macOS Linux