how to read a text from a image file.

22 views (last 30 days)
Ayushi Saxena
Ayushi Saxena on 29 Feb 2016
Answered: Sagar Hukkire on 7 Jan 2021
I have a scanned document & I want to extract the text from it & wanted to store the same in some text file... I have no clue how to start this. would be grateful if anyone could help me on this..
Please suggest me is there any way to convert an image file into a text file. Moreover I want to do this without using image processing toolbox.

Answers (2)

Walter Roberson
Walter Roberson on 29 Feb 2016
ocr() from the Computer Vision toolbox.
  3 Comments
Ayushi Saxena
Ayushi Saxena on 29 Feb 2016
Moreover when I check for pixel intensities I am getting 255 for all pixels.
Walter Roberson
Walter Roberson on 29 Feb 2016
I have not been able to determine why you are doing all of this.
If you are a student working on a student project, then your Student Version license covers Computer Vision and Image Processing toolkits, unless you went for the cheapest possible license, in which case the price for them is about $US10 each.
If you are a Home license user, then the toolkits are about $US45 each.
If you are working on a Professional / Commercial license, then although the cost of the toolboxes is much more, you should take into account the cost of your time to write good OCR functionality. OCR is something that people have been working on for decades -- the Post Office was working on OCR in the 1970s for example. OCR is not easy to do well. It will take many many hours of your time to implement, after having spent many many hours researching OCR papers and books (1.2 million hits in Google Scholar I think it was.)
I do not recommend implementing OCR yourself. Buy the functionality. Or buy third-party software that you can invoke on scanned images to do the OCR. Or look for already implemented OCR in C++ or in Java. It is not worth the effort to do it yourself.

Sign in to comment.


Sagar Hukkire
Sagar Hukkire on 7 Jan 2021
Hi Ayushi
This is exactly you are looking I guess ( to get text from scan pdf). Please let us know if you need further help on it
Thanks
Sagar

Categories

Find more on Convert Image Type in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!