C#,vb.net,MVC,Jquery,javascript,jscript,vbscript,html,vb,sharepoint,COM,WPF,WCF,Wwf,Asp,Asp.net,questions & answers,

Latest in Sports

Sunday, June 17, 2012

Reading text from scanned PDF using c#.net

We have requirement to read a text from scanned pdf document which has text as embedded image. We have considered using itextsharp and ghost script for this requirement. Please help us if there any other open source which would be best suitable for our requirement. Any suggestion would be really appreciated

SOLUTION 1:

You can use below Microsoft's SDK for this.

Microsoft Office Document Imaging -
http://social.technet.microsoft.com/Forums/en-US/officeappcompat/thread/93d6f285-dc98-46e2-b7e0-872bba9c4e35/[^]

I had evaluated several Third Party OCR SDK's in one of my assignment. In case if you are open for Third Party OCR SDK then search below SDK's on Google.

1) Nuance OmniPage OCR
2) Accusoft SmartZone OCR

No comments:

Post a Comment