Text Recognition - Extract Text from Scanned Images or PDF - ByteScout

Text Recognition SDK: Read and Extract Text from Scanned or Photographed Images and from PDF

Text Recognition SDK helps developers extract and recognize text from scanned documents. It supports PDF, PNG, TIFF, and JPEG input.

What is Text Recognition?

Text recognition is the process of detecting typed or printed text in images or documents (e.g. PDF) and converting it into machine-encoded text. It relies on OCR (Optical Character Recognition) powered by machine learning and AI.

Screenshots of the SDK in action: Extract Text from Areas, Recognize Text from Document, Image Pre-processing Filters, Text Auto-Correction.


  • Automates tedious tasks such as data entry from driver's licenses, passports, receipts, technical documents, bank statements, and similar documents.
  • Does not require any third-party applications or tools to be installed (e.g. Adobe Reader).
  • Works offline! No internet connection is required.
  • OCR support for 102 languages


Text Recognition SDK Key Benefits

  • Reads and extracts text from scanned images, photos, pictures;
  • Preserves the original text formatting and layout;
  • Provides low-level functions that return the precise coordinates of each recognized piece of text;
  • Offers image preprocessing filters that improve recognition confidence on low-quality scans;
  • Lets you specify the rectangular areas of an image to recognize, with optional rotation and flipping;
  • Performs OCR text recognition;
  • Applies text filters that automatically fix typical OCR errors;
  • Supports PDF, PNG, JPG, TIFF (single or multi-page) as input;
  • Compatible with Mono and .NET Core frameworks;
  • Supports ActiveX/COM controls for legacy programming languages (C++, Visual Basic 6, Delphi) and scripting (VBScript, JScript, and others).
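
To illustrate what a text filter for typical OCR errors might do, here is a minimal Python sketch. It is not the SDK's API or its actual correction logic; the substitution table and the function names `fix_numeric_token` and `autocorrect` are assumptions made for this example. It replaces letters that OCR engines often confuse with digits (such as "O" for "0" or "l" for "1"), but only inside tokens that already look numeric.

```python
import re

# Hypothetical substitution table (an assumption for this sketch, not taken
# from the SDK): letters that OCR commonly confuses with digits.
LETTER_TO_DIGIT = str.maketrans(
    {"O": "0", "o": "0", "l": "1", "I": "1", "S": "5", "B": "8"}
)

# A token is treated as numeric-like when it contains at least one digit and
# consists only of digits, confusable letters, and common number punctuation.
NUMERIC_LIKE = re.compile(r"^(?=.*\d)[\dOolISB.,/-]+$")


def fix_numeric_token(token: str) -> str:
    """Replace confusable letters with digits in a numeric-looking token."""
    if NUMERIC_LIKE.match(token):
        return token.translate(LETTER_TO_DIGIT)
    return token


def autocorrect(text: str) -> str:
    """Apply the numeric fix token by token, leaving whitespace untouched."""
    # The capturing group keeps the whitespace separators in the result,
    # so the original spacing and line breaks survive the round trip.
    parts = re.split(r"(\s+)", text)
    return "".join(fix_numeric_token(part) for part in parts)


if __name__ == "__main__":
    print(autocorrect("Total: 1O5.5O USD"))
    print(autocorrect("Date: l2/O3/2O21"))
```

A rule this simple also rewrites legitimate mixed tokens (for example, "B2B" would become "828"), so a practical filter would likely restrict corrections to known fields such as amounts or dates.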

Why choose ByteScout Text Recognition SDK?

  • Customer support is fast and detailed, as most of our customers are happy to confirm. Check our Testimonials.
  • We combine sophisticated technologies with the tools you'll find on the website, and we tailor our SDKs to your needs.
  • If you are looking for tutorials and explanations, our source code samples and documentation will give you a better understanding of how the SDK works.
  • The programming interface is user-friendly and intuitive, whether you are a beginner or an experienced developer.
  • We improve our SDKs regularly in response to requests from our users.