SDK for data extraction from PDF, scans, and images
SDK for document capture and forms processing, document recognition, and linguistic technologies and services
Feature | ByteScout PDF Extractor SDK | ABBYY FineReader |
---|---|---|
Plain text extraction support | yes | yes |
OCR (Optical Character Recognition) | yes | yes |
Images extraction | yes | yes |
Attachments extraction | yes | yes |
Text search with Regular expressions | yes | yes |
Tables detection and analysis | yes | yes |
Export to JSON, XML, CSV, TXT | yes | yes |
PDF splitting | yes | yes |
PDF merging | yes | yes |
Text removal | yes | yes |
Text replacement | yes | yes |
Sensitive data detection | yes | no |
Large PDF support | yes | yes |
Searchable and Unsearchable PDF maker | yes | yes |