SDK for data extraction from PDF, scans, and images
Enterprise Content Management System
Feature | ByteScout PDF Extractor SDK | Alfresco Content Services |
---|---|---|
Plain text extraction support | yes | yes |
OCR (Optical Character Recognition) | yes | yes |
Images extraction | yes | no |
Attachments extraction | yes | yes |
Text search with Regular expressions | yes | yes |
Tables detection and analysis | yes | no |
Export to JSON, XML, CSV, TXT | yes | no |
PDF splitting | yes | yes |
PDF merging | yes | yes |
Text removal | yes | no |
Text replacement | yes | yes |
Sensitive data detection | yes | no |
Large PDF support | yes | yes |
Searchable and Unsearchable PDF maker | yes | no |