ByteScout PDF Extractor SDK: an SDK for data extraction from PDFs, scans, and images
Wolfram Mathematica: a modern technical computing system
Feature | ByteScout PDF Extractor SDK | Wolfram Mathematica |
---|---|---|
Plain text extraction support | yes | yes |
OCR (Optical Character Recognition) | yes | yes |
Image extraction | yes | yes |
Attachment extraction | yes | yes |
Text search with regular expressions | yes | yes |
Table detection and analysis | yes | yes |
Export to JSON, XML, CSV, TXT | yes | yes |
PDF splitting | yes | no |
PDF merging | yes | yes |
Text removal | yes | yes |
Text replacement | yes | yes |
Sensitive data detection | yes | ? |
Large PDF support | yes | no |
Searchable and unsearchable PDF creation | yes | no |
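
To make two of the rows above concrete ("Text search with regular expressions" and "Export to JSON, XML, CSV, TXT"), here is a minimal sketch in plain Python. It does not use either product's API: it assumes the text has already been extracted from a PDF by some tool, and the sample text, the e-mail pattern, and the helper names `find_matches`, `to_json`, and `to_csv` are all hypothetical and chosen only for illustration.

```python
import csv
import io
import json
import re

# Hypothetical sample standing in for text already extracted from a PDF.
SAMPLE_TEXT = """Invoice 1042
Contact: jane.doe@example.com
Total due: 318.50 USD
Contact: billing@example.org
"""

# Deliberately simple illustrative pattern; robust e-mail matching is more involved.
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def find_matches(text, pattern):
    """Return one record per regex match, with its 1-based line number."""
    records = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for match in pattern.finditer(line):
            records.append({"line": line_no, "match": match.group(0)})
    return records


def to_json(records):
    """Serialize the match records as a JSON array."""
    return json.dumps(records, indent=2)


def to_csv(records):
    """Serialize the match records as CSV with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["line", "match"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


if __name__ == "__main__":
    found = find_matches(SAMPLE_TEXT, EMAIL_PATTERN)
    print(to_json(found))
    print(to_csv(found))
```

The same record list feeds both exporters, so adding another output format only requires one more serializer function.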