ByteScout PDF Extractor SDK: SDK for data extraction from PDF, scans, and images
iText: Open-source library for generating and manipulating PDF
Feature | ByteScout PDF Extractor SDK | iText |
---|---|---|
Plain text extraction support | yes | yes |
OCR (Optical Character Recognition) | yes | yes |
Image extraction | yes | yes |
Attachment extraction | yes | yes |
Text search with regular expressions | yes | yes |
Table detection and analysis | yes | yes |
Export to JSON, XML, CSV, TXT | yes | yes |
PDF splitting | yes | yes |
PDF merging | yes | yes |
Text removal | yes | yes |
Text replacement | yes | yes |
Sensitive data detection | yes | no |
Large PDF support | yes | yes |
Searchable and unsearchable PDF creation | yes | no |