- Home
- /
- Blog
- /
- ByteScout Updated its Key Products with NEW Functionalities
ByteScout Updated its Key Products with NEW Functionalities
ByteScout team has launched new versions’ updates and fixes for SDK and business tools.
ByteScout continues to fight against COVID-19 by offering our tools for rapid data extraction and document processing at NO CHARGE to any developers working on coronavirus prevention, analysis, treatment, research projects in hospitals, and non-profits. If you know someone working on these projects, please ask them to not hesitate to contact us.
Here’s the list of updated SDK for developers and business apps for end-users.
All the updates above are available in PDF.co Web API (cloud) and API Server (on-prem version of Web API).
- New column detection mode ‘ColumnDetectionMode.ContentGroupsAI’ that works better on tables without borders and on pages with multiple tables.
- Greatly improved tables detection in ‘TableDetector2’.
- Improved filtering of shadow-like text (‘ExtractShadowLikeText’ option).
- Improved the ‘LineGroupingMode.JoinOrphanedRows’.
- ‘DocumentMerger’: Improved merging of PDF forms. Now it can link fields with matching names or rename them to avoid unwanted linking. See the property ‘RenameMatchingFieldsDuringMerge’.
- ‘JSONExtractor’ and ‘XMLExtractor’ now output the page size for each page.
- All extractor classes now support the extraction of page ranges.
- Added properties ‘DetectUnderlineTextStyle’ and ‘DetectStrikeoutTextStyle’ to ‘CSVExtractor’ and ‘XLSExtractor’. They help to prevent underlined text affecting the line grouping in table cells.
- Improved background color detection for the option ‘ConsiderBackgroundColors’.
- Added property ‘NormalizeText’ to all extractors. It replaced Unicode spaces and hyphens in the extracted text with normal ‘ ‘ and ‘-‘ characters.
- ‘Remover2’: fixed handling of PDF page rotation.
- ‘Remover2’: making unsearchable now performed only for editing pages.
- ‘XMLExtractor’: Added property ‘IndentedXML’ to control indentation.
- ‘JSONExtractor’: Added property ‘IndentedJSON’ to control indentation.
- ‘Stamper’: fixed stamping of rotated pages.
- Added new OCR mode – ‘OCRMode.AutoRepairFonts’. It automatically tries to detect PDF documents with corrupted text and forces OCR font repair for them. Works only for English texts.
- Added property ‘PageSeparator’ to CSV and XLS extractors.
- ‘XLSExtractor’: improved negative numbers detection.
- ‘TextExtractor.FindAll()’ method was ignoring the case sensitivity option. Fixed now.
- Added property ‘OCRDetectLines’ that helps to detect table structure in scanned documents.
- ‘JSONExtractor’ and ‘XMLExtractor’ now outputs the number of pages in the result and the number of pages for which OCR was performed.
- Added property ‘OCRPageCount’ to extractors that contains the number of pages for which OCR was performed during the last extraction.
- ‘JSONExtractor’: Added property ‘OutputStructure’ that allows selecting a structure of output JSON.
- ‘JSONExtractor’: Added property ‘OutputTransformation’ that allows applying the JSONPath expression to the output JSON.
- Performance improvements.
- Improved parsing of PDF documents.
- Other minor fixes and improvements.
PDF Multitool 13.0.0.4253 (October 4, 2021)
- Made PDF Multitool DPI-aware. Now it looks better on high-resolution displays.
- Added row numbers to the grid in the extraction preview window.
- Added ‘All Option’ tab to all tools.
- Added ‘Classifier test tool’.
- ‘OCR Analyzer’: Added button to copy analysis results to extractors.
- Performance improvements.
- Improved parsing and rendering of PDF documents.
- Other minor fixes and improvements.
- Fixed crash in CSV export on malformed regex table without columns.
- Added ‘globalTextFilters’ parameter in the template options. It allows removing some text from extracted document text before parsing.
- Added new macros: ‘{{Email}}’, ‘{{ITIN}}’, ‘{{SSN}}’.
- Template Editor: fixed crash on adding objects if the recent template file was deleted before the app started.
- Implemented key-value regex fields.
- DocumentParser: Added support for page ranges.
- Improved ‘Classifier’.
- Improved DMY dates detection.
- Fixed false-positives of ‘{{Money}}’ macro.
- Improved PDF extraction and rendering.
- Other minor bug fixes and improvements.
- Improved rendering of PDF form fields.
- DocumentPrinter: suppressed possible printing progress dialogs.
- Performance improvements.
- Improved parsing and rendering of PDF documents.
- Other minor fixes and improvements.
PDF SDK 3.1.0.535 (October 4, 2021)
- Improved merging of PDF documents with fillable forms. New method ‘Document.MergeDocuments()’ and property ‘Document.RenameMatchingFieldsDuringMerge’ allow to control linking of form fields during the merge.
- Improved flattening.
- Fixed filling of combo boxes with compound items (label-value pairs).
- Improved parsing of PDF documents.
- Other minor fixes and improvements.
- Added support for page ranges. See overloads of methods ‘GetHTML’ and ‘SaveHtml*’.
- Added page numbers in the ‘page start’ and ‘page end’ HTML comments.
- Performance improvements.
- Improved parsing of PDF documents.
- Other minor fixes and improvements.
PDF Viewer SDK 13.0.0.4253 (October 4, 2021)
- Added ‘DisplayAttachments’ property that allows showing/hide the floating “attachments” button.
- Improved parsing and rendering of PDF documents.
- Other minor fixes and improvements.
- Fixed ISO-8601 date parsing.
- Fixed formula parsing.
- Fixed handling of shared formulas.
- Fixed black charts background in XLS format.
- Other minor fixes and improvements.
- Fixed unhandled exception on decoding timeout.
- Fixed barcode rectangle adjustment after applying scaling filters.
- Improved parsing and rendering of PDF documents.
- Other minor improvements and bug fixes.
Barcode SDK 7.3.0.1177 (October 4, 2021)
- Implemented round dots appearance for QR Code, DataMatrix, and Aztec. See ‘Barcode.RoundDots’ property.
- Minor improvements and bug fixes.
QR Code SDK 1.7.0.1178 (October 4, 2021)
- Implemented round dots appearance for QR Code. See ‘QRCode.RoundDots’ property.
- Minor improvements and bug fixes.
- Added ‘WhiteList’ and ‘BlackList’ properties that allow doing define a set of characters allowed or disallowed to be recognized from a scanned document.
- Improved parsing and rendering of PDF documents.
- Other minor improvements and bug fixes.
- Added ‘Round Dots’ advanced option for QR Code, DataMatrix, and Aztec.
- Minor improvements and bug fixes.
- Fixed barcode rectangle adjustment after applying scaling filters.
- Improved parsing and rendering of PDF documents.
- Other minor improvements and bug fixes.
- Fixed ISO-8601 date parsing.
- Fixed formula parsing.
- Fixed handling of shared formulas.
- Other minor fixes and improvements.
Since May 2019 started, we already have a splendid UPDATE for our SDK and freeware products! As always, you can trust in ByteScout professional approach...
Bytescout PDF Extractor SDK This new version 8.8.0.3015 appeared on January 22, 2018. Here are major performance improvements: The following was fixed: OCR preprocessing filters...