Our ByteScout SDK products are sunsetting as we focus on expanding new solutions.
Learn More
Important Update
ByteScout SDK Sunsetting Notice
Our ByteScout SDK products are sunsetting as we focus on our new & improved solutions.Thank you for being part of our journey, and we look forward to supporting you in this next chapter!
ByteScout Updated its Key Products with NEW Functionalities
ByteScout Updated its Key Products with NEW Functionalities
ByteScout team has launched new versions’ updates and fixes for SDK and business tools.
ByteScout continues to fight against COVID-19 by offering our tools for rapid data extraction and document processing at NO CHARGE to any developers working on coronavirus prevention, analysis, treatment, research projects in hospitals, and non-profits. If you know someone working on these projects, please ask them to not hesitate to contact us.
Here’s the list of updated SDK for developers and business apps for end-users.
New column detection mode ‘ColumnDetectionMode.ContentGroupsAI’ that works better on tables without borders and on pages with multiple tables.
Greatly improved tables detection in ‘TableDetector2’.
Improved filtering of shadow-like text (‘ExtractShadowLikeText’ option).
Improved the ‘LineGroupingMode.JoinOrphanedRows’.
‘DocumentMerger’: Improved merging of PDF forms. Now it can link fields with matching names or rename them to avoid unwanted linking. See the property ‘RenameMatchingFieldsDuringMerge’.
‘JSONExtractor’ and ‘XMLExtractor’ now output the page size for each page.
All extractor classes now support the extraction of page ranges.
Added properties ‘DetectUnderlineTextStyle’ and ‘DetectStrikeoutTextStyle’ to ‘CSVExtractor’ and ‘XLSExtractor’. They help to prevent underlined text affecting the line grouping in table cells.
Improved background color detection for the option ‘ConsiderBackgroundColors’.
Added property ‘NormalizeText’ to all extractors. It replaced Unicode spaces and hyphens in the extracted text with normal ‘ ‘ and ‘-‘ characters.
‘Remover2’: fixed handling of PDF page rotation.
‘Remover2’: making unsearchable now performed only for editing pages.
‘XMLExtractor’: Added property ‘IndentedXML’ to control indentation.
‘JSONExtractor’: Added property ‘IndentedJSON’ to control indentation.
‘Stamper’: fixed stamping of rotated pages.
Added new OCR mode – ‘OCRMode.AutoRepairFonts’. It automatically tries to detect PDF documents with corrupted text and forces OCR font repair for them. Works only for English texts.
Added property ‘PageSeparator’ to CSV and XLS extractors.
Improved merging of PDF documents with fillable forms. New method ‘Document.MergeDocuments()’ and property ‘Document.RenameMatchingFieldsDuringMerge’ allow to control linking of form fields during the merge.
Improved flattening.
Fixed filling of combo boxes with compound items (label-value pairs).
Added ‘WhiteList’ and ‘BlackList’ properties that allow doing define a set of characters allowed or disallowed to be recognized from a scanned document.
ByteScout Team of WritersByteScout has a team of professional writers proficient in different technical topics. We select the best writers to cover interesting and trending topics for our readers. We love developers and we hope our articles help you learn about programming and programmers.
ByteScout PDF Tools are now connected to Zapier! The functions available in our tools can be automated with other services on the Zapier platform. You...
Bytescout products acquired new incredible functions for smooth and productive work. As we usually post updates on the Blog, you can check previous versions here....