ByteScout team has released a set of new version updates and fixes for SDK and business tools.
ByteScout continues to fight against COVID-19 by offering our tools for rapid data extraction and document processing at NO CHARGE to any developers working on coronavirus prevention, analysis, treatment, research projects in hospitals, and non-profits. If you know someone working on these projects, please ask them to not hesitate to contact us.
Check the list of updated SDK for developers and business tools:
Added property ‘TextExtractor.FuzzySearch’ that enables a ‘fuzzy’ text search algorithm. It allows finding ‘approximately equal’ strings.
Added ‘DocumentSplitter2’ class that splits document by found text.
Added ‘CSVExtractor.NormalizeCSV’ property. It makes CSV data produced from different document pages contain the same number of columns.
Added property ‘JSONExtractor.OutputStructure’ that allows changing the structure of the generated JSON to one of the predefined variants for easier postprocessing.
Added property ‘JSONExtractor.OutputTransformation’ that allows applying JSONPath expression to the generated JSON.
Added property ‘OCRPageCount’ to extractor classes that contains a number of pages for which OCR was performed.
‘JSONExtractor’ and ‘XMLExtractor’ now add to the generated JSON and XML result the number of process pages and the number of pages for which OCR was performed.
Added property ‘OCRDetectLines’ to extractor classes that improve column detection in scanned documents.
Added property ‘ConsiderBackgroundColors’ to extractor classes that enables detection of background color under text objects. It may help to improve row and column detection in tables without borders but with color stripes.
Added properties ‘DocumentMerger.GenerateBookmarks’ and ‘DocumentMerger.BookmarkTitles’ to enable automatic generation of bookmarks pointing to the merged parts.
Improved PDF optimization in ‘DocumentSplitter’.
‘DocumentMerger’ now uses the first input document as the base for the merged document. This allows keeping document information properties and outlines.
DocumentMerger: added support for profiles.
MultimediaExtractor: added support for more media types.
‘TextExtractor.FindAll()’ method was ignoring the case sensitivity option.
Fixed issue with junk empty temporary files generated during OCR.
Added ‘AltName’ property to form fields. It contains a fixed identifier (‘Name’) of the form fields where the ID missing, duplicated, or contains invalid characters. You can use the ‘AltName’ to retrieve the field from the ‘Document.Annotations’ collection in the same way as the original ‘Name’.
Added properties ‘ListBox.SelectedIndices’ and ‘ListBox.SelectedItems’ allowing to selected multiple items in ‘ListBox’ form field.
Improved editable and not-editable ‘ComboBox’ fields appearance.
Fixed selected items’ appearance in the ‘ListBox’ form fields.
Fixed value assignment in the ‘RadioButton’ form fields.
Fixed invisible values in form fields in some cases.
Fixed digital signature appearance.
Fixed profiles parsing on platforms with non-English locale.
ByteScout Team of WritersByteScout has a team of professional writers proficient in different technical topics. We select the best writers to cover interesting and trending topics for our readers. We love developers and we hope our articles help you learn about programming and programmers.