If you want to quickly learn then these fast application programming interfaces of ByteScout PDF Suite for VB.NET plus the guideline and the VB.NET code below will help you quickly learn repair text with pdf extractor sdk. If you want to implement this functionality, you should copy and paste code below into your app using code editor. Then compile and run your application. VB.NET application implementation mostly involves various stages of the software development so even if the functionality works please check it with your data and the production environment.
On our website you may get trial version of ByteScout PDF Suite for free. Source code samples are included to help you with your VB.NET application.
Imports Bytescout.PDFExtractor Module Program Sub Main() Try Using extractor As New TextExtractor() ' Load PDF document extractor.LoadDocumentFromFile("sample.pdf") ' Set the font repairing OCR mode extractor.OCRMode = OCRMode.TextFromImagesAndVectorsAndRepairedFonts ' Set the location of OCR language data files extractor.OCRLanguageDataFolder = "c:\Program Files\Bytescout PDF Extractor SDK\ocrdata\" ' Set OCR language extractor.OCRLanguage = "eng" ' "eng" For english, "deu" For German, "fra" For French, "spa" For Spanish etc - according To files In "ocrdata" folder ' Find more language files at https://github.com/bytescout/ocrdata ' Set PDF document rendering resolution extractor.OCRResolution = 300 ' Read all text Dim allText = extractor.GetText() Console.WriteLine("Extracted Text: ") Console.WriteLine(allText) End Using Catch ex As Exception Console.WriteLine(ex.Message) End Try Console.WriteLine("Press any key to exit...") Console.ReadLine() End Sub End Module