This sample shows how to extract text page by page from a PDF in VB.NET using PDF Extractor SDK, which is part of ByteScout Data Extraction Suite. The suite bundles three SDK tools for extracting data from PDFs, scans, images, and spreadsheets: PDF Extractor SDK, Data Extraction SDK, and Barcode Reader SDK.
The VB.NET sample below loads a PDF document, reads its page count, and saves the text of each page to a separate file. Page indexes are zero-based, so the first page is written to page0.txt. To try it, copy the code into your VB.NET project, compile, and run the application. The sample is intentionally minimal; adding error handling and input checks will make it more robust.
A free trial version of ByteScout Data Extraction Suite is available on our website. The suite supports VB.NET and other programming languages.
Imports Bytescout.PDFExtractor

Class Program

    Friend Shared Sub Main(args As String())

        ' Create Bytescout.PDFExtractor.TextExtractor instance
        Dim extractor As New TextExtractor()
        extractor.RegistrationName = "demo"
        extractor.RegistrationKey = "demo"

        ' Load sample PDF document
        extractor.LoadDocumentFromFile(".\sample2.pdf")

        ' Get page count
        Dim pageCount As Integer = extractor.GetPageCount()

        For i As Integer = 0 To pageCount - 1
            Dim fileName As String = "page" & i & ".txt"

            ' Save extracted page text to file
            extractor.SavePageTextToFile(i, fileName)
        Next

        ' Cleanup
        extractor.Dispose()

        ' Open result file in default associated application (for demo purposes)
        System.Diagnostics.Process.Start(".\page1.txt")

    End Sub

End Class
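As one possible way to make the sample more robust, the sketch below checks that the input file exists, writes the page files into a separate folder, and releases the extractor in a Finally block even if extraction fails. It uses only the TextExtractor members that appear in the sample above, plus standard System.IO calls. The input path, the output folder name ".\pages", and the console messages are assumptions made for illustration, not part of the SDK.

```vbnet
Imports System
Imports System.IO
Imports Bytescout.PDFExtractor

Class Program

    Friend Shared Sub Main(args As String())

        ' Hypothetical paths chosen for this sketch; adjust them to your project
        Dim inputFile As String = ".\sample2.pdf"
        Dim outputFolder As String = ".\pages"

        ' Stop early if the input document is missing
        If Not File.Exists(inputFile) Then
            Console.WriteLine("Input file not found: " & inputFile)
            Return
        End If

        ' Creates the folder if needed; does nothing if it already exists
        Directory.CreateDirectory(outputFolder)

        Dim extractor As New TextExtractor()
        Try
            extractor.RegistrationName = "demo"
            extractor.RegistrationKey = "demo"

            extractor.LoadDocumentFromFile(inputFile)

            Dim pageCount As Integer = extractor.GetPageCount()

            ' Page indexes are zero-based, as in the original sample
            For i As Integer = 0 To pageCount - 1
                Dim fileName As String = Path.Combine(outputFolder, "page" & i & ".txt")
                extractor.SavePageTextToFile(i, fileName)
            Next

            Console.WriteLine("Saved " & pageCount & " page(s) to " & outputFolder)
        Catch ex As Exception
            ' Report the failure instead of letting the application crash
            Console.WriteLine("Extraction failed: " & ex.Message)
        Finally
            ' Release the extractor whether or not extraction succeeded
            extractor.Dispose()
        End Try

    End Sub

End Class
```

This variant does not open a result file at the end, because the original call assumes page1.txt exists, which holds only for documents with at least two pages.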