The coding tutorials are designed to help you test the features without need to write your own code. ByteScout PDF Extractor SDK can extract text by pages from PDF. It can be used from VB.NET. ByteScout PDF Extractor SDK is the Software Development Kit (SDK) that is designed to help developers with data extraction from unstructured documents like pdf, tiff, scans, images, scanned and electronic forms. The library is powered by OCR, computer vision and AI to provide unique functionality like table detection, automatic table structure extraction, data restoration, data restructuring and reconstruction. Supports PDF, TIFF, PNG, JPG images as input and can output CSV, XML, JSON formatted data. Includes full set of utilities like pdf splitter, pdf merger, searchable pdf maker.
This code snippet below for ByteScout PDF Extractor SDK works best when you need to quickly extract text by pages from PDF in your VB.NET application. In order to implement the functionality, you should copy and paste this code for VB.NET below into your code editor with your app, compile and run your application. Detailed tutorials and documentation are available along with installed ByteScout PDF Extractor SDK if you’d like to dive deeper into the topic and the details of the API.
Our website provides trial version of ByteScout PDF Extractor SDK for free. It also includes documentation and source code samples.
Imports Bytescout.PDFExtractor Class Program Friend Shared Sub Main(args As String()) ' Create Bytescout.PDFExtractor.TextExtractor instance Dim extractor As New TextExtractor() extractor.RegistrationName = "demo" extractor.RegistrationKey = "demo" ' Load sample PDF document extractor.LoadDocumentFromFile(".\sample2.pdf") ' Get page count Dim pageCount As Integer = extractor.GetPageCount() For i As Integer = 0 To pageCount - 1 Dim fileName As String = "page" & i & ".txt" ' Save extracted page text to file extractor.SavePageTextToFile(i, fileName) Next ' Cleanup extractor.Dispose() ' Open result file in default associated application (for demo purposes) System.Diagnostics.Process.Start(".\page1.txt") End Sub End Class