ByteScout PDF Extractor SDK – VB.NET – Extract Text By Columns from PDF

Home
/
Articles
/
ByteScout PDF Extractor SDK – VB.NET – Extract Text By Columns from PDF

printable version:
ByteScout-PDF-Extractor-SDK-VB-NET-Extract-Text-By-Columns-from-PDF.pdf

How to extract text by columns from PDF in VB.NET and ByteScout PDF Extractor SDK

The tutorial below will demonstrate how to extract text by columns from PDF in VB.NET

The code below will help you to implement an VB.NET app to extract text by columns from PDF. ByteScout PDF Extractor SDK can extract text by columns from PDF. It can be used from VB.NET. ByteScout PDF Extractor SDK is the SDK that helps developers to extract data from unstructured documents, pdf, images, scanned and electronic forms. Includes AI functions like automatic table detection, automatic table extraction and restructuring, text recognition and text restoration from pdf and scanned documents. Includes PDF to CSV, PDF to XML, PDF to JSON, PDF to searchable PDF functions as well as methods for low level data extraction.

This rich sample source code in VB.NET for ByteScout PDF Extractor SDK includes the number of functions and options you should do calling the API to extract text by columns from PDF. Just copy and paste the code into your VB.NET application’s code and follow the instruction. Code testing will allow the function to be tested and work properly with your data.

Free trial version of ByteScout PDF Extractor SDK is available for download from our website. Get it to try other source code samples for VB.NET.

On-demand (REST Web API) version:
Web API (on-demand version)

On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)

Program.vb

      Imports Bytescout.PDFExtractor

Class Program
	Friend Shared Sub Main(args As String())

		' Create Bytescout.PDFExtractor.TextExtractor instance
		Dim extractor As New TextExtractor()
		extractor.RegistrationName = "demo"
		extractor.RegistrationKey = "demo"

		' Load sample PDF document
        extractor.LoadDocumentFromFile(".\columns.pdf")

		' Extract text by columns (useful if PDF document is designed in column layout like a newspaper)
		extractor.ExtractColumnByColumn = true

		' Save extracted text to file
		extractor.SaveTextToFile(".\result.txt")

		' Cleanup
		extractor.Dispose()

		' Open result file in default associated application (for demo purposes)
		System.Diagnostics.Process.Start(".\result.txt")
		
	End Sub
End Class