ByteScout PDF Extractor SDK – VB.NET – Find Keyword in PDF And Extract Text

Home
/
Articles
/
ByteScout PDF Extractor SDK – VB.NET – Find Keyword in PDF And Extract Text

printable version:
ByteScout-PDF-Extractor-SDK-VB-NET-Find-Keyword-in-PDF-And-Extract-Text.pdf

How to find keyword in PDF and extract text in VB.NET and ByteScout PDF Extractor SDK

The tutorial below will demonstrate how to find keyword in PDF and extract text in VB.NET

The coding tutorials are designed to help you test the features without need to write your own code. ByteScout PDF Extractor SDK is the Software Development Kit (SDK) that is designed to help developers with data extraction from unstructured documents like pdf, tiff, scans, images, scanned and electronic forms. The library is powered by OCR, computer vision and AI to provide unique functionality like table detection, automatic table structure extraction, data restoration, data restructuring and reconstruction. Supports PDF, TIFF, PNG, JPG images as input and can output CSV, XML, JSON formatted data. Includes full set of utilities like pdf splitter, pdf merger, searchable pdf maker. It can find keyword in PDF and extract text in VB.NET.

You will save a lot of time on writing and testing code as you may just take the VB.NET code from ByteScout PDF Extractor SDK for find keyword in PDF and extract text below and use it in your application. Follow the instructions from the scratch to work and copy the VB.NET code. Enjoy writing a code with ready-to-use sample VB.NET codes.

Trial version of ByteScout PDF Extractor SDK can be downloaded for free from our website. It also includes source code samples for VB.NET and other programming languages.

On-demand (REST Web API) version:
Web API (on-demand version)

On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)

Program.vb

      Imports System.Drawing
Imports Bytescout.PDFExtractor

Namespace FindText

	Class Program

		Friend Shared Sub Main(args As String())

            ' Create Bytescout.PDFExtractor.TextExtractor instance
			Dim extractor As New TextExtractor()
			extractor.RegistrationName = "demo"
			extractor.RegistrationKey = "demo"

			' Load sample PDF document
			extractor.LoadDocumentFromFile(".\sample2.pdf")

			Dim pageCount As Integer = extractor.GetPageCount()

			' Search each page for some keyword 
			For i As Integer = 0 To pageCount - 1
				If extractor.Find(i, "References", False) Then
					' If page contains the keyword, extract a text from it.
					' For demonstration we'll extract the text from top part of the page only
					extractor.SetExtractionArea(0, 0, 600, 200)
					Dim text As String = extractor.GetTextFromPage(i)
					Console.WriteLine(text)
				End If
			Next

			' Cleanup
			extractor.Dispose()

			Console.WriteLine()
			Console.WriteLine("Press any key to continue...")
			Console.ReadLine()

		End Sub
		
	End Class

End Namespace