ByteScout PDF Extractor SDK - VB.NET - Find Keyword in PDF And Extract Text - ByteScout

ByteScout PDF Extractor SDK – VB.NET – Find Keyword in PDF And Extract Text

  • Home
  • /
  • Articles
  • /
  • ByteScout PDF Extractor SDK – VB.NET – Find Keyword in PDF And Extract Text

How to find keyword in PDF and extract text in VB.NET and ByteScout PDF Extractor SDK

The tutorial below will demonstrate how to find keyword in PDF and extract text in VB.NET

The coding tutorials are designed to help you test the features without need to write your own code. ByteScout PDF Extractor SDK is the Software Development Kit (SDK) that is designed to help developers with data extraction from unstructured documents like pdf, tiff, scans, images, scanned and electronic forms. The library is powered by OCR, computer vision and AI to provide unique functionality like table detection, automatic table structure extraction, data restoration, data restructuring and reconstruction. Supports PDF, TIFF, PNG, JPG images as input and can output CSV, XML, JSON formatted data. Includes full set of utilities like pdf splitter, pdf merger, searchable pdf maker. It can find keyword in PDF and extract text in VB.NET.

You will save a lot of time on writing and testing code as you may just take the VB.NET code from ByteScout PDF Extractor SDK for find keyword in PDF and extract text below and use it in your application. Follow the instructions from the scratch to work and copy the VB.NET code. Enjoy writing a code with ready-to-use sample VB.NET codes.

Trial version of ByteScout PDF Extractor SDK can be downloaded for free from our website. It also includes source code samples for VB.NET and other programming languages.

Try ByteScout PDF Extractor SDK today: Get 60 Day Free Trial or sign up for Web API

Program.vb
      
Imports System.Drawing Imports Bytescout.PDFExtractor Namespace FindText Class Program Friend Shared Sub Main(args As String()) ' Create Bytescout.PDFExtractor.TextExtractor instance Dim extractor As New TextExtractor() extractor.RegistrationName = "demo" extractor.RegistrationKey = "demo" ' Load sample PDF document extractor.LoadDocumentFromFile(".\sample2.pdf") Dim pageCount As Integer = extractor.GetPageCount() ' Search each page for some keyword For i As Integer = 0 To pageCount - 1 If extractor.Find(i, "References", False) Then ' If page contains the keyword, extract a text from it. ' For demonstration we'll extract the text from top part of the page only extractor.SetExtractionArea(0, 0, 600, 200) Dim text As String = extractor.GetTextFromPage(i) Console.WriteLine(text) End If Next ' Cleanup extractor.Dispose() Console.WriteLine() Console.WriteLine("Press any key to continue...") Console.ReadLine() End Sub End Class End Namespace

Try ByteScout PDF Extractor SDK today: 60 Day Free Trial (on-premise version) or sign up for Web API (on demand version)

VIDEO

ON-PREMISE VERSION

Get 60 Day Free Trial or Visit ByteScout PDF Extractor SDK page

Explore ByteScout PDF Extractor SDK documentation

WEB API

Sign Up for free Web API key

Explore Web API Documentation

Tutorials:

prev
next