ByteScout Data Extraction Suite - VB.NET - Find text in pdf with regex with pdf extractor sdk - ByteScout

ByteScout Data Extraction Suite – VB.NET – Find text in pdf with regex with pdf extractor sdk

  • Home
  • /
  • Articles
  • /
  • ByteScout Data Extraction Suite – VB.NET – Find text in pdf with regex with pdf extractor sdk

How to find text in pdf with regex with pdf extractor sdk in VB.NET and ByteScout Data Extraction Suite

Learn to find text in pdf with regex with pdf extractor sdk in VB.NET

An easy to understand sample source code to learn how to find text in pdf with regex with pdf extractor sdk in VB.NET ByteScout Data Extraction Suite is the set that includes 3 SDK products for data extraction from PDF, scans, images and from spreadsheets: PDF Extractor SDK, Data Extraction SDK, Barcode Reader SDK and you can use it to find text in pdf with regex with pdf extractor sdk with VB.NET.

Want to quickly learn? This fast application programming interfaces of ByteScout Data Extraction Suite for VB.NET plus the guidelines and the code below will help you quickly learn how to find text in pdf with regex with pdf extractor sdk. Follow the instructions from scratch to work and copy the VB.NET code. Use of ByteScout Data Extraction Suite in VB.NET is also described in the documentation included along with the product.

ByteScout Data Extraction Suite free trial version is available on our website. VB.NET and other programming languages are supported.

On-demand (REST Web API) version:
 Web API (on-demand version)

On-premise offline SDK for Windows:
 60 Day Free Trial (on-premise)

Program.vb
      
Imports System.Drawing Imports Bytescout.PDFExtractor Class Program Friend Shared Sub Main(args As String()) ' Create Bytescout.PDFExtractor.TextExtractor instance Dim extractor As New TextExtractor() extractor.RegistrationName = "demo" extractor.RegistrationKey = "demo" ' Load sample PDF document extractor.LoadDocumentFromFile(".\Invoice.pdf") extractor.RegexSearch = True ' Enable the regular expressions Dim pageCount As Integer = extractor.GetPageCount() ' Search through pages For i As Integer = 0 To pageCount - 1 ' Search dates in format 12/31/1999 Dim regexPattern As String = "[0-9]{2}/[0-9]{2}/[0-9]{4}" ' See the complete regular expressions reference at https://msdn.microsoft.com/en-us/library/az24scfc(v=vs.110).aspx ' Search each page for the pattern If extractor.Find(i, regexPattern, False) Then Do Console.WriteLine("") Console.WriteLine(("Found on page " & i & " at location ") + extractor.FoundText.Bounds.ToString()) Console.WriteLine("") ' Iterate through each element in the found text For Each element As ISearchResultElement In extractor.FoundText.Elements Console.WriteLine(" Text: " + element.Text) Console.WriteLine(" Font is bold: " + element.FontIsBold.ToString()) Console.WriteLine(" Font is italic:" + element.FontIsItalic.ToString()) Console.WriteLine(" Font name: " + element.FontName) Console.WriteLine(" Font size:" + element.FontSize.ToString()) Console.WriteLine(" Font color:" + element.FontColor.ToString()) Console.WriteLine() Next Loop While extractor.FindNext() End If Next ' Cleanup extractor.Dispose() Console.WriteLine() Console.WriteLine("Press any key to continue...") Console.ReadLine() End Sub End Class

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page

Explore ByteScout Data Extraction Suite Documentation

Explore Samples

Sign Up for ByteScout Data Extraction Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

VIDEO

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page

Explore ByteScout Data Extraction Suite Documentation

Explore Samples

Sign Up for ByteScout Data Extraction Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

Tutorials:

prev
next