ByteScout Premium Suite - VB.NET - Find text in pdf with regex with pdf extractor sdk - ByteScout

ByteScout Premium Suite – VB.NET – Find text in pdf with regex with pdf extractor sdk

  • Home
  • /
  • Articles
  • /
  • ByteScout Premium Suite – VB.NET – Find text in pdf with regex with pdf extractor sdk

How to find text in pdf with regex with pdf extractor sdk in VB.NET and ByteScout Premium Suite

Step-by-step tutorial on how to find text in pdf with regex with pdf extractor sdk in VB.NET

The documentation is designed for a specific purpose to help you to apply the features on your side. Want to find text in pdf with regex with pdf extractor sdk in your VB.NET app? ByteScout Premium Suite is designed for it. ByteScout Premium Suite is the set that includes 12 SDK products from ByteScout including tools and components for PDF, barcodes, spreadsheets, screen video recording.

These VB.NET code samples for VB.NET guide developers to speed up coding of the application when using ByteScout Premium Suite. Just copy and paste the code into your VB.NET application’s code and follow the instructions. Use of ByteScout Premium Suite in VB.NET is also described in the documentation included along with the product.

ByteScout provides the free trial version of ByteScout Premium Suite along with the documentation and source code samples.

On-demand (REST Web API) version:
 Web API (on-demand version)

On-premise offline SDK for Windows:
 60 Day Free Trial (on-premise)

Program.vb
      
Imports System.Drawing Imports Bytescout.PDFExtractor Class Program Friend Shared Sub Main(args As String()) ' Create Bytescout.PDFExtractor.TextExtractor instance Dim extractor As New TextExtractor() extractor.RegistrationName = "demo" extractor.RegistrationKey = "demo" ' Load sample PDF document extractor.LoadDocumentFromFile(".\Invoice.pdf") extractor.RegexSearch = True ' Enable the regular expressions Dim pageCount As Integer = extractor.GetPageCount() ' Search through pages For i As Integer = 0 To pageCount - 1 ' Search dates in format 12/31/1999 Dim regexPattern As String = "[0-9]{2}/[0-9]{2}/[0-9]{4}" ' See the complete regular expressions reference at https://msdn.microsoft.com/en-us/library/az24scfc(v=vs.110).aspx ' Search each page for the pattern If extractor.Find(i, regexPattern, False) Then Do Console.WriteLine("") Console.WriteLine(("Found on page " & i & " at location ") + extractor.FoundText.Bounds.ToString()) Console.WriteLine("") ' Iterate through each element in the found text For Each element As ISearchResultElement In extractor.FoundText.Elements Console.WriteLine(" Text: " + element.Text) Console.WriteLine(" Font is bold: " + element.FontIsBold.ToString()) Console.WriteLine(" Font is italic:" + element.FontIsItalic.ToString()) Console.WriteLine(" Font name: " + element.FontName) Console.WriteLine(" Font size:" + element.FontSize.ToString()) Console.WriteLine(" Font color:" + element.FontColor.ToString()) Console.WriteLine() Next Loop While extractor.FindNext() End If Next ' Cleanup extractor.Dispose() Console.WriteLine() Console.WriteLine("Press any key to continue...") Console.ReadLine() End Sub End Class

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Premium Suite Home Page

Explore ByteScout Premium Suite Documentation

Explore Samples

Sign Up for ByteScout Premium Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

VIDEO

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Premium Suite Home Page

Explore ByteScout Premium Suite Documentation

Explore Samples

Sign Up for ByteScout Premium Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

Tutorials:

prev
next