ByteScout Premium Suite - VB.NET - Pdf invoice parsing with pdf extractor sdk - ByteScout

ByteScout Premium Suite – VB.NET – Pdf invoice parsing with pdf extractor sdk

  • Home
  • /
  • Articles
  • /
  • ByteScout Premium Suite – VB.NET – Pdf invoice parsing with pdf extractor sdk

pdf invoice parsing with pdf extractor sdk in VB.NET using ByteScout Premium Suite

Learn to code in VB.NET to make pdf invoice parsing with pdf extractor sdk with this simple How-To tutorial

These source code samples are assembled by their programming language and functions they use. Pdf invoice parsing with pdf extractor sdk in VB.NET can be applied with ByteScout Premium Suite. ByteScout Premium Suite is the bundle that includes twelve SDK products from ByteScout including tools and components for PDF, barcodes, spreadsheets, screen video recording.

If you want to quickly learn then these fast application programming interfaces of ByteScout Premium Suite for VB.NET plus the guideline and the VB.NET code below will help you quickly learn pdf invoice parsing with pdf extractor sdk. If you want to implement this functionality, you should copy and paste code below into your app using code editor. Then compile and run your application. Use of ByteScout Premium Suite in VB.NET is also described in the documentation included along with the product.

ByteScout Premium Suite free trial version is available for download from our website. Free trial also includes programming tutorials along with source code samples.

On-demand (REST Web API) version:
 Web API (on-demand version)

On-premise offline SDK for Windows:
 60 Day Free Trial (on-premise)

Module1.vb
      
Imports System.Drawing Imports Bytescout.PDFExtractor Module Module1 Sub Main() ' Create TextExtractor instance Dim textExtractor = New TextExtractor("demo", "demo") textExtractor.WordMatchingMode = WordMatchingMode.ExactMatch ' Set exact search (default is SmartSearch that works like in Adobe Reader) ' Create XMLExtractor instance Dim xmlExtractor = New XMLExtractor("demo", "demo") ' Load document textExtractor.LoadDocumentFromFile("Invoice.pdf") xmlExtractor.LoadDocumentFromFile("Invoice.pdf") ' Results Dim invoiceNo = String.Empty Dim invoiceDate = String.Empty Dim total = String.Empty Dim tableData = String.Empty ' Iterate pages For i As Integer = 0 To textExtractor.GetPageCount() - 1 Dim pageRectangle = textExtractor.GetPageRectangle(i) Dim tableRect = New RectangleF(0, 0, pageRectangle.Width, 0) ' Search for "Invoice No." If textExtractor.Find(i, "Invoice No.", False) Then ' Get the found text rectangle Dim textRect = textExtractor.FoundText.Bounds ' Assume the text at right is the invoice number. ' Shift the rectangle to the right: textRect.X = textRect.Right textRect.Width = pageRectangle.Right - textRect.Left ' Set the extraction region and extract the text textExtractor.SetExtractionArea(textRect) invoiceNo = textExtractor.GetTextFromPage(i).Trim() End If ' Search for "Invoice Date" and extract text at right If textExtractor.Find(i, "Invoice Date", False) Then Dim textRect = textExtractor.FoundText.Bounds textRect.X = textRect.Right textRect.Width = pageRectangle.Right - textRect.Left textExtractor.SetExtractionArea(textRect) invoiceDate = textExtractor.GetTextFromPage(i).Trim() End If ' Search for "Quantity" keyword to detect the top of the tabular data rectangle If textExtractor.Find(i, "Quantity", False) Then ' Keep the top table coordinate tableRect.Y = textExtractor.FoundText.Bounds.Top ' use textRect.Bottom if you want to skip column headers End If ' Search and extract "TOTAL" (it will be also the bottom of tabular data rectangle) If textExtractor.Find(i, "TOTAL", True) Then ' case sensitive! Dim textRect = textExtractor.FoundText.Bounds textRect.X = textRect.Right textRect.Width = pageRectangle.Right - textRect.Left textExtractor.SetExtractionArea(textRect) total = textExtractor.GetTextFromPage(i).Trim() ' Calculate the table height tableRect.Height = textRect.Top - tableRect.Top End If ' Extract tabular data using XMLExtractor If tableRect.Height > 0 Then xmlExtractor.SetExtractionArea(tableRect) tableData = xmlExtractor.GetXMLFromPage(i) End If Next ' Display extracted data Console.WriteLine("Invoice No.: " + invoiceNo) Console.WriteLine("Invoice Date: " + invoiceDate) Console.WriteLine("TOTAL: " + total) Console.WriteLine("Table Data: ") Console.WriteLine(tableData) ' Cleanup textExtractor.Dispose() xmlExtractor.Dispose() Console.WriteLine() Console.WriteLine("Press any key...") Console.ReadKey() End Sub End Module

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Premium Suite Home Page

Explore ByteScout Premium Suite Documentation

Explore Samples

Sign Up for ByteScout Premium Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

VIDEO

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Premium Suite Home Page

Explore ByteScout Premium Suite Documentation

Explore Samples

Sign Up for ByteScout Premium Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

Tutorials:

prev
next