ByteScout PDF Extractor SDK - VBScript - Find Text in PDF - ByteScout

ByteScout PDF Extractor SDK – VBScript – Find Text in PDF

  • Home
  • /
  • Articles
  • /
  • ByteScout PDF Extractor SDK – VBScript – Find Text in PDF

How to find text in PDF in VBScript with ByteScout PDF Extractor SDK

How to find text in PDF in VBScript

With this source code sample you may quickly learn how to find text in PDF in VBScript. What is ByteScout PDF Extractor SDK? It is the SDK that helps developers to extract data from unstructured documents, pdf, images, scanned and electronic forms. Includes AI functions like automatic table detection, automatic table extraction and restructuring, text recognition and text restoration from pdf and scanned documents. Includes PDF to CSV, PDF to XML, PDF to JSON, PDF to searchable PDF functions as well as methods for low level data extraction. It can help you to find text in PDF in your VBScript application.

This rich sample source code in VBScript for ByteScout PDF Extractor SDK includes the number of functions and options you should do calling the API to find text in PDF. This VBScript sample code is all you need for your app. Just copy and paste the code, add references (if needs to) and you are all set! Use of ByteScout PDF Extractor SDK in VBScript is also explained in the documentation included along with the product.

Free trial version of ByteScout PDF Extractor SDK is available on our website. Documentation and source code samples are included.

Try ByteScout PDF Extractor SDK today: Get 60 Day Free Trial or sign up for Web API

FindTextAndGetCoordinates.vbs
      
' Create Bytescout.PDFExtractor.TextExtractor object Set extractor = CreateObject("Bytescout.PDFExtractor.TextExtractor") extractor.RegistrationName = "demo" extractor.RegistrationKey = "demo" ' Load sample PDF document extractor.LoadDocumentFromFile("..\..\sample1.pdf") ' Set the matching mode: ' 0 = WordMatchingMode.None - treats the search string as substring; ' 1 = WordMatchingMode.SmartMatch - will find the word in various forms (like Adobe Reader); ' 2 = WordMatchingMode.ExactMatch - treats the search string as separate word. extractor.WordMatchingMode = 2 ' Get page count pageCount = extractor.GetPageCount() For i = 0 To PageCount - 1 If extractor.Find(i, "ipsum", false) Then ' parameters are: page index, string to find, case sensitivity. Do foundMessage = "Found word 'ipsum' on page #" & CStr(i) & " at { " & _ "x = " & CStr(extractor.FoundText.Left) & "; " & _ "y = " & CStr(extractor.FoundText.Top) & "; " & _ "width = " & CStr(extractor.FoundText.Width) & "; " & _ "height = " & CStr(extractor.FoundText.Height) & " }" elementInfo = "" ' Iterate through elements of the found text object For j = 0 to extractor.FoundText.ElementCount - 1 Set element = extractor.FoundText.GetElement(j) elementInfo = elementInfo & "Element #" & CStr(j) & " at { x = " & CStr(element.Left) & "; y = " & CStr(element.Top) & "; width = " & CStr(element.Width) & "; height = " & CStr(element.Height) & vbCRLF elementInfo = elementInfo & "Text: " & CStr(element.Text) & vbCRLF elementInfo = elementInfo & "Font is bold: " & CStr(element.FontIsBold) & vbCRLF elementInfo = elementInfo & "Font is italic: " & CStr(element.FontIsItalic) & vbCRLF elementInfo = elementInfo & "Font name: " & CStr(element.FontName) & vbCRLF elementInfo = elementInfo & "Font size: " & CStr(element.FontSize) & vbCRLF elementInfo = elementInfo & "Font color (as OLE_COLOR): " & CStr(element.FontColorAsOleColor) & vbCRLF & vbCRLF Next WScript.Echo foundMessage & vbCRLF & vbCRLF & elementInfo Loop While extractor.FindNext End If Next WScript.Echo "Done" Set extractor = Nothing

Try ByteScout PDF Extractor SDK today: 60 Day Free Trial (on-premise version) or sign up for Web API (on demand version)

VIDEO

ON-PREMISE VERSION

Get 60 Day Free Trial or Visit ByteScout PDF Extractor SDK page

Explore ByteScout PDF Extractor SDK documentation

WEB API

Sign Up for free Web API key

Explore Web API Documentation

Tutorials:

prev
next