We made thousands of pre-made source code pieces for easy implementation in your own programming projects. What is ByteScout Data Extraction Suite? It is the set that includes 3 SDK products for data extraction from PDF, scans, images and from spreadsheets: PDF Extractor SDK, Data Extraction SDK, Barcode Reader SDK. It can help you to find hyphenated text in pdf with pdf extractor sdk in your VBScript application.
These VBScript code samples for VBScript guide developers to speed up coding of the application when using ByteScout Data Extraction Suite. IF you want to implement the functionality, just copy and paste this code for VBScript below into your code editor with your app, compile and run your application. Further improvement of the code will make it more robust.
You can download free trial version of ByteScout Data Extraction Suite from our website to see and try many others source code samples for VBScript.
On-demand (REST Web API) version:
Web API (on-demand version)
On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)
' Create Bytescout.PDFExtractor.TextExtractor object
Set extractor = CreateObject("Bytescout.PDFExtractor.TextExtractor")
extractor.RegistrationName = "demo"
extractor.RegistrationKey = "demo"
' Load sample PDF document
extractor.LoadDocumentFromFile("..\..\words-with-hyphens.pdf")
' Set the matching mode:
' 0 = WordMatchingMode.None - treats the search string as substring;
' 1 = WordMatchingMode.SmartMatch - will find the word in various forms (like Adobe Reader);
' 2 = WordMatchingMode.ExactMatch - treats the search string as separate word.
extractor.WordMatchingMode = 1
' Get page count
pageCount = extractor.GetPageCount()
For i = 0 To PageCount - 1
If extractor.Find(i, "hyphen", false) Then ' parameters are: page index, string to find, case sensitivity.
Do
foundMessage = "Found substring 'hyphen' on page #" & CStr(i) & " at { " & _
"x = " & CStr(extractor.FoundText.Left) & "; " & _
"y = " & CStr(extractor.FoundText.Top) & "; " & _
"width = " & CStr(extractor.FoundText.Width) & "; " & _
"height = " & CStr(extractor.FoundText.Height) & " }"
elementInfo = ""
' Iterate through elements of the found text object
For j = 0 to extractor.FoundText.ElementCount - 1
Set element = extractor.FoundText.GetElement(j)
elementInfo = elementInfo & "Element #" & CStr(j) & " at { x = " & CStr(element.Left) & "; y = " & CStr(element.Top) & "; width = " & CStr(element.Width) & "; height = " & CStr(element.Height) & vbCRLF
elementInfo = elementInfo & "Text: " & CStr(element.Text) & vbCRLF
elementInfo = elementInfo & "Font is bold: " & CStr(element.FontIsBold) & vbCRLF
elementInfo = elementInfo & "Font is italic: " & CStr(element.FontIsItalic) & vbCRLF
elementInfo = elementInfo & "Font name: " & CStr(element.FontName) & vbCRLF
elementInfo = elementInfo & "Font size: " & CStr(element.FontSize) & vbCRLF
elementInfo = elementInfo & "Font color (as OLE_COLOR): " & CStr(element.FontColorAsOleColor) & vbCRLF & vbCRLF
Next
WScript.Echo foundMessage & vbCRLF & vbCRLF & elementInfo
Loop While extractor.FindNext
End If
Next
WScript.Echo "Done"
Set extractor = Nothing
60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page
Explore ByteScout Data Extraction Suite Documentation
Explore Samples
Sign Up for ByteScout Data Extraction Suite Online Training
Get Your API Key
Explore Web API Docs
Explore Web API Samples
60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page
Explore ByteScout Data Extraction Suite Documentation
Explore Samples
Sign Up for ByteScout Data Extraction Suite Online Training
Get Your API Key
Explore Web API Docs
Explore Web API Samples
also available as: