We made thousands of pre-made source code pieces for easy implementation in your own programming projects. What is ByteScout Data Extraction Suite? It is the set that includes 3 SDK products for data extraction from PDF, scans, images and from spreadsheets: PDF Extractor SDK, Data Extraction SDK, Barcode Reader SDK. It can help you to find hyphenated text in pdf with pdf extractor sdk in your VBScript application.
These VBScript code samples for VBScript guide developers to speed up coding of the application when using ByteScout Data Extraction Suite. IF you want to implement the functionality, just copy and paste this code for VBScript below into your code editor with your app, compile and run your application. Further improvement of the code will make it more robust.
You can download free trial version of ByteScout Data Extraction Suite from our website to see and try many others source code samples for VBScript.
On-demand (REST Web API) version:
Web API (on-demand version)
On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)
' Create Bytescout.PDFExtractor.TextExtractor object Set extractor = CreateObject("Bytescout.PDFExtractor.TextExtractor") extractor.RegistrationName = "demo" extractor.RegistrationKey = "demo" ' Load sample PDF document extractor.LoadDocumentFromFile("..\..\words-with-hyphens.pdf") ' Set the matching mode: ' 0 = WordMatchingMode.None - treats the search string as substring; ' 1 = WordMatchingMode.SmartMatch - will find the word in various forms (like Adobe Reader); ' 2 = WordMatchingMode.ExactMatch - treats the search string as separate word. extractor.WordMatchingMode = 1 ' Get page count pageCount = extractor.GetPageCount() For i = 0 To PageCount - 1 If extractor.Find(i, "hyphen", false) Then ' parameters are: page index, string to find, case sensitivity. Do foundMessage = "Found substring 'hyphen' on page #" & CStr(i) & " at { " & _ "x = " & CStr(extractor.FoundText.Left) & "; " & _ "y = " & CStr(extractor.FoundText.Top) & "; " & _ "width = " & CStr(extractor.FoundText.Width) & "; " & _ "height = " & CStr(extractor.FoundText.Height) & " }" elementInfo = "" ' Iterate through elements of the found text object For j = 0 to extractor.FoundText.ElementCount - 1 Set element = extractor.FoundText.GetElement(j) elementInfo = elementInfo & "Element #" & CStr(j) & " at { x = " & CStr(element.Left) & "; y = " & CStr(element.Top) & "; width = " & CStr(element.Width) & "; height = " & CStr(element.Height) & vbCRLF elementInfo = elementInfo & "Text: " & CStr(element.Text) & vbCRLF elementInfo = elementInfo & "Font is bold: " & CStr(element.FontIsBold) & vbCRLF elementInfo = elementInfo & "Font is italic: " & CStr(element.FontIsItalic) & vbCRLF elementInfo = elementInfo & "Font name: " & CStr(element.FontName) & vbCRLF elementInfo = elementInfo & "Font size: " & CStr(element.FontSize) & vbCRLF elementInfo = elementInfo & "Font color (as OLE_COLOR): " & CStr(element.FontColorAsOleColor) & vbCRLF & vbCRLF Next WScript.Echo foundMessage & vbCRLF & vbCRLF & elementInfo Loop While extractor.FindNext End If Next WScript.Echo "Done" Set extractor = Nothing
60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page
Explore ByteScout Data Extraction Suite Documentation
Explore Samples
Sign Up for ByteScout Data Extraction Suite Online Training
Get Your API Key
Explore Web API Docs
Explore Web API Samples
60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page
Explore ByteScout Data Extraction Suite Documentation
Explore Samples
Sign Up for ByteScout Data Extraction Suite Online Training
Get Your API Key
Explore Web API Docs
Explore Web API Samples
also available as: