ByteScout PDF Suite – VB.NET – Convert PDF to XML with images with PDF Extractor SDK

How to convert PDF to XML with images using PDF Extractor SDK in VB.NET and ByteScout PDF Suite

Step-by-step tutorial on how to convert PDF to XML with images using PDF Extractor SDK in VB.NET

The sample source code below shows how to convert PDF to XML with images using PDF Extractor SDK in VB.NET. PDF Extractor SDK is part of ByteScout PDF Suite, a bundle of six SDK products for working with PDF, from generating rich PDF reports to extracting data from PDF documents and converting them to HTML. The bundle includes PDF (Generator) SDK, PDF Renderer SDK, PDF Extractor SDK, PDF to HTML SDK, PDF Viewer SDK and PDF Generator SDK for JavaScript.

Want to save time? Instead of writing and testing the conversion code from scratch, you can take the VB.NET code below and use it in your application: copy and paste it into your VB.NET project, then run your app. Using ByteScout PDF Suite from VB.NET is also described in the documentation included with the product.

A free trial version of ByteScout PDF Suite is available from our website. It includes documentation and source code samples.

On-demand (REST Web API) version:
Web API (on-demand version)

On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)

Module1.vb

Imports Bytescout.PDFExtractor

Namespace PDF2XML

    Class Program

        Shared Sub Main(ByVal args As String())

            ' Create Bytescout.PDFExtractor.XMLExtractor instance
            Dim extractor As New XMLExtractor()
            extractor.RegistrationName = "demo"
            extractor.RegistrationKey = "demo"

            ' Load sample PDF document
            extractor.LoadDocumentFromFile("sample1.pdf")

            ' Uncomment this line to get rid of empty nodes in XML
            'extractor.PreserveFormattingOnTextExtraction = False

            ' Set output image format
            extractor.ImageFormat = OutputImageFormat.PNG

            ' Save images to external files
            extractor.SaveImages = ImageHandling.OuterFile
            extractor.ImageFolder = "images" ' Folder for external images
            extractor.SaveXMLToFile("result_with_external_images.xml")

            ' Embed images into XML as Base64 encoded string
            extractor.SaveImages = ImageHandling.Embed
            extractor.SaveXMLToFile("result_with_embedded_images.xml")

            ' Cleanup: release the resources held by the extractor
            extractor.Dispose()

        End Sub

    End Class

End Namespace
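
After running the sample, it can help to check what was actually written. The following is a minimal sketch that uses only standard .NET classes (System.Xml.Linq and System.IO), not the ByteScout API. It makes no assumptions about the element names in the generated XML: it counts elements by name and reports how many PNG files are in the image folder. The module name InspectExtractedXml and the function CountElementsByName are hypothetical names chosen for illustration. The sketch also assumes that the "images" folder and the XML file are created relative to the current working directory, which may not hold in your setup.

```vbnet
Imports System
Imports System.Collections.Generic
Imports System.IO
Imports System.Xml.Linq

' Hypothetical helper module, not part of ByteScout PDF Suite
Module InspectExtractedXml

    ' Counts elements by local name so the structure of an
    ' unfamiliar XML file can be seen at a glance.
    Function CountElementsByName(ByVal doc As XDocument) As Dictionary(Of String, Integer)
        Dim counts As New Dictionary(Of String, Integer)()
        For Each element As XElement In doc.Descendants()
            Dim name As String = element.Name.LocalName
            If counts.ContainsKey(name) Then
                counts(name) += 1
            Else
                counts(name) = 1
            End If
        Next
        Return counts
    End Function

    Sub Main()
        ' File and folder names are taken from the sample above;
        ' their location relative to the working directory is an assumption.
        Dim xmlPath As String = "result_with_external_images.xml"
        Dim imageFolder As String = "images"

        If File.Exists(xmlPath) Then
            Dim doc As XDocument = XDocument.Load(xmlPath)
            For Each pair As KeyValuePair(Of String, Integer) In CountElementsByName(doc)
                Console.WriteLine("{0}: {1}", pair.Key, pair.Value)
            Next
        Else
            Console.WriteLine("XML file not found: " & xmlPath)
        End If

        If Directory.Exists(imageFolder) Then
            ' The sample sets ImageFormat to PNG, so look for *.png files
            Dim pngCount As Integer = Directory.GetFiles(imageFolder, "*.png").Length
            Console.WriteLine("PNG files in '{0}': {1}", imageFolder, pngCount)
        Else
            Console.WriteLine("Image folder not found: " & imageFolder)
        End If
    End Sub

End Module
```

Running the same check against result_with_embedded_images.xml (by changing xmlPath) is one way to compare the two outputs without relying on any particular XML schema.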