ByteScout PDF Extractor SDK – C# – PDF To JSON With Images

PDF to JSON with images in C# with ByteScout PDF Extractor SDK

Tutorial: how to convert PDF to JSON with images in C#

Source code samples provide a quick and easy way to add required functionality to your application. ByteScout PDF Extractor SDK can convert PDF to JSON with images in C#. The SDK helps developers extract data from unstructured documents, PDFs, images, and scanned and electronic forms. It includes AI functions such as automatic table detection, automatic table extraction and restructuring, and text recognition and text restoration from PDF and scanned documents. It also includes PDF to CSV, PDF to XML, PDF to JSON, and PDF to searchable PDF functions, as well as methods for low-level data extraction.

The sample below shows how to convert PDF to JSON with images in C# using ByteScout PDF Extractor SDK. To use it in your C# project or application, copy and paste the code and then run your app. The sample writes the JSON twice: once with the images saved to external files in a separate folder, and once with the images embedded in the JSON as Base64-encoded strings.

Our website provides a free trial version of ByteScout PDF Extractor SDK. It comes with all of these source code samples to help you implement your C# application.

On-demand (REST Web API) version:
Web API (on-demand version)

On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)

Program.cs

using System;
using Bytescout.PDFExtractor;

namespace ConsoleApplication1
{
    class Program
    {
        static void Main(string[] args)
        {
            // Create Bytescout.PDFExtractor.JSONExtractor instance
            JSONExtractor extractor = new JSONExtractor();
            extractor.RegistrationName = "demo";
            extractor.RegistrationKey = "demo";

            // Load sample PDF document
            extractor.LoadDocumentFromFile("sample1.pdf");

            // Uncomment this line to get rid of empty nodes in JSON
            //extractor.PreserveFormattingOnTextExtraction = false;

            // Set output image format
            extractor.ImageFormat = OutputImageFormat.PNG;
            
            // Save images to external files
            extractor.SaveImages = ImageHandling.OuterFile;
            extractor.ImageFolder = "images"; // Folder for external images
            extractor.SaveJSONToFile("result_with_external_images.json");

            // Embed images into JSON as Base64 encoded string
            extractor.SaveImages = ImageHandling.Embed;
            extractor.SaveJSONToFile("result_with_embedded_images.json");

            // Cleanup
            extractor.Dispose();
        }
    }
}
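
Once result_with_embedded_images.json has been written, the embedded images can be turned back into files. The following is only a minimal sketch and does not use the SDK: the layout of the JSON produced by ByteScout PDF Extractor SDK is not described here, so the code makes no assumption about property names. Instead it walks every value in the file with System.Text.Json (.NET Core 3.0 or later) and keeps each string that decodes from Base64 to data starting with the PNG signature. The names EmbeddedImageScanner, FindPngImages, TryDecodePng and the embedded_N.png output file names are hypothetical, and the handling of a "data:" prefix is an assumption in case the strings are stored as data URIs.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Text.Json;

// Hypothetical helper, not part of ByteScout PDF Extractor SDK.
static class EmbeddedImageScanner
{
    // The first eight bytes of every PNG file.
    static readonly byte[] PngSignature = { 0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A };

    // Recursively visits every value in the JSON tree and yields the decoded
    // bytes of each string that turns out to be a Base64-encoded PNG.
    public static IEnumerable<byte[]> FindPngImages(JsonElement element)
    {
        switch (element.ValueKind)
        {
            case JsonValueKind.Object:
                foreach (JsonProperty property in element.EnumerateObject())
                    foreach (byte[] image in FindPngImages(property.Value))
                        yield return image;
                break;

            case JsonValueKind.Array:
                foreach (JsonElement item in element.EnumerateArray())
                    foreach (byte[] image in FindPngImages(item))
                        yield return image;
                break;

            case JsonValueKind.String:
            {
                byte[] decoded = TryDecodePng(element.GetString());
                if (decoded != null)
                    yield return decoded;
                break;
            }
        }
    }

    // Returns the decoded bytes if the text is Base64 for a PNG, otherwise null.
    static byte[] TryDecodePng(string text)
    {
        if (string.IsNullOrEmpty(text))
            return null;

        // Assumption: tolerate an optional data-URI prefix such as "data:image/png;base64,".
        int comma = text.IndexOf(',');
        if (text.StartsWith("data:", StringComparison.Ordinal) && comma >= 0)
            text = text.Substring(comma + 1);

        byte[] bytes;
        try { bytes = Convert.FromBase64String(text); }
        catch (FormatException) { return null; }   // not Base64 at all

        if (bytes.Length < PngSignature.Length)
            return null;
        for (int i = 0; i < PngSignature.Length; i++)
            if (bytes[i] != PngSignature[i])
                return null;
        return bytes;
    }
}

static class Program
{
    static void Main(string[] args)
    {
        string path = args.Length > 0 ? args[0] : "result_with_embedded_images.json";

        using (JsonDocument document = JsonDocument.Parse(File.ReadAllText(path)))
        {
            int count = 0;
            foreach (byte[] image in EmbeddedImageScanner.FindPngImages(document.RootElement))
                File.WriteAllBytes($"embedded_{count++}.png", image);

            Console.WriteLine($"Wrote {count} image(s).");
        }
    }
}
```

The scan checks only for PNG because the sample sets ImageFormat to OutputImageFormat.PNG; a different output format would need a different signature check. Attempting to decode every string is simple but relies on exceptions for ordinary text, so for very large JSON files it may be worth looking up the image properties by name once the actual output layout is known.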