ByteScout Data Extraction Suite - Powershell - Convert PDF To JSON with PDF Extractor SDK - ByteScout

ByteScout Data Extraction Suite – Powershell – Convert PDF To JSON with PDF Extractor SDK

  • Home
  • /
  • Articles
  • /
  • ByteScout Data Extraction Suite – Powershell – Convert PDF To JSON with PDF Extractor SDK

How to convert PDF to JSON with PDF extractor SDK in Powershell and ByteScout Data Extraction Suite

This code in Powershell shows how to convert PDF to JSON with PDF extractor SDK with this how to tutorial

Every ByteScout tool includes simple example Powershell source codes that you can get here or in the folder with installed ByteScout product. ByteScout Data Extraction Suite is the bundle that includes three SDK tools for data extraction from PDF, scans, images and from spreadsheets: PDF Extractor SDK, Data Extraction SDK, Barcode Reader SDK. It can be applied to convert PDF to JSON with PDF extractor SDK using Powershell.

The SDK samples given below describe how to quickly make your application do convert PDF to JSON with PDF extractor SDK in Powershell with the help of ByteScout Data Extraction Suite. Follow the instructions from scratch to work and copy the Powershell code. Further improvement of the code will make it more robust.

Trial version of ByteScout Data Extraction Suite is available for free. Source code samples are included to help you with your Powershell app.

On-demand (REST Web API) version:
 Web API (on-demand version)

On-premise offline SDK for Windows:
 60 Day Free Trial (on-premise)

pdf-to-json.bat
      
@echo off if "%~1"=="" ( echo ----------------------------------------------------- echo Invalid parameter! echo ----------------------------------------------------- echo Usage: pdf-to-json.bat folder_path echo Example: pdf-to-json.bat "c:\documents" echo ----------------------------------------------------- if not "%NOPAUSE%"=="1" pause exit /b 1 ) powershell -NoProfile -ExecutionPolicy Bypass -Command "& .\pdf-to-json.ps1" "%1" echo Script finished with errorlevel=%errorlevel% pause

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page

Explore ByteScout Data Extraction Suite Documentation

Explore Samples

Sign Up for ByteScout Data Extraction Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

pdf-to-json.ps1
      
Param( [Parameter(Mandatory = $true)] [string] $InputFolder = "" ) # Add reference to Bytescout.PDFExtractor.dll assembly Add-Type -Path "c:\Program Files\Bytescout PDF Extractor SDK\net4.00\Bytescout.PDFExtractor.dll" # Check input folder exists if ((Test-Path $InputFolder) -eq $false) { Write-Host "Target folder does not exist." -ForegroundColor Red exit 0 } # Create and activate JSONExtractor instance $jsonExtractor = New-Object Bytescout.PDFExtractor.JSONExtractor $jsonExtractor.RegistrationName = "demo" $jsonExtractor.RegistrationKey = "demo" try { # Get PDF files from input folder $files = Get-ChildItem -Path $InputFolder -Recurse -Include "*.pdf" foreach ($file in $files) { Write-Host "Input file" $file.FullName # Construct output file name $jsonFileName = [System.IO.Path]::ChangeExtension($file.FullName, "json") Write-Host " Output file" $jsonFileName # Load PDF document $jsonExtractor.LoadDocumentFromFile($file.FullName) # Disable the formatting reconstruction $jsonExtractor.PreserveFormattingOnTextExtraction = $false # Extract first page to JSON $jsonExtractor.SaveJSONToFile(0, $jsonFileName) # Reset extractor $jsonExtractor.Reset() } } catch { Write-Host $_.Exception.Message } $jsonExtractor.Dispose()

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page

Explore ByteScout Data Extraction Suite Documentation

Explore Samples

Sign Up for ByteScout Data Extraction Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

VIDEO

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Data Extraction Suite Home Page

Explore ByteScout Data Extraction Suite Documentation

Explore Samples

Sign Up for ByteScout Data Extraction Suite Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

Tutorials:

prev
next