ByteScout Cloud API Server - PDF To XML API - Python - Convert PDF To XML From Uploaded File - ByteScout



How to convert PDF to XML from an uploaded file in Python using ByteScout Cloud API Server

Follow this simple tutorial to learn how to convert PDF to XML from an uploaded file with the PDF to XML API in Python

This documentation is written to help you apply all the necessary features on your side. ByteScout Cloud API Server was designed to support PDF to XML API in Python. ByteScout Cloud API Server is a ready-to-use API server that can be installed and deployed in less than 30 minutes on your own Windows server or on a server in a cloud. It can save data and files in your local server-based file storage or in Amazon AWS S3 storage. Data is processed solely on the API server and is powered by the ByteScout engine; no cloud services or Internet connection are required for data processing.

A Python code snippet like the one below works best when you need to quickly implement PDF to XML API in your Python application. Open your Python project, copy and paste the code, and run your app. The ready-to-use sample Python code adds PDF to XML API functions to your application using ByteScout Cloud API Server.

A free trial version of ByteScout Cloud API Server is available on our website. The trial version also includes other code samples to help you with your Python application.

On-demand (REST Web API) version:
 Web API (on-demand version)

On-premise offline SDK for Windows:
 60 Day Free Trial (on-premise)

ConvertPdfToXMLFromUploadedFile.py
      
import os
import requests  # pip install requests

# Please NOTE: In this sample we're assuming Cloud API Server is hosted at "https://localhost".
# If it's not, then please replace this with your hosting URL.

# Base URL for Cloud API Server requests
BASE_URL = "https://localhost"

# Source PDF file
SourceFile = ".\\sample.pdf"
# Comma-separated list of page indices (or ranges) to process. Leave empty for all pages. Example: '0,2-5,7-'.
Pages = ""
# PDF document password. Leave empty for unprotected documents.
Password = ""
# Destination XML file name
DestinationFile = ".\\result.xml"


def main(args=None):
    uploadedFileUrl = uploadFile(SourceFile)
    if uploadedFileUrl is not None:
        convertPdfToXml(uploadedFileUrl, DestinationFile)


def convertPdfToXml(uploadedFileUrl, destinationFile):
    """Converts PDF To XML using the Cloud API Server"""

    # Prepare URL for 'PDF To XML' API request
    url = "{}/pdf/convert/to/xml?name={}&password={}&pages={}&url={}".format(
        BASE_URL,
        os.path.basename(destinationFile),
        Password,
        Pages,
        uploadedFileUrl
    )

    # Execute request and get response as JSON
    response = requests.get(url, headers={"content-type": "application/octet-stream"})
    if response.status_code == 200:
        result = response.json()

        if not result["error"]:
            # Get URL of result file
            resultFileUrl = result["url"]
            # Download result file
            r = requests.get(resultFileUrl, stream=True)
            if r.status_code == 200:
                with open(destinationFile, 'wb') as file:
                    for chunk in r:
                        file.write(chunk)
                print(f"Result file saved as \"{destinationFile}\" file.")
            else:
                # Report the status of the download request, not the conversion request
                print(f"Request error: {r.status_code} {r.reason}")
        else:
            # Show service reported error
            print(result["message"])
    else:
        print(f"Request error: {response.status_code} {response.reason}")


def uploadFile(fileName):
    """Uploads file to the server"""

    # 1. RETRIEVE PRESIGNED URL TO UPLOAD FILE.

    # Prepare URL for 'Get Presigned URL' API request
    url = "{}/file/upload/get-presigned-url?contenttype=application/octet-stream&name={}".format(
        BASE_URL, os.path.basename(fileName))

    # Execute request and get response as JSON
    response = requests.get(url)
    if response.status_code == 200:
        result = response.json()

        if not result["error"]:
            # URL to use for file upload
            uploadUrl = result["presignedUrl"]
            # URL for future reference
            uploadedFileUrl = result["url"]

            # 2. UPLOAD FILE TO CLOUD.
            with open(fileName, 'rb') as file:
                requests.put(uploadUrl, data=file, headers={"content-type": "application/octet-stream"})

            return uploadedFileUrl
        else:
            # Show service reported error
            print(result["message"])
    else:
        print(f"Request error: {response.status_code} {response.reason}")

    return None


if __name__ == '__main__':
    main()
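The sample above assembles the 'PDF To XML' request URL with plain string formatting, so a password containing `&` or an uploaded file URL that carries its own query string could be misread as extra parameters. The following is a minimal sketch of one way to percent-encode those values with the standard library. The helper name `build_convert_url` is hypothetical and not part of the ByteScout sample, and the sketch assumes the server decodes standard URL-encoded query strings; the endpoint path and parameter names are taken from the sample above.

```python
from urllib.parse import urlencode


def build_convert_url(base_url, uploaded_file_url, result_name, password="", pages=""):
    """Hypothetical helper: build the 'PDF To XML' request URL with encoded query values."""
    # urlencode percent-encodes reserved characters such as '&', '?', ',' and '/'
    # so each value stays inside its own query parameter.
    query = urlencode({
        "name": result_name,
        "password": password,
        "pages": pages,
        "url": uploaded_file_url,
    })
    # Strip a trailing slash from the base URL to avoid a double slash in the path.
    return "{}/pdf/convert/to/xml?{}".format(base_url.rstrip("/"), query)


if __name__ == "__main__":
    # Example values only; the uploaded file URL here is made up for illustration.
    print(build_convert_url(
        "https://localhost",
        "https://localhost/files/sample.pdf?sig=a&b=1",
        "result.xml",
        password="p&ss word",
        pages="0,2-5,7-",
    ))
```

If this approach suits your deployment, the returned string could be passed to `requests.get` in place of the formatted URL in `convertPdfToXml`.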

ON-PREMISE OFFLINE SDK

60 Day Free Trial or Visit ByteScout Cloud API Server Home Page

Explore ByteScout Cloud API Server Documentation

Explore Samples

Sign Up for ByteScout Cloud API Server Online Training

ON-DEMAND REST WEB API

Get Your API Key

Explore Web API Docs

Explore Web API Samples

