How to convert all pages of PDF document to plain HTML in Visual C# with PDF To HTML SDK - ByteScout

The sample source code below converts all pages of a PDF document to plain HTML in Visual C# using the Bytescout PDF To HTML SDK.

Visual C#

using System;
using Bytescout.PDF2HTML;

namespace ExtractHTML
{
	class Program
	{
		static void Main(string[] args)
		{
			// Create Bytescout.PDF2HTML.HTMLExtractor instance
			HTMLExtractor extractor = new HTMLExtractor();
			extractor.RegistrationName = "demo";
			extractor.RegistrationKey = "demo";

			// Set plain HTML extraction mode
			extractor.ExtractionMode = HTMLExtractionMode.PlainHTML;

			// Load sample PDF document
			extractor.LoadDocumentFromFile("sample2.pdf");

			// Save extracted HTML to file
			extractor.SaveHtmlToFile("output.html");

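			// Open output file in the default associated application.
			// UseShellExecute must be set explicitly on .NET Core / .NET 5+, where it defaults to false.
			var startInfo = new System.Diagnostics.ProcessStartInfo("output.html") { UseShellExecute = true };
			System.Diagnostics.Process.Start(startInfo);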
		}
	}
}
