How to convert all pages of PDF document to plain HTML in Visual C# with PDF To HTML SDK - ByteScout
Announcement
Our ByteScout SDK products are sunsetting as we focus on expanding new solutions.
Learn More Open modal
Close modal
Announcement Important Update
ByteScout SDK Sunsetting Notice
Our ByteScout SDK products are sunsetting as we focus on our new & improved solutions. Thank you for being part of our journey, and we look forward to supporting you in this next chapter!

How to convert all pages of PDF document to plain HTML in Visual C# with PDF To HTML SDK

  • Home
  • /
  • Articles
  • /
  • How to convert all pages of PDF document to plain HTML in Visual C# with PDF To HTML SDK

The sample source code below is for converting all pages of PDF document to plain HTML in Visual C# with Bytescout PDF To HTML SDK.

Visual C#

using System;
using Bytescout.PDF2HTML;

namespace ExtractHTML
{
	class Program
	{
		static void Main(string[] args)
		{
			// Create Bytescout.PDF2HTML.HTMLExtractor instance
			HTMLExtractor extractor = new HTMLExtractor();
			extractor.RegistrationName = "demo";
			extractor.RegistrationKey = "demo";

			// Set plain HTML extraction mode
			extractor.ExtractionMode = HTMLExtractionMode.PlainHTML;

			// Load sample PDF document
			extractor.LoadDocumentFromFile("sample2.pdf");

			// Save extracted HTML to file
			extractor.SaveHtmlToFile("output.html");

			// Open output file in default associated application
			System.Diagnostics.Process.Start("output.html");
		}
	}
}

Tutorials:

prev
next