New: ByteScout released PDF Extractor SDK 5.10.1747 and PDF To HTML SDK 5.10.1750

  • Home
  • /
  • Blog
  • /
  • New: ByteScout released PDF Extractor SDK 5.10.1747 and PDF To HTML SDK 5.10.1750

ByteScout’s November PDF libraries updates for developer are
PDF Extractor SDK 5.10.1747 and PDF To HTML SDK 5.10.1750.

What’s new ByteScout PDF Extractor SDK 5.10.1747:

  • PDF to XML, PDF to CSV, PDF to Text functions improved;
  • now supports text extraction from text controls;
  • XML extractor now adds font style, size, name, text coordinates into <text> tags;
  • ASP.NET sample for OCR usage added;
  • new property OCRLanguageDataFolder to specify the location of “tessdata” folder;
  • improved support of PDF files;
  • improves support for rotated text;
  • updated source code samples;
  • updated documentation;
  • minor improvements and fixes.

What’s new ByteScout PDF To HTML SDK 5.10.1750:

  • improved pdf to html conversion from ASP.NET and .NET;
  • issue with overlapping content when converting multiple pages from PDF fixed;
  • XHTML output minor fixes;
  • supporting for text opacity added;
  • now outputs unknown characters (0 to 32) as “?”;
  • improving support of pdf images conversion into html;
  • fixing minor issues with output images filenames;
  • minor improvements and fixes.

prev
next