ByteScout’s November PDF libraries updates for developer are
PDF Extractor SDK 5.10.1747 and PDF To HTML SDK 5.10.1750.

What’s new ByteScout PDF Extractor SDK 5.10.1747:

  • PDF to XML, PDF to CSV, PDF to Text functions improved;
  • now supports text extraction from text controls;
  • XML extractor now adds font style, size, name, text coordinates into <text> tags;
  • ASP.NET sample for OCR usage added;
  • new property OCRLanguageDataFolder to specify the location of “tessdata” folder;
  • improved support of PDF files;
  • improves support for rotated text;
  • updated source code samples;
  • updated documentation;
  • minor improvements and fixes.

What’s new ByteScout PDF To HTML SDK 5.10.1750:

  • improved pdf to html conversion from ASP.NET and .NET;
  • issue with overlapping content when converting multiple pages from PDF fixed;
  • XHTML output minor fixes;
  • supporting for text opacity added;
  • now outputs unknown characters (0 to 32) as “?”;
  • improving support of pdf images conversion into html;
  • fixing minor issues with output images filenames;
  • minor improvements and fixes.