PDF to text

Updated Software: ByteScout PDF Extractor SDK 6.11.2149
ByteScout updated its developer library, ByteScout PDF Extractor SDK 6.11.2149, on August 6, 2015. What's new in ByteScout PDF Extractor SDK 6.11.2149: Batch Processing samples updated to show the use of the Reset() method; C++ source code sample added for Pages Extraction; DocumentMerger adds a Merge2(inputfile1, inputfile2, outputfile) method to merge 2 files; XLS Extractor minor bug fixes; PDF Multitool now allows enabling/disabling text, image, and vector layers and adds advanced settings for text extraction; XML, CSV, Table extraction improves support for tables with empty [...]
ByteScout updated its developer library, ByteScout PDF Extractor SDK 6.00.2071, on May 14, 2015. What's new in ByteScout PDF Extractor SDK 6.00.2071: PDF to XML, PDF To CSV, PDF To Text functionality improved; PDF To XLS command line sample added (based on VBScript); PDF To HTML SDK adds a new .DetectHyperLinks property (TRUE by default) to enable/disable automated link detection in the text; new SearchablePDFMaker (available for PRO licenses) to convert PDF into searchable PDF files; new properties in extractor: ConsiderFontNames, ConsiderFontSizes, ConsiderFontColors, ConsiderVerticalBorders in CFG files; header columns [...]
New software released: PDF Extractor SDK, PDF Renderer SDK, PDF Viewer freeware
ByteScout updated its developer libraries ByteScout PDF Extractor SDK 5.80.1781 and ByteScout PDF Renderer SDK 5.20.1870. The freeware PDF Viewer 5.20.1871 was also released in January 2015. What's new in ByteScout PDF Extractor SDK 5.80.1781: PDF to XML, PDF to CSV, PDF to Text functionality updated; OCRMode now provides 9 modes; .DetectLineInsteadOfParagraph now works much better (set it to False to capture multiline text in table cells); PDF controls support improved; FDF and XFDF data extraction; and more. What's new ByteScout PDF Renderer SDK [...]
ByteScout's November PDF library updates for developers are PDF Extractor SDK 5.10.1747 and PDF To HTML SDK 5.10.1750. What's new in ByteScout PDF Extractor SDK 5.10.1747: PDF to XML, PDF to CSV, PDF to Text functions improved; now supports text extraction from text controls; XML extractor now adds font style, size, name, and text coordinates to <text> tags; ASP.NET sample for OCR usage added; new property OCRLanguageDataFolder to specify the location of the "tessdata" folder; improved support of PDF files; improved support for rotated text; updated source code samples; updated documentation; minor improvements and fixes. What's new ByteScout [...]
Updates for our PDF manipulation products – new versions
ByteScout released new versions of its PDF manipulation SDK products for software developers on June 2nd, 2014. ByteScout offers developers ready-to-use solutions to implement PDF viewers and convert PDF to text, HTML, and images, with no additional software required. Here is a list of some of the new and updated features in ByteScout SDKs: 1) PDF Extractor SDK 4.00.1487. Convert PDF to text, extract images from PDF, convert PDF to CSV for Excel, PDF to XML. What's new PDF Extractor SDK 4.00.1487: improved pdf to text, pdf to [...]
PDF/A (Archival)
PDF/A (Archival Portable Document Format), commonly referred to as archival PDF, is a standardized type of PDF used to store information for longer periods than traditional PDF. PDF/A is an ISO-standardized variation of PDF and is widely used to preserve electronic documents and files digitally over long periods. History and Purpose: The traditional PDF document format relies on lots of external information when [...]