How to search in PDF and extract the found text using PDF Extractor SDK in C# - ByteScout
Announcement
Our ByteScout SDK products are sunsetting as we focus on expanding new solutions.
Learn More Open modal
Close modal
Announcement Important Update
ByteScout SDK Sunsetting Notice
Our ByteScout SDK products are sunsetting as we focus on our new & improved solutions. Thank you for being part of our journey, and we look forward to supporting you in this next chapter!

How to search in PDF and extract the found text using PDF Extractor SDK in C#

  • Home
  • /
  • Articles
  • /
  • How to search in PDF and extract the found text using PDF Extractor SDK in C#

Use the sample source code below to search for a specific text in a PDF document and extract the found results with the ByteScout PDF Extractor SDK in C#.

How to search in PDF and extract results in C#

using System;
using System.IO;
using System.Text;
using Bytescout.PDFExtractor;
using System.Drawing;
using System.Diagnostics;

namespace Example
{
    class Program
    {
        static void Main(string[] args)
        {

            TextExtractor extractor = new TextExtractor("demo", "demo");

            // load the document
            extractor.LoadDocumentFromFile("sample2.pdf");

            string searchString = "what";
            
            // get page count
            int pageCount = extractor.GetPageCount();
            int count = 0;

            // iterate through pages
            for (int i = 0; i < pageCount; i++)
            {
                // search for text string
                if (extractor.Find(i, searchString, false))
                {
                    do
                    {
                        count++;

                        // output search results
                        Console.WriteLine("Found on page " + i + " at location " + extractor.FoundText.Bounds.ToString());

                        // now we are getting the found text
			string extractedString = extractor.FoundText.Text;
                        Console.WriteLine("Extracted string: " + extractedString);

                    }
                    while (extractor.FindNext()); // search next occurance of the search string
                }
            }



            Console.WriteLine("Press any key to exit...");
            Console.ReadKey();

        }
    }
}

Tutorials:

prev
next