This sample shows how to extract text by words from a PDF page in C# and VB.NET.
Docotic.Pdf library allows you to read PDF text with detailed information (position, font, color) about every text chunk. Use the PdfPage.GetWords method to extract information about all words with their coordinates on a PDF page.