Welcome Guest Search | Active Topics | Sign In | Register

How to read text content from existing PDF Options
Kalyan
Posted: Friday, March 2, 2018 4:10:48 PM
Rank: Newbie
Groups: Member

Joined: 3/2/2018
Posts: 1
Hi,

We have a licensed version of V5 and I am trying to read the content from PDF (text content).Documentation is not sufficient on how to read the content from PDF (text content from PDF).Do you support the functionality?
Code: C#


Sharing my sample logic

PdfDocument reader = new PdfDocument(stream);

foreach(var page in reader.Pages)
{
foreach(var pdfContent in page.Contents)
{
//pdfContent.
Need a function to read the content from the page.
}

}
eo_support
Posted: Friday, March 2, 2018 4:55:43 PM
Rank: Administration
Groups: Administration

Joined: 5/27/2007
Posts: 24,221
Hi,

Unfortunately we do not have any API for you to read existing text from a PDF file. This is not even possible with some PDF file: PDF file was primary designed for printing purpose so its focus is how to "draw" each character, and some PDF file only has this information but does not have any information on exactly what that character is. For those file it is not possible to read text.

Thanks!


You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.