Welcome Guest Search | Active Topics | Sign In | Register

EOPdf: Copying text from generated pdfs Options
Erisma
Posted: Monday, June 10, 2013 5:14:32 AM
Rank: Member
Groups: Member

Joined: 8/12/2011
Posts: 16
When copying text from pdfs that are generated from EOPdf results in spaces being replace with line breaks in Adobe Reader. In SummatraPDF the spaces are replaced with question marks.
Is this the intended behaviour?

If you want we can send you a pdf file were the problem occures.
eo_support
Posted: Monday, June 10, 2013 9:32:42 AM
Rank: Administration
Groups: Administration

Joined: 5/27/2007
Posts: 24,195
Hi,

The line break can happen because sometimes PDF doesn't really know the two textblocks are the same line (text block are rendered by "position", not by "line" in PDF). But it shouldn't appear as question mark. So if you can send us a test file we will be happy to take a look. Please also include code that you use the create that file. We will PM you as where to send them.

Thanks!
Erisma
Posted: Tuesday, June 11, 2013 1:35:37 AM
Rank: Member
Groups: Member

Joined: 8/12/2011
Posts: 16
Hi,

Thanks for the answer. The question mark is only a problem for one specific pdf viewer(SummatraPdf) and is not as big of a problem for us as the linebreak for every word. Is that anything that can be fixed? Is there something we can do to resolve this? I see that other pdfs not generated from EOPdf doesn´t have that problem.

The problem for us is that our customers use pdfs generated from EOPdf and sometimes they need to copy paste from the pdfs and they have reported this as an issue to us.

Thanks
Erisma
Posted: Tuesday, June 11, 2013 2:13:39 AM
Rank: Member
Groups: Member

Joined: 8/12/2011
Posts: 16
Hi,

We noticed now when we make a simple project for simulating our issue that the problem disappear. So it could be that when we set properties like PageSize, FooterHtmlFormat etc. on EO.Pdf.HtmlToPdf.Options causes this issue.

We will keep testing to try to narrowing it down. Just wanted you to know about our findings.

Thanks.
Erisma
Posted: Tuesday, June 11, 2013 3:27:45 AM
Rank: Member
Groups: Member

Joined: 8/12/2011
Posts: 16
Hi,

After looking at it in our source code and removing different properties we set on EO.Pdf.HtmlToPdf.Options we can see that the line break issue appears when we set one of the following properties:
HtmlToPdf.Options.FooterHtmlPosition
HtmlToPdf.Options.FooterHtmlFormat
EO.Pdf.HtmlToPdf.Options.PageSize.

The issue appears even if we set EO.Pdf.HtmlToPdf.Options.PageSize to the same value it had before.
So setting these properties triggers recalculating of the document and rendering by position?

Thanks.

eo_support
Posted: Tuesday, June 11, 2013 9:12:58 AM
Rank: Administration
Groups: Administration

Joined: 5/27/2007
Posts: 24,195
Hi,

We are not aware of any of those would make a difference. You probably want to contact SummatraPDF to see if they can give you any information because they are the only one that somehow interprets space as block.

Thanks
Erisma
Posted: Tuesday, June 11, 2013 10:04:30 AM
Rank: Member
Groups: Member

Joined: 8/12/2011
Posts: 16
Hi,

SummatraPdf has question mark instead of line breaks. The line break issue is present in Adobe Reader, but not always. It seem to depend on how we programmaticly adjust set the following properties:
HtmlToPdf.Options.FooterHtmlPosition
HtmlToPdf.Options.FooterHtmlFormat
EO.Pdf.HtmlToPdf.Options.PageSize

But that is just my guess from testing different senarios with and without setting those properties.

Thanks.


You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.