ABCpdf 5 编码问题（特殊字符）

发布于 2024-12-23 14:24:11 字数 1009 浏览 4 评论 0原文

我正在使用 ABCpdf 版本 5 将一些 html 页面渲染为 PDF。

我基本上使用 HttpServerUtility.Execute() - 方法来检索 pdf 的 html：

System.IO.StringWriter writer = new System.IO.StringWriter();
server.Execute(requestUrl, writer);
string pageResult = writer.ToString();

WebSupergoo.ABCpdf5.Doc pdfDoc = new WebSupergoo.ABCpdf5.Doc();
pdfDoc.AddImageHtml(pageResult);

response.Buffer = false;
response.ContentType = "application/pdf";
response.AddHeader("Content-Disposition", "attachment;filename=MyPdf_" + 
    FormatDate(DateTime.Now, "yyyy-MM-dd") + ".pdf");
response.BinaryWrite(pdfDoc.GetData());

现在一些特殊字符，如 Umlaute (äöü) 被替换为空格。有趣的是，并非全部。我发现了什么：在我的 html 页面中。

`<meta http-equiv="content-type" content="text/xhtml; charset=utf-8" />`

如果我解析它，所有特殊字符都会正确呈现。但在我看来，这就像一个丑陋的黑客行为。

早些时候，我没有使用 HttpServerUtility.Execute()，但我让 ABCpdf 调用 URL 本身：pdfDoc.AddImageUrl("someUrl");。在那里我没有这样的编码问题。

我还能尝试什么？

原文

I am using ABCpdf Version 5 in order to render some html-pages into PDFs.

I basically use HttpServerUtility.Execute() - Method in order to retrieve the html for the pdf:

System.IO.StringWriter writer = new System.IO.StringWriter();
server.Execute(requestUrl, writer);
string pageResult = writer.ToString();

WebSupergoo.ABCpdf5.Doc pdfDoc = new WebSupergoo.ABCpdf5.Doc();
pdfDoc.AddImageHtml(pageResult);

response.Buffer = false;
response.ContentType = "application/pdf";
response.AddHeader("Content-Disposition", "attachment;filename=MyPdf_" + 
    FormatDate(DateTime.Now, "yyyy-MM-dd") + ".pdf");
response.BinaryWrite(pdfDoc.GetData());

Now some special characters like Umlaute (äöü) are replaced with an empty space. Interestingly not all of them. What I did figure out:
Within the html-page I have.

`<meta http-equiv="content-type" content="text/xhtml; charset=utf-8" />`

If I parse this away, all special chars are rendered correctly. But this seems to me like an ugly hack.

In earlier days I did not use HttpServerUtility.Execute(), but I let ABCpdf call the URL itself: pdfDoc.AddImageUrl("someUrl");. There I had no such encoding-problems.

What could I try else?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

旧城烟雨 2024-12-30 14:24:11

刚刚遇到这个问题 ABCpdf 8。

在代码中，您检索 HTML 内容并将 pageResult 传递给 AddImageHtml()。作为文档州，

ABCpdf 将此 HTML 保存到临时文件中并呈现该文件
使用“file://”协议说明符。

没有提到的是，临时文件是 UTF-8 编码的，但 HTML 文件中没有说明编码。

<元>标签实际上设置了所需的编码，并解决了我的问题。

避免声明编码的一种方法是使用我希望通过 AddImageUrl() 方法从 HTTP/HTML 响应中检测 HTML 编码。

回复收藏 0 原文

·深蓝 2024-12-30 14:24:11

编码元标记和 AddImageURL 方法可能有助于简单的文档，但在链式情况下则无济于事，在这种情况下，尽管编码了标记，但编码还是会以某种方式丢失。我遇到了这个问题（正如原始问题中所描述的那样 - 一些外来字符（例如变音符号）会消失），并且没有看到解决方案。我正在考虑完全摆脱 ABCPDF 并将其替换为 SSRS，它可以呈现 PDF 格式。

回复收藏 0 原文

~没有更多了~