PrinceXML: "Input is not proper UTF-8"

Posted on 2024-10-02 17:52:16

I'm generating HTML from a database and then sending it to PrinceXML for conversion to PDF. The code I use to do this is:

string _htmlTemplate = @"<!DOCTYPE html PUBLIC ""-//W3C//DTD XHTML 1.0 Transitional//EN"" ""http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd""><html lang=""en-GB"" xml:lang=""en-GB"" xmlns=""http://www.w3.org/1999/xhtml""><head><meta http-equiv=""Content-type"" content=""text/html;charset=UTF-8"" /><title>Generated PDF Contract</title></head><body>{0}</body></html>";

string _pgeContent = string.Format(_htmlTemplate, sb.ToString());
writer.Write(sb.ToString());
Byte[] arrBytes = UTF8Encoding.Default.GetBytes(_pgeContent);
Stream s = new MemoryStream(arrBytes);

Prince princeConverter = new Prince(ConfigurationManager.AppSettings["PrinceXMLInstallLoc"].ToString());
princeConverter.SetLog(ConfigurationManager.AppSettings["PrinceXMLLogLoc"]);
princeConverter.AddStyleSheet(Server.MapPath(ConfigurationManager.AppSettings["FormsDocGenCssLocl"]));
Response.ClearContent();
Response.ClearHeaders();
Response.ContentType = "application/pdf";
Response.BufferOutput = true;

However, conversion fails with the error:

Input is not proper UTF-8, indicate encoding ! Bytes: 0xA0 0x77 0x65 0x62

I've taken the generated html and uploaded it to the W3C validator. It validates the markup as UTF-8 encoded XHTML 1.0 Transitional with no errors or warnings.

I've also gone through the file with a fine-tooth comb looking for invalid characters. So far, nothing.

Can anyone suggest something else I could try?

捶死心动 2024-10-09 17:52:16

Well, after an afternoon of muttering curses and tearing out what is left of my hair, I figured out a fix for my particular problem.

It would appear that System.Text.UTF8Encoding doesn't output a UTF-8 identifier (the byte order mark) by default. So in my case I needed to use the constructor that takes a boolean parameter to control whether it is emitted.

UTF8Encoding u8enc = new UTF8Encoding(true); // Ensures a UTF-8 identifier is emitted.

After this it was all good. Hope this helps someone :-)
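
A note on why this probably worked, offered as a likely explanation rather than a confirmed diagnosis. In the question, UTF8Encoding.Default is really the static Encoding.Default property inherited from the base class. On .NET Framework that is the system ANSI code page, not UTF-8, so a non-breaking space (U+00A0) is written as the single byte 0xA0, which is not valid UTF-8 on its own. That would match the bytes in the error (0xA0 followed by "web"). Also, GetBytes never writes the identifier itself; the constructor flag only affects GetPreamble(). So the fix may have worked mainly because a real UTF8Encoding instance replaced Encoding.Default. The sketch below shows the difference. The sample string is hypothetical, and ISO-8859-1 stands in for the ANSI code page as an assumption, since it maps U+00A0 to the same byte.

```csharp
using System;
using System.Text;

public static class EncodingDemo
{
    // Encodes text with the given encoding and returns the bytes as hex,
    // in the same "0xNN 0xNN" style that the Prince error message uses.
    public static string ToHex(string text, Encoding encoding)
    {
        byte[] bytes = encoding.GetBytes(text);
        string[] parts = new string[bytes.Length];
        for (int i = 0; i < bytes.Length; i++)
        {
            parts[i] = "0x" + bytes[i].ToString("X2");
        }
        return string.Join(" ", parts);
    }

    public static void Main()
    {
        // Hypothetical sample: a non-breaking space (U+00A0) followed by "web".
        string sample = "\u00A0web";

        // Assumption: ISO-8859-1 stands in for a single-byte ANSI code page.
        Encoding singleByte = Encoding.GetEncoding("iso-8859-1");
        Console.WriteLine(ToHex(sample, singleByte)); // 0xA0 0x77 0x65 0x62

        // A real UTF-8 encoder. The constructor flag only affects GetPreamble(),
        // so GetBytes produces the same output for true and false.
        Encoding utf8 = new UTF8Encoding(false);
        Console.WriteLine(ToHex(sample, utf8)); // 0xC2 0xA0 0x77 0x65 0x62
    }
}
```

If that is the cause here, replacing UTF8Encoding.Default.GetBytes(_pgeContent) with Encoding.UTF8.GetBytes(_pgeContent) in the question's code should give Prince valid UTF-8 input, with or without the identifier.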

About the author

执笏见