当前位置：文江博客话题详情

Coldfusion XMLFormat() 不转换所有字符

发布于 2024-08-10 05:47:39 字数 118 浏览 4 评论 0原文

我正在使用 XMLFormat() 对 XML 文档的一些文本进行编码。但是，当我去读取我创建的 XML 文件时，出现无效字符错误。为什么 XMLFormat() 不能正确编码所有字符？

我正在运行CF8。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

在巴黎塔顶看东京樱花 2024-08-17 05:47:39

您确定以正确的编码输出文件吗？您不能这样做，

<cffile action="write" file="foo.xml" output="#xml#" />

因为结果很可能与您的 XML 所在的字符集不同。除非另有说明（通过编码声明），否则 XML 文件将被视为 UTF-8，您应该这样做：

<cffile action="write" file="foo.xml" output="#xml#" charset="utf-8" />
<!--- and --->
<cffile action="read" file="foo.xml" variable="xml" charset="utf-8" />

Are you sure to output the file in the right encoding? You can't just do

<cffile action="write" file="foo.xml" output="#xml#" />

as the result very likely diverges from the character set your XML is in. Unless otherwise noted (by an encoding declaration), XML files are treated as UTF-8, and you should do:

<cffile action="write" file="foo.xml" output="#xml#" charset="utf-8" />
<!--- and --->
<cffile action="read" file="foo.xml" variable="xml" charset="utf-8" />

回复收藏 0 原文

—━☆沉默づ 2024-08-17 05:47:39

我觉得这是 XMLFormat 中的一个错误。我不确定下面代码片段的原始作者是谁，但这里有一种通过正则表达式捕获额外字符的方法......

  <cfset myText = xmlFormat(myText)>

  <cfscript>
      i = 0;
      tmp = '';
      while(ReFind('[^\x00-\x7F]',myText,i,false))
      {
        i = ReFind('[^\x00-\x7F]',myText,i,false); // discover high chr and save it's numeric string position.
        tmp = '&##x#FormatBaseN(Asc(Mid(myText,i,1)),16)#;'; // obtain the high chr and convert it to a hex numeric chr.
        myText = Insert(tmp,myText,i); // insert the new hex numeric chr into the string.
        myText = RemoveChars(myText,i,1); // delete the redundant high chr from string.
        i = i+Len(tmp); // adjust the loop scan for the new chr placement, then continue the loop.
      }
      return myText;
  </cfscript>

I feel that this is a bug in XMLFormat. I am not sure who the original author of the snippet below is but here is an approach to catch the extra characters via regex...

  <cfset myText = xmlFormat(myText)>

  <cfscript>
      i = 0;
      tmp = '';
      while(ReFind('[^\x00-\x7F]',myText,i,false))
      {
        i = ReFind('[^\x00-\x7F]',myText,i,false); // discover high chr and save it's numeric string position.
        tmp = '&##x#FormatBaseN(Asc(Mid(myText,i,1)),16)#;'; // obtain the high chr and convert it to a hex numeric chr.
        myText = Insert(tmp,myText,i); // insert the new hex numeric chr into the string.
        myText = RemoveChars(myText,i,1); // delete the redundant high chr from string.
        i = i+Len(tmp); // adjust the loop scan for the new chr placement, then continue the loop.
      }
      return myText;
  </cfscript>

回复收藏 0 原文

伴梦长久 2024-08-17 05:47:39

不要忘记将放入在你的模板之上。

回复收藏 0 原文

天荒地未老 2024-08-17 05:47:39

如果您尝试将 XML 直接返回到浏览器，您可能需要尝试类似让用户下载它的方法

<cfheader name="Content-Disposition" charset="utf-8" value="attachment; filename=export.xml">
<cfcontent variable="#someXMLPacket#" type="text/xml"  reset="true">

，或者，如果您希望它作为网页返回（ala REST），那么这应该可以解决问题，

<cfheader charset="utf-8">
<cfcontent variable="#someXMLPacket#" type="text/xml"  reset="true">

希望有帮助

if your trying to return your XML directly to the browser, you might want to try something like for the user to download it

<cfheader name="Content-Disposition" charset="utf-8" value="attachment; filename=export.xml">
<cfcontent variable="#someXMLPacket#" type="text/xml"  reset="true">

or, if you want it returned as a webpage (ala REST) then this should do the trick

<cfheader charset="utf-8">
<cfcontent variable="#someXMLPacket#" type="text/xml"  reset="true">

hope that helps

回复收藏 0 原文

眼睛会笑 2024-08-17 05:47:39

不幸的是，XMLFormat 并不是一个包罗万象的解决方案。它的字符列表非常有限，将取代[文档]。

您需要对 XML 无效但 XMLFormat 未涵盖的字符进行自定义编码。

这绝对不是很有效，但一个潜在的解决方案是逐个字符地循环典型可疑字段的内容（对于初学者来说，用户生成的任何内容），检查 ascii 代码，如果它高于 255，则忽略字符或对其进行正确编码。

回复收藏 0 原文

病毒体 2024-08-17 05:47:39

这对我来说也是一个大问题，事实证明字符集是主要因素，您需要明确指定正确的字符集。

对我来说，我在 xml 中有外语，并且在我输入正确的字符集之前不会被正确解析......

回复收藏 0 原文

~没有更多了~

关于作者

薄荷梦

暂无简介

0 文章

0 评论

23 人气

关注发私信

ni139999

文章 0 评论 0

关注

Smile

文章 0 评论 0

关注

木子李

文章 0 评论 0

关注

仅此而已

文章 0 评论 0

关注

qq_2gSKZM

文章 0 评论 0

关注

内心激荡

文章 0 评论 0

友情链接

文江博客

Coldfusion XMLFormat() 不转换所有字符

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（6）

关于作者

相关话题

热门标签

推荐作者

ni139999

Smile

木子李

仅此而已

qq_2gSKZM

内心激荡

友情链接

Coldfusion XMLFormat() 不转换所有字符

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（6）

关于作者

相关话题

热门标签

推荐作者

ni139999

Smile

木子李

仅此而已

qq_2gSKZM

内心激荡

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。