System.Text.Encoding.UTF8.GetString returns garbage

Posted 2024-09-13 12:46:39


This is a tough one. I have a Response filter set up to transform the HTML before spitting it back out to the browser (http://aspnetresources.com/articles/HttpFilters). This works fine on everyone's machine but mine. Actually, it was working on my machine until I had to do a hard reset because it locked up.

public override void Write(byte[] buffer, int offset, int count)
{
    // Decode the buffered response bytes as UTF-8 text.
    string strBuffer = System.Text.Encoding.UTF8.GetString(buffer, offset, count);
    // ... transform strBuffer and write the result to the underlying stream ...
}

For everyone else (and for me previously) strBuffer contains HTML. Now, for whatever reason, it's returning junk characters for me. Any ideas? I'm pulling my hair out!!

Update

Turns out that "Enable dynamic content compression" is causing the issue. For some reason it's getting gzipped before being passed into the filter.

Solution

Setting the "dynamicCompressionBeforeCache" to false in the web.config fixed the issue.

<urlCompression doStaticCompression="true" doDynamicCompression="true" dynamicCompressionBeforeCache="false" />
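As the update notes, the bytes reaching the filter were a gzip stream rather than HTML. Every gzip stream begins with the magic bytes 0x1F 0x8B (31, 139 in decimal), so a filter can cheaply detect this condition before attempting to decode. A minimal sketch (this is illustrative, not the original filter's code; `IsGzip` and the demo harness are hypothetical names):

```csharp
using System;
using System.IO;
using System.IO.Compression;
using System.Text;

class GzipCheck
{
    // Returns true if the buffer starts with the gzip magic bytes 0x1F 0x8B.
    static bool IsGzip(byte[] buffer, int offset, int count)
    {
        return count >= 2 && buffer[offset] == 0x1F && buffer[offset + 1] == 0x8B;
    }

    static void Main()
    {
        // Simulate what IIS dynamic compression hands the filter:
        // a gzipped response body instead of raw HTML.
        byte[] compressed;
        using (var ms = new MemoryStream())
        {
            using (var gz = new GZipStream(ms, CompressionMode.Compress))
            {
                byte[] html = Encoding.UTF8.GetBytes("<html></html>");
                gz.Write(html, 0, html.Length);
            }
            compressed = ms.ToArray();
        }

        // A compressed body is not valid UTF-8 text, so GetString would
        // produce "junk characters" exactly as described in the question.
        Console.WriteLine(IsGzip(compressed, 0, compressed.Length)); // True
    }
}
```

Detecting the signature at least lets the filter fail loudly (or pass the bytes through untouched) instead of silently producing garbage.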


Comments (2)

稀香 2024-09-20 12:46:46


You've specified these bytes: 31, 139, 8, 0, 0, 0, 0, 0, 4

That's not valid UTF-8. In particular, it would mean Unicode character U+001F ("INFORMATION SEPARATOR ONE") followed by bytes 139 and 8... and 139 followed by 8 isn't a valid UTF-8 byte sequence. Even if those did form a valid sequence, you'd then have 5 Unicode U+0000 characters (NUL) followed by U+0004 (END OF TRANSMISSION). Hardly valid HTML.

I don't know what you're actually filtering, but it isn't valid UTF-8 text. It doesn't look likely to be text at all, in fact. Is it possible that you're actually trying to apply a filter to binary data such as an image?

Note that you have another fundamental problem with your method of filtering: you're assuming that each buffer contains complete text. It's quite possible for you to receive one buffer which contains the first half of a character and then a second buffer containing the remainder of it. That's what the System.Text.Decoder class is for - it's stateful, remembering partial characters.
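To illustrate the Decoder point above: a multi-byte UTF-8 character split across two buffers decodes correctly only when the decoder keeps state between calls. A minimal sketch (not the asker's filter; the split buffers are a contrived example):

```csharp
using System;
using System.Text;

class DecoderDemo
{
    static void Main()
    {
        // "é" is two bytes in UTF-8: 0xC3 0xA9. Split them across two "buffers",
        // as can happen across successive Write calls on a response filter.
        byte[] part1 = { 0xC3 };
        byte[] part2 = { 0xA9 };

        // Stateless decoding mangles the split character: each half alone is
        // invalid UTF-8 and becomes U+FFFD (the replacement character).
        string broken = Encoding.UTF8.GetString(part1) + Encoding.UTF8.GetString(part2);

        // A Decoder remembers the partial character between calls:
        Decoder decoder = Encoding.UTF8.GetDecoder();
        char[] chars = new char[2];
        int n = decoder.GetChars(part1, 0, 1, chars, 0);  // no complete char yet
        n += decoder.GetChars(part2, 0, 1, chars, n);     // completes "é"
        Console.WriteLine(new string(chars, 0, n)); // é
    }
}
```

The same Decoder instance must be reused across all Write calls for the lifetime of the response, which is why it is typically stored as a field on the filter stream.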

小女人ら 2024-09-20 12:46:44


Sounds like something went wrong. I too have had some strange behaviour after a lockup.
What worked for me was to delete the temp files in C:\Windows\Microsoft.NET\Framework\v2.0.50727\Temporary ASP.NET Files
