Why does BinaryWriter prepend gibberish to the start of a stream? How do you avoid it?

Posted on 2024-08-06 05:18:16

I'm debugging some issues with writing pieces of an object to a file and I've gotten down to the base case of just opening the file and writing "TEST" in it. I'm doing this by something like:

static FileStream fs;
static BinaryWriter w;
fs = new FileStream(filename, FileMode.Create);
w = new BinaryWriter(fs);

w.Write("test");

w.Close();
fs.Close();

Unfortunately, this ends up prepending a box to the front of the file and it looks like so:

TEST, with a fun box on the front. Why is this, and how can I avoid it?

Edit: It does not seem to be displaying the box here, but it's the unicode character that looks like gibberish.


梦在深巷 2024-08-13 05:18:17

Sounds like a byte order mark.

http://en.wikipedia.org/wiki/Byte-order_mark

Perhaps you want to write the string as UTF-8.


绮筵 2024-08-13 05:18:16

They are not byte-order marks but a length-prefix, according to MSDN:

public virtual void Write(string value);

Writes a length-prefixed string to [the] stream

And you will need that length-prefix if you ever want to read the string back from that point. See BinaryReader.ReadString().
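To make the length prefix concrete, here is a minimal sketch that writes "test" into a MemoryStream instead of a file and then reads it back. The class and method names (LengthPrefixDemo, WriteString, ReadString) are made up for illustration; only BinaryWriter.Write(string) and BinaryReader.ReadString() come from the answer above. It assumes the writer's default encoding (UTF-8).

```csharp
using System;
using System.IO;

static class LengthPrefixDemo
{
    // Writes one string with BinaryWriter and returns the raw bytes it produced.
    public static byte[] WriteString(string value)
    {
        using (var ms = new MemoryStream())
        {
            using (var w = new BinaryWriter(ms))
            {
                w.Write(value); // length prefix first, then the encoded characters
            }
            // MemoryStream.ToArray still works after the stream has been closed.
            return ms.ToArray();
        }
    }

    // Reads the string back; ReadString consumes the prefix to know where to stop.
    public static string ReadString(byte[] data)
    {
        using (var r = new BinaryReader(new MemoryStream(data)))
        {
            return r.ReadString();
        }
    }

    static void Main()
    {
        byte[] raw = WriteString("test");
        Console.WriteLine(BitConverter.ToString(raw)); // 04-74-65-73-74
        Console.WriteLine(ReadString(raw));            // test
    }
}
```

The leading 04 is the "box" from the question: a text editor tries to render that control byte as a character.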

Additional

Since it seems you actually want a File-Header checker

1. Is it a problem? You read the length prefix back as well, so as a type check on the file it works fine.
2. You can convert the string to a byte[] array, probably using Encoding.ASCII. But then you have to either use a fixed (implied) length or prefix it yourself. After reading the byte[] you can convert it to a string again.
3. If you had a lot of text to write you could even attach a TextWriter to the same stream. But be careful, the writers want to close their streams. I wouldn't advise this in general, but it is good to know. Here too you will have to mark a point where the other reader can take over (a fixed header works fine).
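The fixed-length option can be sketched as follows. This is only an illustration under assumptions: the four-byte "TEST" signature and the names MagicHeaderDemo, WriteHeader and HasHeader are hypothetical, and a MemoryStream stands in for the file.

```csharp
using System;
using System.IO;
using System.Text;

static class MagicHeaderDemo
{
    // Hypothetical 4-byte file signature; its length is fixed, so no prefix is needed.
    static readonly byte[] Magic = Encoding.ASCII.GetBytes("TEST");

    // Writes the raw signature bytes directly to the stream.
    public static void WriteHeader(Stream s)
    {
        s.Write(Magic, 0, Magic.Length);
    }

    // Reads exactly Magic.Length bytes and compares them with the signature.
    public static bool HasHeader(Stream s)
    {
        var buf = new byte[Magic.Length];
        int read = 0;
        while (read < buf.Length)
        {
            int n = s.Read(buf, read, buf.Length - read);
            if (n == 0) return false; // stream ended before the header was complete
            read += n;
        }
        return Encoding.ASCII.GetString(buf) == "TEST";
    }

    static void Main()
    {
        var ms = new MemoryStream();
        WriteHeader(ms);
        Console.WriteLine(BitConverter.ToString(ms.ToArray())); // 54-45-53-54
        ms.Position = 0;
        Console.WriteLine(HasHeader(ms)); // True
    }
}
```

Because the reader knows the header is always four bytes, anything written after it (with a BinaryWriter or otherwise) starts at a known offset.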


久夏青 2024-08-13 05:18:16

That's because a BinaryWriter is writing the binary representation of the string, including the length of the string. If you were to write straight data (e.g. byte[], etc.) it won't include that length.

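// Encode the text yourself. Encoding.Unicode is UTF-16LE, so "test" becomes 8 bytes.
byte[] text = System.Text.Encoding.Unicode.GetBytes("test");
using (FileStream fs = new FileStream("C:\\test.txt", FileMode.Create))
using (BinaryWriter writer = new BinaryWriter(fs))
{
    // Write(byte[]) emits the raw bytes with no length prefix.
    writer.Write(text);
}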

You'll notice that it doesn't include the length. If you're going to be writing textual data using the binary writer, you'll need to convert it first.


岁吢 2024-08-13 05:18:16

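The byte at the start is the length of the string, written out as a variable-length integer.

The length counts encoded bytes. If the encoded string is 127 bytes or shorter, the length is stored in one byte. Once it reaches 128 bytes, the length takes two bytes, and it grows to three and four bytes at larger sizes.

The problem here is that you're using BinaryWriter, which writes out data that BinaryReader can read back in later. If you want to write a custom format of your own, you must either stop writing strings this way or stop using BinaryWriter entirely.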
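A small sketch can show where the prefix grows. PrefixSizeDemo and PrefixBytes are hypothetical names; the helper simply subtracts the UTF-8 byte count of the string from the total number of bytes BinaryWriter produced.

```csharp
using System;
using System.IO;
using System.Text;

static class PrefixSizeDemo
{
    // Returns how many bytes BinaryWriter spent on the length prefix for this string.
    public static int PrefixBytes(string value)
    {
        using (var ms = new MemoryStream())
        using (var w = new BinaryWriter(ms, Encoding.UTF8))
        {
            w.Write(value);
            w.Flush();
            // Total bytes written minus the encoded payload leaves the prefix.
            return (int)ms.Length - Encoding.UTF8.GetByteCount(value);
        }
    }

    static void Main()
    {
        Console.WriteLine(PrefixBytes(new string('a', 127)));     // 1
        Console.WriteLine(PrefixBytes(new string('a', 128)));     // 2
        Console.WriteLine(PrefixBytes(new string('a', 16384)));   // 3
        // 64 characters, but each one is 2 bytes in UTF-8, so 128 bytes in total.
        Console.WriteLine(PrefixBytes(new string('\u00E9', 64))); // 2
    }
}
```

The last line is why the threshold is better described in bytes than in characters.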


说好的呢 2024-08-13 05:18:16

As Henk pointed out in his answer, this is the length of the string (an int, written in 7-bit encoded form rather than as a fixed four bytes).

If you don't want this, you can either write "TEST" manually by writing the ASCII characters for each letter as bytes, or you could use:

System.Text.Encoding.UTF8.GetBytes("TEST")

And write the resulting array (which will NOT contain a length int)


深巷少女 2024-08-13 05:18:16

What you're seeing is actually a 7-bit encoded integer, which is a kind of integer compression.
BinaryWriter prepends this to the text so that readers (i.e. BinaryReader) know how long the written string is.

You can read more about it at http://dpatrickcaldwell.blogspot.se/2011/09/7-bit-encoding-with-binarywriter-in-net.html.
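For readers who want to see the scheme itself, here is a hand-written sketch of 7-bit encoding: each byte carries seven bits of the value, lowest bits first, and the high bit says whether another byte follows. SevenBitDemo, Encode and Decode are hypothetical names written for this illustration, not the framework's own source.

```csharp
using System;
using System.Collections.Generic;

static class SevenBitDemo
{
    // Encodes a non-negative int as 7-bit groups, least significant group first.
    public static byte[] Encode(int value)
    {
        var bytes = new List<byte>();
        uint v = (uint)value;
        while (v >= 0x80)
        {
            bytes.Add((byte)((v & 0x7F) | 0x80)); // low 7 bits plus the "more follows" flag
            v >>= 7;
        }
        bytes.Add((byte)v); // final group, high bit clear
        return bytes.ToArray();
    }

    // Decodes bytes produced by Encode, stopping at the first byte without the flag.
    public static int Decode(byte[] data)
    {
        int result = 0;
        int shift = 0;
        foreach (byte b in data)
        {
            result |= (b & 0x7F) << shift;
            shift += 7;
            if ((b & 0x80) == 0) break;
        }
        return result;
    }

    static void Main()
    {
        Console.WriteLine(BitConverter.ToString(Encode(4)));   // 04
        Console.WriteLine(BitConverter.ToString(Encode(128))); // 80-01
        Console.WriteLine(BitConverter.ToString(Encode(300))); // AC-02
        Console.WriteLine(Decode(new byte[] { 0xAC, 0x02 }));  // 300
    }
}
```

Encode(4) yields the single byte 04, which matches the one stray byte in front of a four-letter string such as "test".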


清风无影 2024-08-13 05:18:16

You can save it as a UTF8 encoded byte array like this:

...

BinaryWriter w = new BinaryWriter(fs);

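// Encoding.UTF8 is used here: UTF8Encoding.Default resolves to Encoding.Default, which is not guaranteed to be UTF-8.
w.Write(System.Text.Encoding.UTF8.GetBytes("test"));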

...


迷雾森÷林ヴ 2024-08-13 05:18:16

This is most likely a byte order mark. It happens because the stream's encoding is set to Unicode.


凝望流年 2024-08-13 05:18:16

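Remember that Java strings are internally encoded in UTF-16.

So "test" is actually made up of the bytes 0xff, 0xfe (together the byte order mark), 0x74, 0x00, 0x65, 0x00, 0x73, 0x00, 0x74, 0x00.

You probably want to work with bytes instead of streams of characters.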
