Converting between UTF-8 and ISO 8859-1

Posted on 2025-01-06 10:27:45

I found the following code on SO. Does this really work?

String xml = new String("áéíóúñ");
byte[] latin1 = xml.getBytes("UTF-8");
byte[] utf8 = new String(latin1, "ISO-8859-1").getBytes("UTF-8");

I mean, latin1 holds UTF-8-encoded bytes after the second line, but the third line reads those bytes as if they were ISO-8859-1-encoded. Can this ever work?
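To make the concern concrete, here is a minimal sketch that runs the three quoted lines and compares the result with a direct UTF-8 encoding of the same string. The class name EncodingDemo and the helper methods questionSnippet and latin1ToUtf8 are hypothetical names chosen for illustration; they are not part of the quoted code. The string is written with Unicode escapes so the outcome does not depend on the source file's encoding.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class EncodingDemo {
    // "áéíóúñ" written with Unicode escapes (assumption: same text as in the question).
    public static final String TEXT = "\u00e1\u00e9\u00ed\u00f3\u00fa\u00f1";

    // Reproduces the quoted lines: encode as UTF-8, decode those bytes as
    // ISO-8859-1, then encode the resulting string as UTF-8 again.
    public static byte[] questionSnippet(String s) {
        byte[] latin1 = s.getBytes(StandardCharsets.UTF_8); // these are UTF-8 bytes despite the name
        return new String(latin1, StandardCharsets.ISO_8859_1).getBytes(StandardCharsets.UTF_8);
    }

    // Hypothetical variant: the input bytes really are ISO-8859-1 encoded.
    public static byte[] latin1ToUtf8(byte[] latin1) {
        return new String(latin1, StandardCharsets.ISO_8859_1).getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] direct = TEXT.getBytes(StandardCharsets.UTF_8);
        byte[] viaSnippet = questionSnippet(TEXT);
        byte[] viaLatin1 = latin1ToUtf8(TEXT.getBytes(StandardCharsets.ISO_8859_1));

        System.out.println("direct UTF-8 length:   " + direct.length);
        System.out.println("quoted snippet length: " + viaSnippet.length);
        System.out.println("snippet == direct:     " + Arrays.equals(direct, viaSnippet));
        System.out.println("latin1 path == direct: " + Arrays.equals(direct, viaLatin1));
    }
}
```

With this input, the quoted snippet should produce 24 bytes where the direct encoding produces 12, because each of the 12 UTF-8 bytes is read as a separate ISO-8859-1 character and then re-encoded as two bytes. Only the variant that starts from genuine ISO-8859-1 bytes matches the direct UTF-8 encoding.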

Not that I want to criticize the cited code; I am just confused, because I ran into some very similar legacy code that seems to work, and I cannot explain why.

EDIT: I guess that in the original post, "UTF-8" in line 2 was just a typo. But I am not sure...

EDIT2: After my initial posting, someone edited the code above and changed the second line to byte[] latin1 = xml.getBytes("ISO-8859-1");. I don't know who made that edit or why, but it clearly caused a lot of confusion. Sorry to everyone who saw the wrong version of the code. The code above is correct now.
