What character encoding matches this conversion: from "§" to "Ç"?

Posted 2024-09-06 21:03:26

The line below is an example from one of the many files I have with the wrong character encoding:

REAPRESENTA§AO VIA DTENTRY

The correct presentation should be this:

REAPRESENTAÇAO VIA DTENTRY

There are more characters with the wrong encoding. How do I correct this?

Screenshot: http://nerull.webs.com/screen.JPG

Comments (1)

花之痕靓丽 2024-09-13 21:03:26

The files themselves don't have the wrong encoding; the problem is that you are using the wrong encoding to decode them when you read them.

The fix is to decode the file with the same encoding that was used to encode it.
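If the text has already been decoded with the wrong encoding and no bytes were lost along the way, the mistake can often be undone by encoding it back and decoding it again. Here is a minimal Python sketch of that idea. The function name `redecode` and the encoding pair in the usage line are illustrative assumptions, not the encodings of the files in the question:

```python
def redecode(garbled: str, wrong_encoding: str, right_encoding: str) -> str:
    """Undo a wrong decode: recover the original bytes, then decode them properly."""
    # Encoding with the wrong codec reproduces the bytes that were in the file,
    # provided that codec maps every byte to a distinct character.
    raw = garbled.encode(wrong_encoding)
    return raw.decode(right_encoding)


# Illustrative pair only: UTF-8 bytes that were mistakenly read as Latin-1.
print(redecode("Ã©", "latin-1", "utf-8"))
```

When the file itself is still available, it is simpler to read it again, for example with `open(path, encoding=right_encoding)`.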

If you don't know which encoding that is, find the byte values of the problematic characters before they are decoded, then look for an encoding whose character set maps those byte values to the characters you want.
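One way to find those byte values is to work on the raw bytes and check which of them the wrong encoding displays as the bad character. This is only a sketch under two assumptions: the wrong encoding is a single-byte one, and the sample bytes are stand-in data (Latin-1), not the real file. The helper name `bytes_behind` is hypothetical:

```python
def bytes_behind(raw: bytes, wrong_encoding: str, bad_char: str) -> set[int]:
    """Return the byte values that the wrong single-byte encoding shows as bad_char."""
    found = set()
    for value in raw:
        # Decode one byte at a time; this only makes sense for single-byte encodings.
        shown = bytes([value]).decode(wrong_encoding, errors="replace")
        if shown == bad_char:
            found.add(value)
    return found


# Stand-in data: the sample line stored as Latin-1, where "§" is byte 0xA7.
sample = "REAPRESENTA§AO VIA DTENTRY".encode("latin-1")
print([hex(v) for v in sorted(bytes_behind(sample, "latin-1", "§"))])
```

For a real file, the bytes would come from reading it in binary mode, for example `open(path, "rb").read()`.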

For example, the file could have been encoded with IBM905, which encodes the character "Ç" as byte value 74 (0x4A). If you then decode the file with IBM278, byte value 74 is interpreted as the character "§".

Here is a list of the possible combinations that I found among the built-in encodings (the encoding used to write the file first, then the encoding used to decode it):

from cp875 to IBM290
from cp875 to IBM420
from cp875 to x-EBCDIC-KoreanExtended
from cp875 to IBM-Thai
from cp875 to IBM880
from IBM290 to IBM290
from IBM290 to IBM420
from IBM290 to x-EBCDIC-KoreanExtended
from IBM290 to IBM-Thai
from IBM290 to IBM880
from IBM420 to IBM290
from IBM420 to IBM420
from IBM420 to x-EBCDIC-KoreanExtended
from IBM420 to IBM-Thai
from IBM420 to IBM880
from IBM424 to IBM290
from IBM424 to IBM420
from IBM424 to x-EBCDIC-KoreanExtended
from IBM424 to IBM-Thai
from IBM424 to IBM880
from x-EBCDIC-KoreanExtended to IBM290
from x-EBCDIC-KoreanExtended to IBM420
from x-EBCDIC-KoreanExtended to x-EBCDIC-KoreanExtended
from x-EBCDIC-KoreanExtended to IBM-Thai
from x-EBCDIC-KoreanExtended to IBM880
from IBM-Thai to IBM290
from IBM-Thai to IBM420
from IBM-Thai to x-EBCDIC-KoreanExtended
from IBM-Thai to IBM-Thai
from IBM-Thai to IBM880
from IBM880 to IBM290
from IBM880 to IBM420
from IBM880 to x-EBCDIC-KoreanExtended
from IBM880 to IBM-Thai
from IBM880 to IBM880
from cp1025 to IBM290
from cp1025 to IBM420
from cp1025 to x-EBCDIC-KoreanExtended
from cp1025 to IBM-Thai
from cp1025 to IBM880
from IBM1026 to IBM01143
from IBM1026 to IBM278
from IBM905 to IBM01143
from IBM905 to IBM278
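A search like the one behind the list above could be sketched in Python as follows. This is not necessarily how the list was produced. The function name `find_pairs` and the candidate list are assumptions, and Python's standard codecs are a different set from the encodings named above (several of those may not be available), so the output will generally differ:

```python
from itertools import product


def find_pairs(wanted: str, shown: str, encodings: list[str]) -> list[tuple[str, str]]:
    """List (file_encoding, wrong_encoding) pairs that turn `wanted` into `shown`."""
    pairs = []
    for file_enc, wrong_enc in product(encodings, repeat=2):
        try:
            raw = wanted.encode(file_enc)
            if raw.decode(wrong_enc) == shown:
                pairs.append((file_enc, wrong_enc))
        except (UnicodeError, LookupError):
            # The character does not exist in this encoding, the bytes are not
            # valid in the other one, or the codec name is unknown; skip the pair.
            continue
    return pairs


# Hypothetical candidate list; extend it with whatever codecs are installed.
candidates = ["latin-1", "utf-8", "cp1252", "cp437", "cp850", "cp1026"]
for file_enc, wrong_enc in find_pairs("Ç", "§", candidates):
    print(f"from {file_enc} to {wrong_enc}")
```

Checking several garbled characters from the same file against each candidate pair narrows the result, since a single character can match many unrelated pairs.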