当前位置：文江博客知识库 en-US 文档 Glossary Character_set

Character set - MDN Web Docs Glossary: Definitions of Web-related terms 编辑

A character set is an encoding system to let computers know how to recognize Character, including letters, numbers, punctuation marks, and whitespace.

In earlier times, countries developed their own character sets due to their different languages used, such as Kanji JIS codes (e.g. Shift-JIS, EUC-JP, etc.) for Japanese, Big5 for traditional Chinese, and KOI8-R for Russian. However, Unicode gradually became most acceptable character set for its universal language support.

If a character set is used incorrectly (For example, Unicode for an acticle encoded in Big5), you may see nothing but broken characters, which are called Mojibake.

分享到QQ

分享到微博