在Qt中，QTextCodec::codecForName(“UTF-16”)和codecForName(“UTF-32”)如何决定使用的字节序？

发布于 2024-12-05 06:22:13 字数 273 浏览 8 评论 0原文

在 Qt 文档中，它指出（除其他外）支持以下 Unicode 字符串编码：

UTF-8
UTF-16
UTF-16BE
UTF-16LE
UTF-32
UTF-32BE
UTF-32LE

由于为 2 和列出了三种不同的编解码器4个八位字节编码的Unicode，我想知道：两个非字节编解码器（“UTF-16”和“UTF-32”）如何决定哪个使用字节序？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

强者自强 2024-12-12 06:22:13

根据 src/corelibs/codecs/ 中的源代码，Qt 似乎使用主机的 UTF-16 和 UTF-32 字节顺序。

如果您使用 QTextCodec 读取具有 BOM 的现有 Unicode 字符串，并且您没有明确要求忽略标头，则将使用在字符串中检测到的字节顺序。

在 *qutfcodec_p.h* 中，QUtf16Codec::e 和 QUtf32Codec::e 均使用值 DetectEndianness（枚举）进行初始化.
在 qutfcodec.cpp 中，类 QUtf16 中的函数 convertFromUnicode 和 convertToUnicode 的开头附近QUtf32（由QUtf16Codec和QUtf32Codec使用），您可以找到以下行：
```
endian = (QSysInfo::ByteOrder == QSysInfo::BigEndian) 
    ？ BigEndianness : LittleEndianness;
```

Based on the source code in src/corelibs/codecs/, it seems Qt uses the byte ordering of the host for UTF-16 and UTF-32.

If you use QTextCodec to read an existing Unicode string that has a BOM, and you didn't explicitly ask to ignore the header, the byte ordering detected in the string is used.

In *qutfcodec_p.h* both QUtf16Codec::e and QUtf32Codec::e are initialized with the value DetectEndianness (an enum).
In qutfcodec.cpp, near the beginning of the functions convertFromUnicode and convertToUnicode from the classes QUtf16 and QUtf32 (used by QUtf16Codec and QUtf32Codec), you can find the line:
```
endian = (QSysInfo::ByteOrder == QSysInfo::BigEndian) 
    ? BigEndianness : LittleEndianness;
```