在 C++ 中定义 UTF-16BE 字符串

发布于 2024-12-27 12:44:58 字数 222 浏览 4 评论 0原文

我需要定义如下所示的 unicode 字符串：

const char SOME_STRING[] = { 0, 5, 0, 'M', 0, 'y', 0, 'S', 0, 't', 0, 'r' };

这是 UTF-16BE 字符串，前面带有包含长度的大端短字节，它在 java 中使用，这就是我需要它的用途。有没有比单独输入每个字符更好/更干净的方法来声明它？

原文

I need to define unicode string that would look like so:

const char SOME_STRING[] = { 0, 5, 0, 'M', 0, 'y', 0, 'S', 0, 't', 0, 'r' };

This is UTF-16BE string prepended with big endian short containing length, it's used in java and that's what I need it for. Is there better/cleaner way to declare it than typing every character separately?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

喜爱纠缠 2025-01-03 12:44:58

您可以使用 wchar_t 代替，按需转换为字节，例如：

const wchar_t some_string[] = L"\x05MyStr";

int _tmain(int argc, _TCHAR* argv[])
{
    for (int i = 0; i <= some_string[0]; i++)
        printf("%d %d ", some_string[i] >> 8, some_string[i] & 0xFF);

    return 0;
}

You could use wchar_t instead, converting to bytes on demand, for example:

const wchar_t some_string[] = L"\x05MyStr";

int _tmain(int argc, _TCHAR* argv[])
{
    for (int i = 0; i <= some_string[0]; i++)
        printf("%d %d ", some_string[i] >> 8, some_string[i] & 0xFF);

    return 0;
}

回复收藏 0 原文