C++ Convert string (or char*) to wstring (or wchar_t*)

Posted 2024-08-27 10:29:03 · 141 characters · 15 views · 0 comments

string s = "おはよう";
wstring ws = FUNCTION(s, ws);

How would I assign the contents of s to ws?

I searched Google and tried several techniques, but none of them assigned the exact content; the result was distorted.

樱娆 2024-09-03 10:29:04

NOTE! See Note (2023-10-05) at the bottom!

Assuming that the input string in your example (おはよう) is a UTF-8 encoded representation of the Unicode string you are interested in (by the looks of it, it isn't, but let's assume it is for the sake of this explanation :-)), your problem can be fully solved with the standard library alone (C++11 and newer).

The TL;DR version:

#include <locale>
#include <codecvt>
#include <string>

std::wstring_convert<std::codecvt_utf8_utf16<wchar_t>> converter;
std::string narrow = converter.to_bytes(wide_utf16_source_string);
std::wstring wide = converter.from_bytes(narrow_utf8_source_string);

Longer online compilable and runnable example:

(They all show the same example. There are just many for redundancy...)

Note (old):

As pointed out in the comments and explained in https://stackoverflow.com/a/17106065/6345, converting between UTF-8 and UTF-16 with the standard library can give unexpectedly different results on different platforms. For a better conversion, consider std::codecvt_utf8, as described at http://en.cppreference.com/w/cpp/locale/codecvt_utf8

Note (new):

Since the codecvt header is deprecated in C++17, some concerns about the solution presented in this answer were raised. However, the C++ standards committee added an important statement in http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2017/p0618r0.html saying

this library component should be retired to Annex D, along side , until a suitable replacement is standardized.

So in the foreseeable future, the codecvt solution in this answer is safe and portable.

Note (2023-10-05):

There is a proposal to remove the deprecated codecvt and wstring_convert in C++26.
