Purpose of Unions in C and C++

Posted on 2024-08-23 05:09:46 · 1953 words · 6 views · 0 comments

I have used unions comfortably in the past; today I was alarmed when I read this post and learned that this code

union ARGB
{
    uint32_t colour;

    struct componentsTag
    {
        uint8_t b;
        uint8_t g;
        uint8_t r;
        uint8_t a;
    } components;

} pixel;

pixel.colour = 0xff040201;  // ARGB::colour is the active member from now on

// somewhere down the line, without any edit to pixel

if(pixel.components.a)      // accessing the non-active member ARGB::components

is actually undefined behaviour, i.e. reading from a member of the union other than the one most recently written to leads to undefined behaviour. If this isn't the intended usage of unions, what is? Can someone please explain it in detail?
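For comparison, here is a minimal sketch of extracting the same components without a union at all, using shifts and masks. The type `argb_components` and the function `argb_split` are hypothetical names introduced only for this illustration; the sketch assumes the packed value is laid out as 0xAARRGGBB, which matches what the union in the question produces on a little-endian machine.

```c
#include <stdint.h>

/* Hypothetical helper type, not part of the original post. */
struct argb_components {
    uint8_t a, r, g, b;
};

/* Split a packed 0xAARRGGBB value using shifts and masks.
   No union is involved, so no inactive member is ever read, and the
   result does not depend on the byte order of the machine. */
struct argb_components argb_split(uint32_t colour)
{
    struct argb_components c;
    c.a = (uint8_t)((colour >> 24) & 0xffu);
    c.r = (uint8_t)((colour >> 16) & 0xffu);
    c.g = (uint8_t)((colour >> 8) & 0xffu);
    c.b = (uint8_t)(colour & 0xffu);
    return c;
}
```

Calling `argb_split(0xff040201)` yields `a == 0xff`, `r == 0x04`, `g == 0x02` and `b == 0x01` on any platform.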

Update:

I wanted to clarify a few things in hindsight.

The answer to the question isn't the same for C and C++; my ignorant younger self tagged it as both C and C++.
After scouring through the C++11 standard I couldn't conclusively say that it calls out accessing/inspecting a non-active union member as undefined/unspecified/implementation-defined. All I could find was §9.5/1:
If a standard-layout union contains several standard-layout structs that share a common initial sequence, and if an object of this standard-layout union type contains one of the standard-layout structs, it is permitted to inspect the common initial sequence of any of standard-layout struct members.

§9.2/19: Two standard-layout structs share a common initial sequence if corresponding members have layout-compatible types and either neither member is a bit-field or both are bit-fields with the same width for a sequence of one or more initial members.
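The common-initial-sequence rule quoted above can be sketched roughly as follows. The event types and function names are hypothetical, invented for this illustration; the only point is that `kind` is the first member of both structs, so it may be inspected through either one while the union is visible.

```c
/* Hypothetical message types that share the common initial
   sequence { int kind; }. */
enum { KIND_KEY = 1, KIND_MOUSE = 2 };

struct key_event   { int kind; int keycode; };
struct mouse_event { int kind; int x; int y; };

union event {
    struct key_event   key;
    struct mouse_event mouse;
};

/* Stores a mouse_event in the union. */
union event make_mouse_event(int x, int y)
{
    union event e;
    e.mouse.kind = KIND_MOUSE;
    e.mouse.x = x;
    e.mouse.y = y;
    return e;
}

/* Reads `kind` through `key` regardless of which struct was stored.
   This is the inspection the quoted rule permits, because `kind`
   belongs to the common initial sequence of both structs. */
int event_kind(const union event *e)
{
    return e->key.kind;
}
```

Reading `e->key.keycode` after storing a `mouse_event` would fall outside the common initial sequence, so the quoted rule says nothing in its favour.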
In C (C99 TC3, DR 283 onwards), on the other hand, it is legal to do so (thanks to Pascal Cuoq for bringing this up). However, attempting to do it can still lead to undefined behavior if the value read happens to be invalid (a so-called "trap representation") for the type it is read through. Otherwise, the value read is implementation-defined.
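As a C-only illustration of the paragraph above, here is a minimal sketch that reads the bytes of a `float` through a `uint32_t` member. The function name `float_bits` is hypothetical. The sketch assumes that `float` and `uint32_t` have the same size and that `float` uses the IEEE 754 binary32 format; both are common but not guaranteed. `uint32_t` has no padding bits, so it has no trap representations and the read cannot hit the undefined-behavior case mentioned above.

```c
#include <stdint.h>

/* Returns the object representation of a float as a 32-bit integer.
   Assumes sizeof(float) == sizeof(uint32_t); this is an assumption of
   the sketch, not something the C standard guarantees. */
uint32_t float_bits(float f)
{
    union { float f; uint32_t u; } pun;
    pun.f = f;      /* `f` is the member most recently written */
    return pun.u;   /* C99 TC3 onwards: the bytes are reinterpreted as uint32_t */
}
```

On an IEEE 754 platform `float_bits(1.0f)` is `0x3f800000`. The value is implementation-defined in the sense described above, since it depends on the platform's floating-point representation.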
C89/90 called this out under unspecified behavior (Annex J) and K&R's book says it's implementation defined. Quote from K&R:
This is the purpose of a union - a single variable that can legitimately hold any one of several types. [...] so long as the usage is consistent: the type retrieved must be the type most recently stored. It is the programmer's responsibility to keep track of which type is currently stored in a union; the results are implementation-dependent if something is stored as one type and extracted as another.
Extract from Stroustrup's TC++PL (emphasis mine)
Use of unions can be essential for compactness of data [...] sometimes misused for "type conversion".

Above all, this question (whose title has remained unchanged since I asked it) was posed with the intention of understanding the purpose of unions, and not what the standard allows. For example, using inheritance for code reuse is, of course, allowed by the C++ standard, but that wasn't the purpose or the original intention of introducing inheritance as a C++ language feature. This is the reason Andrey's answer continues to be the accepted one.

燕归巢 2024-08-30 05:09:46

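The purpose of unions is rather obvious, but for some reason people overlook it quite often.

The purpose of a union is to save memory by using the same memory region to store different objects at different times. That's it.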
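The memory saving can be seen by comparing a struct and a union with the same members. This is a small sketch with hypothetical type names; the exact sizes depend on the platform, only the ordering between them is the point.

```c
/* Hypothetical payload types, chosen only to illustrate the size difference. */
struct all_at_once {     /* every member has its own storage */
    double number;
    char   text[16];
    long   count;
};

union one_at_a_time {    /* all members share the same storage */
    double number;
    char   text[16];
    long   count;
};
```

Here `sizeof(struct all_at_once)` is at least the sum of the member sizes, while `sizeof(union one_at_a_time)` only has to accommodate the largest member, `text`, plus any padding required for alignment.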

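It is like a room in a hotel. Different people stay in it for non-overlapping periods of time. These people never meet and generally know nothing about each other. By properly managing the time-sharing of the rooms (i.e. by making sure that different people are never assigned to the same room at the same time), a relatively small hotel can accommodate a relatively large number of people, which is what hotels are for.

That is exactly what a union does. If you know that several objects in your program hold values with non-overlapping value lifetimes, you can "merge" these objects into a union and thus save memory. Just as a hotel room has at most one "active" tenant at any moment, a union has at most one "active" member at any moment of program time. Only the "active" member can be read. By writing to another member you switch the "active" status to that other member.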
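One common way to keep track of the "active" member is to store a tag next to the union. The following is a minimal sketch of that idea; the names `value`, `value_kind` and the accessor functions are hypothetical and not taken from the answer.

```c
/* Hypothetical tagged value: `kind` records which member is active. */
enum value_kind { VALUE_INT, VALUE_DOUBLE };

struct value {
    enum value_kind kind;
    union {
        int    i;
        double d;
    } as;
};

/* Writing a member makes it the active one; the tag is updated to match. */
void value_set_int(struct value *v, int i)
{
    v->kind = VALUE_INT;
    v->as.i = i;
}

void value_set_double(struct value *v, double d)
{
    v->kind = VALUE_DOUBLE;
    v->as.d = d;
}

/* Reads only the member that the tag says is active. */
double value_as_double(const struct value *v)
{
    switch (v->kind) {
    case VALUE_INT:    return (double)v->as.i;
    case VALUE_DOUBLE: return v->as.d;
    }
    return 0.0;  /* not reached for valid tags */
}
```

The tag plays the role of the hotel's booking register: every read goes through it, so the code never touches a member other than the one most recently written.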

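For some reason, this original purpose of the union got "overridden" by something completely different: writing one member of a union and then inspecting it through another member. This kind of memory reinterpretation (aka "type punning") is ~~not a valid use of unions. It generally leads to undefined behavior~~ described as producing implementation-defined behavior in C89/90.

EDIT: Using unions for type punning (i.e. writing one member and then reading another) was given a more detailed definition in one of the Technical Corrigenda to the C99 standard (see DR#257 and DR#283). Keep in mind, however, that formally this does not protect you from running into undefined behavior by attempting to read a trap representation.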
