Structure padding and packing

原来分手还会想你 2024-10-12 04:56:39

Padding aligns structure members to "natural" address boundaries: for example, int members get offsets that are 0 mod 4 on a 32-bit platform. Padding is on by default. It inserts the following "gaps" into your first structure:

struct mystruct_A {
    char a;
    char gap_0[3]; /* inserted by compiler: for alignment of b */
    int b;
    char c;
    char gap_1[3]; /* -"-: for alignment of the whole struct in an array */
} x;

Packing, on the other hand, prevents the compiler from inserting padding. It has to be requested explicitly; under GCC it's __attribute__((__packed__)), so the following:

struct __attribute__((__packed__)) mystruct_A {
    char a;
    int b;
    char c;
};

would produce a structure of size 6 on a 32-bit architecture.

A note, though: unaligned memory access is slower on architectures that allow it (like x86 and amd64), and is explicitly prohibited on strict-alignment architectures like SPARC.
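To see the two layouts side by side, here is a minimal sketch that prints member offsets for a padded and a packed version of the same struct. The names padded_A, packed_A and print_layouts are made up for this illustration, and the attribute is a GCC/Clang extension, so this assumes one of those compilers.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Default layout: the compiler inserts padding as it sees fit. */
struct padded_A {
    char a;
    int b;
    char c;
};

/* GCC/Clang extension: no padding between or after members. */
struct __attribute__((__packed__)) packed_A {
    char a;
    int b;
    char c;
};

/* Without padding, the size is exactly the sum of the member sizes. */
static_assert(sizeof(struct packed_A) == 2 * sizeof(char) + sizeof(int),
              "packed struct should have no padding");

/* Prints where each member landed in both layouts (hypothetical helper). */
void print_layouts(void) {
    printf("padded: b at %zu, c at %zu, size %zu\n",
           offsetof(struct padded_A, b), offsetof(struct padded_A, c),
           sizeof(struct padded_A));
    printf("packed: b at %zu, c at %zu, size %zu\n",
           offsetof(struct packed_A, b), offsetof(struct packed_A, c),
           sizeof(struct packed_A));
}
```

On a platform where int is 4 bytes and 4-byte aligned, the padded layout puts b at offset 4 and c at offset 8 for a total size of 12, while the packed layout puts them at offsets 1 and 5 for a total size of 6.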

沙沙粒小 2024-10-12 04:56:39

(The answers above explain the reason quite clearly, but are not entirely clear about the size of the padding, so I will add an answer based on what I learned from The Lost Art of Structure Packing, which has since evolved to cover not only C but also Go and Rust.)

Memory alignment (for struct)

Rules:

Before each individual member, padding is inserted so that the member starts at an address divisible by its alignment requirement.
E.g., on many systems an int should start at an address divisible by 4 and a short at one divisible by 2.
char and char[] are special: they can start at any memory address, so they need no padding before them.
For a struct, besides the alignment needs of each individual member, the size of the whole struct is rounded up, by padding at the end, to a multiple of the strictest alignment requirement of any of its members.
E.g., on many systems, if the member with the strictest alignment is an int the size is divisible by 4, and if it is a short, by 2.

Order of members:

The order of the members can affect the actual size of the struct, so keep that in mind.
E.g., stu_c and stu_d in the example below have the same members in a different order, and the two structs end up with different sizes.
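The rules and the ordering effect can be worked through by hand. The sketch below models them with plain arithmetic; align_up, struct member and struct_size are hypothetical helpers written for this illustration, and the real layout is always decided by the compiler and the platform ABI.

```c
#include <assert.h>
#include <stddef.h>

/* Rounds offset up to the next multiple of align (align must be > 0). */
static size_t align_up(size_t offset, size_t align) {
    assert(align > 0);
    return (offset + align - 1) / align * align;
}

/* Size and alignment requirement of one member (illustrative model). */
struct member {
    size_t size;
    size_t align;
};

/* Applies the two rules: pad before each member, then pad the end
 * to a multiple of the strictest member alignment. */
size_t struct_size(const struct member *m, size_t n) {
    size_t offset = 0, max_align = 1;
    for (size_t i = 0; i < n; i++) {
        offset = align_up(offset, m[i].align);  /* padding before the member */
        offset += m[i].size;                    /* the member itself */
        if (m[i].align > max_align)
            max_align = m[i].align;
    }
    return align_up(offset, max_align);         /* trailing padding */
}
```

With an int of size and alignment 4, a long of size and alignment 8 and a char of size 1, the order int, long, char gives 4 + 4 (padding) + 8 + 1 = 17, rounded up to 24, while long, int, char gives 8 + 4 + 1 = 13, rounded up to 16.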

Address in memory (for struct)

Empty space:

Empty space between two structs can be used by non-struct variables that fit into it.
E.g., in test_struct_address() below, the variable x resides between the adjacent structs g and h.
Whether or not x is declared, h's address doesn't change; x just reuses the empty space that g wasted.
The case of y is similar.
This is what the compiler did in the run shown below; where local variables are placed on the stack is up to the compiler.

Example

(for a 64-bit system)

memory_align.c:

/**
 * Memory align & padding - for struct.
 * compile: gcc memory_align.c
 * execute: ./a.out
 */ 
#include <stdio.h>

// size is 8, 4 + 1, then round to multiple of 4 (int's size),
struct stu_a {
    int i;
    char c;
};

// size is 16, 8 + 1, then round to multiple of 8 (long's size),
struct stu_b {
    long l;
    char c;
};

// size is 24, l needs 4 bytes of padding before it, then round to multiple of 8 (long's size),
struct stu_c {
    int i;
    long l;
    char c;
};

// size is 16, 8 + 4 + 1, then round to multiple of 8 (long's size),
struct stu_d {
    long l;
    int i;
    char c;
};

// size is 16, 8 + 4 + 1, then round to multiple of 8 (double's size),
struct stu_e {
    double d;
    int i;
    char c;
};

// size is 24, d needs to be aligned to 8, then round to multiple of 8 (double's size),
struct stu_f {
    int i;
    double d;
    char c;
};

// size is 4,
struct stu_g {
    int i;
};

// size is 8,
struct stu_h {
    long l;
};

// test - padding within a single struct
int test_struct_padding(void) {
    // sizeof yields a size_t, so it is printed with %zu
    printf("%s: %zu\n", "stu_a", sizeof(struct stu_a));
    printf("%s: %zu\n", "stu_b", sizeof(struct stu_b));
    printf("%s: %zu\n", "stu_c", sizeof(struct stu_c));
    printf("%s: %zu\n", "stu_d", sizeof(struct stu_d));
    printf("%s: %zu\n", "stu_e", sizeof(struct stu_e));
    printf("%s: %zu\n", "stu_f", sizeof(struct stu_f));

    printf("%s: %zu\n", "stu_g", sizeof(struct stu_g));
    printf("%s: %zu\n", "stu_h", sizeof(struct stu_h));

    return 0;
}

// test - address of struct
int test_struct_address(void) {
    printf("%s: %zu\n", "stu_g", sizeof(struct stu_g));
    printf("%s: %zu\n", "stu_h", sizeof(struct stu_h));
    printf("%s: %zu\n", "stu_f", sizeof(struct stu_f));

    struct stu_g g;
    struct stu_h h;
    struct stu_f f1;
    struct stu_f f2;
    int x = 1;
    long y = 1;

    // %p expects a void pointer, hence the casts
    printf("address of %s: %p\n", "g", (void *)&g);
    printf("address of %s: %p\n", "h", (void *)&h);
    printf("address of %s: %p\n", "f1", (void *)&f1);
    printf("address of %s: %p\n", "f2", (void *)&f2);
    printf("address of %s: %p\n", "x", (void *)&x);
    printf("address of %s: %p\n", "y", (void *)&y);

    // g is only 4 bytes itself, but the distance to the next struct is 16 bytes (on a 64-bit system) or 8 bytes (on a 32-bit system)
    printf("space between %s and %s: %ld\n", "g", "h", (long)(&h) - (long)(&g));

    // h is only 8 bytes itself, but the distance to the next struct is 16 bytes (on a 64-bit system) or 8 bytes (on a 32-bit system)
    printf("space between %s and %s: %ld\n", "h", "f1", (long)(&f1) - (long)(&h));

    // f1 is only 24 bytes itself, but the distance to the next struct is 32 bytes (on a 64-bit system) or 24 bytes (on a 32-bit system)
    printf("space between %s and %s: %ld\n", "f1", "f2", (long)(&f2) - (long)(&f1));

    // x is not a struct; it reuses the empty space left between structs, e.g. between g & h
    printf("space between %s and %s: %ld\n", "x", "f2", (long)(&x) - (long)(&f2));
    printf("space between %s and %s: %ld\n", "g", "x", (long)(&x) - (long)(&g));

    // y is not a struct; it reuses the empty space left between structs, e.g. between h & f1
    printf("space between %s and %s: %ld\n", "x", "y", (long)(&y) - (long)(&x));
    printf("space between %s and %s: %ld\n", "h", "y", (long)(&y) - (long)(&h));

    return 0;
}

int main(int argc, char * argv[]) {
    test_struct_padding();
    // test_struct_address();

    return 0;
}

Execution result - test_struct_padding():

stu_a: 8
stu_b: 16
stu_c: 24
stu_d: 16
stu_e: 16
stu_f: 24
stu_g: 4
stu_h: 8

Execution result - test_struct_address():

stu_g: 4
stu_h: 8
stu_f: 24
address of g: 0x7fffd63a95d0  // struct variable - address divisible by 16,
address of h: 0x7fffd63a95e0  // struct variable - address divisible by 16,
address of f1: 0x7fffd63a95f0 // struct variable - address divisible by 16,
address of f2: 0x7fffd63a9610 // struct variable - address divisible by 16,
address of x: 0x7fffd63a95dc  // non-struct variable - resides within the empty space between struct variables g & h.
address of y: 0x7fffd63a95e8  // non-struct variable - resides within the empty space between struct variables h & f1.
space between g and h: 16
space between h and f1: 16
space between f1 and f2: 32
space between x and f2: -52
space between g and x: 12
space between x and y: 12
space between h and y: 8

Thus the start address of each variable ends in g:d0 x:dc h:e0 y:e8

飞烟轻若梦 2024-10-12 04:56:39

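I know this question is old and most answers here explain padding really well, but while trying to understand it myself I found that having a "visual" image of what is happening helped.

The processor reads memory in "chunks" of a definite size (a word). Say the processor word is 8 bytes long. It looks at memory as one big row of 8-byte building blocks. Every time it needs some information from memory, it reaches for one of those blocks and fetches it.

[Image: Variables Alignment]

As the image above shows, it doesn't matter where a char (1 byte long) is, since it will always be inside one of those blocks, requiring the CPU to process only 1 word.

When we deal with data larger than one byte, like a 4-byte int or an 8-byte double, the way it is aligned in memory affects how many words the CPU has to process. If 4-byte chunks are aligned so that they always fit inside a block (the memory address being a multiple of 4), only one word has to be processed. Otherwise, a 4-byte chunk could have part of itself in one block and part in another, requiring the processor to process 2 words to read this data.

The same applies to an 8-byte double, except that now it must be at a memory address that is a multiple of 8 to guarantee it is always inside one block.

This considers an 8-byte word processor, but the concept applies to other word sizes.

Padding works by filling the gaps between those pieces of data to make sure they are aligned with those blocks, thus improving performance when reading memory.

However, as other answers state, sometimes space matters more than performance itself. Maybe you are processing lots of data on a computer that doesn't have much RAM (swap space could be used, but it is much slower). You could rearrange the variables in the program until the least padding is needed (as was well exemplified in some other answers), but if that's not enough you could explicitly disable padding, which is what packing is.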
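The block picture can be turned into a small calculation. This is only a sketch of the simplified model described in this answer: words_touched is a hypothetical helper, and real hardware (caches, bus widths) is more involved.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Counts how many word-sized blocks a value of `size` bytes starting at
 * address `addr` touches, for a `word`-byte memory word. */
size_t words_touched(uintptr_t addr, size_t size, size_t word) {
    assert(size > 0 && word > 0);
    uintptr_t first = addr / word;               /* block holding the first byte */
    uintptr_t last = (addr + size - 1) / word;   /* block holding the last byte */
    return (size_t)(last - first + 1);
}
```

With 8-byte words, a 4-byte int at address 4 stays inside one block, while the same int at address 6 covers bytes 6 to 9 and therefore touches two blocks.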

纸短情长 2024-10-12 04:56:39

Structure packing suppresses structure padding. Padding is used when alignment matters most; packing is used when space matters most.

Some compilers provide a #pragma to suppress padding or to pack to n bytes. Some provide keywords to do this. Generally, the pragma used to modify structure padding has the format below (depending on the compiler):

#pragma pack(n)

For example, ARM provides the __packed keyword to suppress structure padding. Go through your compiler manual to learn more about this.

So a packed structure is a structure without padding.
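As a concrete illustration of the pragma form, here is a minimal sketch using the push/pop variant, which GCC, Clang and MSVC accept. The struct wire_header and its fields are invented for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#pragma pack(push, 1)   /* save the current setting, then pack to 1-byte boundaries */
struct wire_header {    /* hypothetical record, for illustration only */
    uint8_t  type;
    uint32_t length;
    uint16_t flags;
};
#pragma pack(pop)       /* restore the previous packing for later declarations */

/* 1 + 4 + 2 bytes with no padding in between or at the end. */
static_assert(sizeof(struct wire_header) == 7, "expected no padding");
static_assert(offsetof(struct wire_header, length) == 1,
              "length should follow type directly");
```

Using push and pop keeps the packing change local to the one declaration, so structs declared afterwards get the default layout again.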

Generally, packed structures are used

to save space
to format a data structure for transmission over a network using some protocol (this is not good practice, of course, because you need to deal with endianness)
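One common way around the endianness problem is to write each field into a byte buffer explicitly instead of sending a packed struct as-is. The following is a sketch of that alternative, not something this answer prescribes; put_u32_be, get_u32_be and check_roundtrip are hypothetical helper names.

```c
#include <assert.h>
#include <stdint.h>

/* Writes v into buf[0..3] in big-endian (network) order, whatever the host order. */
void put_u32_be(uint8_t *buf, uint32_t v) {
    buf[0] = (uint8_t)(v >> 24);
    buf[1] = (uint8_t)(v >> 16);
    buf[2] = (uint8_t)(v >> 8);
    buf[3] = (uint8_t)v;
}

/* Reads a big-endian 32-bit value back from buf[0..3]. */
uint32_t get_u32_be(const uint8_t *buf) {
    return ((uint32_t)buf[0] << 24) | ((uint32_t)buf[1] << 16) |
           ((uint32_t)buf[2] << 8)  |  (uint32_t)buf[3];
}

/* Round-trip check: serialise a value, then parse it again. */
void check_roundtrip(void) {
    uint8_t buf[4];
    put_u32_be(buf, 0x11223344u);
    assert(buf[0] == 0x11 && buf[3] == 0x44);
    assert(get_u32_be(buf) == 0x11223344u);
}
```

Because the bytes are produced with shifts rather than by copying memory, the result does not depend on the host's byte order or on how the compiler lays out any struct.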

断爱 2024-10-12 04:56:39

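Padding and packing are just two aspects of the same thing:

packing or alignment is the size to which each member is rounded up
padding is the extra space added to match the alignment

In mystruct_A, assuming a default alignment of 4, each member is aligned on a multiple of 4 bytes. Since the size of char is 1, the padding for a and c is 4 - 1 = 3 bytes, while no padding is required for int b, which is already 4 bytes. It works the same way for mystruct_B.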

人疚 2024-10-12 04:56:39

Variables are stored at addresses divisible by their alignment (generally, their size). So padding/packing is not just for structs. In fact, all data has its own alignment requirement:

#include <stdint.h>

int main(void) {
    // For convenience, assume `c` is stored in the first byte of a machine
    // word. If `c` were stored in the last byte of the previous word, no
    // padding bytes would be needed before `i`, because `i` would
    // automatically be aligned at the start of a new word.

    char      c;  // may start at any address (every address is divisible by 1).
    char pad[3];  // unused memory so that `i` starts at an aligned address.
    int32_t   i;  // starts at an address divisible by 4.
    return 0;
}

This is similar for a struct, but there are some differences. First, we can say there are two kinds of padding: a) to start each member at a properly aligned address, some bytes are inserted between members; b) to start the next struct instance at a properly aligned address, some bytes are appended to each struct:

// Example for rule 1 below.
struct st {
    char      c;  // starts at an address divisible by 4, not 1.
    char pad[3];  // unused memory so that `i` starts at an aligned address.
    int32_t   i;  // starts at an address divisible by 4.
};

// Example for rule 2 below.
struct st {
    int32_t   i;  // starts at an address divisible by 4.
    char      c;  // may start at any address.
    char pad[3];  // unused memory so that the next `st` (or anything with the
                  // same alignment requirement) starts at an aligned address.
};

The struct's first member always starts at an address divisible by the struct's own alignment requirement, which is determined by the strictest alignment requirement among its members (here 4, the alignment of int32_t). This differs from ordinary variables: an ordinary variable can start at any address divisible by its own alignment, but that is not the case for a struct's first member. As you know, the address of a struct is the same as the address of its first member.
There can be additional trailing padding bytes inside a struct, so that the next struct (or the next element in an array of structs) starts at a properly aligned address. Think of struct st arr[2];. To make arr[1] (that is, arr[1]'s first member) start at an address divisible by 4, 3 bytes have to be appended at the end of each struct.

This is what I learned from The Lost Art of Structure Packing.

NOTE: You can inspect a data type's alignment requirement with the _Alignof operator. Also, you can get a member's offset inside a struct with the offsetof macro.
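Both tools from the note can be tried on the second struct st from above (without the explicit pad member, so that the compiler adds the padding itself). This is a small sketch; print_st_layout is a hypothetical helper and the printed numbers depend on the platform.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

struct st {
    int32_t i;
    char    c;
};

/* Prints the alignment, member offsets and array stride of struct st. */
void print_st_layout(void) {
    struct st arr[2];
    printf("alignment of struct st: %zu\n", (size_t)_Alignof(struct st));
    printf("offset of i: %zu, offset of c: %zu\n",
           offsetof(struct st, i), offsetof(struct st, c));
    printf("sizeof(struct st): %zu\n", sizeof(struct st));
    /* Consecutive array elements are exactly sizeof(struct st) apart. */
    assert((char *)&arr[1] - (char *)&arr[0] == (ptrdiff_t)sizeof(struct st));
    /* The size is a multiple of the alignment, so arr[1] is aligned too. */
    assert(sizeof(struct st) % _Alignof(struct st) == 0);
}
```

The two assertions express rule 2: because the trailing padding is counted in sizeof, every element of an array of struct st starts at a properly aligned address.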

哭了丶谁疼 2024-10-12 04:56:39

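Rules for padding:

Every member of the struct should be at an address divisible by its size.
Padding is inserted between elements or at the end of the struct to make sure this rule is met. This is done for easier and more efficient bus access by the hardware.
Padding at the end of the struct is decided based on the size of the largest member of the struct.

Why rule 2:
Consider the following struct,

If we were to create an array (of 2 structs) of this struct,
no padding would be required at the end:

Therefore, size of the struct = 8 bytes

Assume we were to create another struct as below:

[Image: Struct 2]

If we were to create an array of this struct,
there are two possibilities for the number of padding bytes required at the end.

A. If we add 3 bytes at the end and align it for int and not long:

B. If we add 7 bytes at the end and align it for long:

[Image: Struct2 array aligned to long]

The start address of the second array element is a multiple of 8 (i.e. 24).
Size of the struct = 24 bytes

Therefore, by aligning the start address of the next array element to a multiple of the largest member (i.e. if we were to create an array of this struct, the first address of the second element must start at a multiple of the largest member of the struct), we can calculate the number of padding bytes required at the end.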

生来就爱笑 2024-10-12 04:56:39

Are these structures padded or packed?

They're padded.

The only possibility that initially springs to mind, where they could be packed, is if char and int were the same size, so that the minimum size of the char/int/char structure would allow for no padding, ditto for the int/char structure.

However, that would require both sizeof(int) and sizeof(char) to be four (to get the twelve and eight sizes). The whole theory falls apart since it's guaranteed by the standard that sizeof(char) is always one.

Were char and int the same width, the sizes would be one and one, not four and four. So, in order to then get a size of twelve, there would have to be padding after the final field.

When does padding or packing take place?

Whenever the compiler implementation wants it to. Compilers are free to insert padding between fields, and following the final field (but not before the first field).

This is usually done for performance, as some types perform better when they're aligned on specific boundaries. There are even some architectures that will refuse to function (i.e., crash) if you try to access unaligned data (yes, I'm looking at you, ARM).

You can generally control packing/padding (which is really opposite ends of the same spectrum) with implementation-specific features such as #pragma pack. Even if you cannot do that in your specific implementation, you can check your code at compile time to ensure it meets your requirement (using standard C features, not implementation-specific stuff).

For example:

// C11 or better ...
#include <assert.h>
struct strA { char a; int  b; char c; } x;
struct strB { int  b; char a;         } y;
static_assert(sizeof(struct strA) == sizeof(char)*2 + sizeof(int), "No padding allowed");
static_assert(sizeof(struct strB) == sizeof(char)   + sizeof(int), "No padding allowed");

Something like this will refuse to compile if there is any padding in those structures.

小瓶盖 2024-10-12 04:56:39

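Structure packing is only done when you explicitly tell your compiler to pack the structure. Padding is what you're seeing. Your 32-bit system is padding each field to word alignment. If you had told your compiler to pack the structures, they'd be 6 and 5 bytes, respectively. Don't do that, though. It's not portable and makes compilers generate much slower (and sometimes even buggy) code.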
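One place where packed structs tend to cause trouble is taking the address of a misaligned member and using it as an ordinary pointer. A cautious alternative, sketched below, is to copy the bytes out with memcpy. The struct packed_rec and the helper read_value are invented for this illustration, and the attribute assumes GCC or Clang.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

struct __attribute__((__packed__)) packed_rec {  /* GCC/Clang extension */
    char     tag;
    uint32_t value;   /* sits at offset 1, so it is usually misaligned */
};

/* Copies the member out byte-wise; this never forms a misaligned
 * uint32_t pointer. */
uint32_t read_value(const struct packed_rec *r) {
    uint32_t v;
    memcpy(&v, (const char *)r + offsetof(struct packed_rec, value), sizeof v);
    return v;
}
```

Accessing r->value directly is also handled by the compiler, since it knows the struct is packed; the risk lies in passing &r->value to code that expects a normally aligned uint32_t pointer.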

皓月长歌 2024-10-12 04:56:39

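There are no buts about it! To grasp the subject, you must do the following:

Peruse The Lost Art of Structure Packing by Eric S. Raymond
Glance at Eric's code example
Last but not least, don't forget the following rule about padding: a struct is aligned to the alignment requirement of its largest type.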

欢烬 2024-10-12 04:56:39

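Data structure alignment is the way data is arranged and accessed in computer memory. It consists of two separate but related issues: data alignment and data structure padding. When a modern computer reads from or writes to a memory address, it does so in word-sized chunks (e.g. 4-byte chunks on a 32-bit system) or larger. Data alignment means putting the data at a memory address equal to some multiple of the word size, which increases the system's performance because of the way the CPU handles memory. To align the data, it may be necessary to insert some meaningless bytes between the end of the last data structure and the start of the next, which is data structure padding.

To align data in memory, one or more empty bytes (addresses) are inserted (or left empty) between the memory addresses allocated for other structure members during memory allocation. This concept is called structure padding.
The architecture of a computer processor is such that it can read 1 word (4 bytes on a 32-bit processor) from memory at a time.
To make use of this, data is always aligned in 4-byte packages, which leads to empty addresses being inserted between other members' addresses.
Because of this structure padding concept in C, the size of a structure is often not what we expect.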
