当前位置：文江博客话题详情

使用可变长度数组有任何开销吗？

发布于 2024-08-17 11:39:45 字数 65 浏览 11 评论 0原文

使用可变长度数组有一些开销吗？数组的大小可以在运行时通过命令行参数传递吗？与自动和动态分配数组相比，为什么要引入它？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

败给现实 2024-08-24 11:39:45

VLA 确实有一些开销（与“普通”命名的编译时大小的数组相比）。

首先，它具有运行时长度，而且该语言为您提供了在运行时获取数组实际大小的方法（使用 sizeof）。这立即意味着数组的实际大小必须存储在某个地方。这会导致每个阵列的内存开销微不足道。然而，由于 VLA 只能声明为自动对象，因此任何人都不会注意到这种内存开销。这就像声明一个额外的整型局部变量一样。

其次，VLA通常分配在堆栈上，但由于其大小可变，一般情况下在编译时无法知道其在内存中的确切位置。因此，底层实现通常必须将其实现为指向内存块的指针。这引入了一些额外的内存开销（对于指针），由于上述原因，这也是完全微不足道的。这也带来了轻微的性能开销，因为我们必须读取指针值才能找到实际的数组。这与访问 malloc 编辑的数组时获得的开销相同（并且不会使用命名的编译时大小的数组）。

由于 VLA 的大小是运行时整数值，因此它当然可以作为命令行参数传递。 VLA 并不关心它的大小从何而来。

VLA 是作为运行时大小的数组引入的，具有较低的分配/释放成本。它们介于“普通”命名的编译时大小的数组（分配-释放成本几乎为零，但大小固定）和 malloc 数组（具有运行时大小，但相对较高）之间。分配-解除分配成本）。

VLA [几乎]遵循与自动（即本地）对象相同的范围相关生命周期规则，这意味着在一般情况下它们不能替换 malloc-ed 数组。它们的适用性仅限于需要具有典型自动生命周期的快速运行时大小的数组的情况。

回复收藏 0 原文

可爱咩 2024-08-24 11:39:45

可变长度数组会产生一些运行时开销，但您必须非常努力地测量它。请注意，如果 vla 是可变长度数组，则 sizeof(vla) 不是编译时常量。

数组的大小可以在运行时传递给函数。如果您选择从命令行参数获取大小并将其转换为整数并在运行时将其传递给函数，那就这样吧——它会起作用。

使用可变长度数组是因为变量会自动分配到正确的大小，并在函数退出时自动释放。这可以避免过度分配空间（当您主要使用最小大小时，为最大可能大小分配足够的空间），并避免内存清理问题。

此外，对于多维数组，AFAIK 它的行为更像 Fortran - 您可以动态配置所有维度，而不是为除数组的前导维度之外的所有维度坚持固定大小。

VLA 的一些运行时开销的具体证据 - 至少对于 SPARC (Solaris 10) 上的 GCC 4.4.2 而言。

考虑下面的两个文件：

vla.c - 使用可变长度数组

#include <assert.h>
#include <stddef.h>
extern size_t identity_matrix(int n, int m);

size_t identity_matrix(int n, int m)
{
    int vla[n][m];
    int i, j;
    assert(n > 0 && n <= 32);
    assert(m > 0 && m <= 32);
    for (i = 0; i < n; i++)
    {
        for (j = 0; j < m; j++)
        {
            vla[i][j] = 0;
        }
        vla[i][i] = 1;
    }
    return(sizeof(vla));
}

fla.c - 使用固定长度数组

#include <assert.h>
#include <stddef.h>
extern size_t identity_matrix(int n, int m);

size_t identity_matrix(int n, int m)
{
    int fla[32][32];
    int i, j;
    assert(n > 0 && n <= 32);
    assert(m > 0 && m <= 32);
    for (i = 0; i < n; i++)
    {
        for (j = 0; j < m; j++)
        {
            fla[i][j] = 0;
        }
        fla[i][i] = 1;
    }
    return(sizeof(fla));
}

编译和目标文件大小

出于比较目的，本地数组的名称不同（vla 与 fla），并且数组的尺寸在声明时不同 - 否则，文件是相同的。

我编译使用：

$ gcc -O2 -c -std=c99 fla.c vla.c

目标文件大小有些不同 - 通过“ls”和“size”来测量：

$ ls -l fla.o vla.o
-rw-r--r--   1 jleffler rd          1036 Jan  9 12:13 fla.o
-rw-r--r--   1 jleffler rd          1176 Jan  9 12:13 vla.o
$ size fla.o vla.o
fla.o: 530 + 0 + 0 = 530
vla.o: 670 + 0 + 0 = 670

我没有进行广泛的测试来查看有多少开销是固定的，有多少是可变的，但是有使用 VLA 的开销。

There is some run-time overhead with variable-length arrays, but you would have to be working fairly hard to measure it. Note that sizeof(vla) is not a compile-time constant if vla is a variable-length array.

The size of the array can be passed to a function at run-time. If you choose to take the size from a command line argument and convert that into an integer and pass that to the function at run-time, so be it -- it will work.

Variable-length arrays are used because the variables are automatically allocated to the correct size and automatically freed on exit from the function. This avoids over-allocating space (allocating enough space for the maximum possible size when you mostly work with minimal sizes), and avoids problems with memory clean up.

Additionally, with multi-dimensional arrays, AFAIK it behaves more like Fortran - you can dynamically configure all the dimensions, rather than being stuck with fixed sizes for all but the leading dimension of the array.

Concrete evidence of some run-time overhead for VLA - at least with GCC 4.4.2 on SPARC (Solaris 10).

Consider the two files below:

vla.c - using a variable-length array

#include <assert.h>
#include <stddef.h>
extern size_t identity_matrix(int n, int m);

size_t identity_matrix(int n, int m)
{
    int vla[n][m];
    int i, j;
    assert(n > 0 && n <= 32);
    assert(m > 0 && m <= 32);
    for (i = 0; i < n; i++)
    {
        for (j = 0; j < m; j++)
        {
            vla[i][j] = 0;
        }
        vla[i][i] = 1;
    }
    return(sizeof(vla));
}

fla.c - using a fixed-length array

#include <assert.h>
#include <stddef.h>
extern size_t identity_matrix(int n, int m);

size_t identity_matrix(int n, int m)
{
    int fla[32][32];
    int i, j;
    assert(n > 0 && n <= 32);
    assert(m > 0 && m <= 32);
    for (i = 0; i < n; i++)
    {
        for (j = 0; j < m; j++)
        {
            fla[i][j] = 0;
        }
        fla[i][i] = 1;
    }
    return(sizeof(fla));
}

Compilation and object file sizes

For comparison purposes, the names of the local array are different (vla vs fla), and the dimensions on the array are different when it is declared - otherwise, the files are the same.

I compiled using:

$ gcc -O2 -c -std=c99 fla.c vla.c

The object file sizes are somewhat different - as measured both by 'ls' and by 'size':

$ ls -l fla.o vla.o
-rw-r--r--   1 jleffler rd          1036 Jan  9 12:13 fla.o
-rw-r--r--   1 jleffler rd          1176 Jan  9 12:13 vla.o
$ size fla.o vla.o
fla.o: 530 + 0 + 0 = 530
vla.o: 670 + 0 + 0 = 670

I've not done extensive testing to see how much of the overhead is fixed and how much is variable, but there is overhead in using a VLA.

回复收藏 0 原文