在循环中声明变量是否有任何开销？ (C++)

发布于 2024-07-23 07:01:12 字数 307 浏览 15 评论 0原文

我只是想知道如果您执行以下操作是否会造成速度或效率损失：

int i = 0;
while(i < 100)
{
    int var = 4;
    i++;
}

声明 int var 一百次。在我看来好像会有，但我不确定。这样做是否会更实用/更快：

int i = 0;
int var;
while(i < 100)
{
    var = 4;
    i++;
}

或者它们在速度和效率方面是否相同？

原文

I am just wondering if there would be any loss of speed or efficiency if you did something like this:

int i = 0;
while(i < 100)
{
    int var = 4;
    i++;
}

which declares int var one hundred times. It seems to me like there would be, but I'm not sure. would it be more practical/faster to do this instead:

int i = 0;
int var;
while(i < 100)
{
    var = 4;
    i++;
}

or are they the same, speedwise and efficiency-wise?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

泅人 2024-07-30 07:01:12

局部变量的堆栈空间通常在函数作用域中分配。因此循环内部不会发生堆栈指针调整，只是将 4 分配给 var。因此，这两个片段具有相同的开销。

回复收藏 0 原文

夜灵血窟げ 2024-07-30 07:01:12

对于原始类型和 POD 类型来说，没有区别。在这两种情况下，编译器都会在函数开头为变量分配堆栈空间，并在函数返回时释放它。

对于具有重要构造函数的非 POD 类类型，它会产生影响 - 在这种情况下，将变量放在循环之外只会调用一次构造函数和析构函数，并且每次迭代都会调用赋值运算符，而将其放在循环内部循环将为循环的每次迭代调用构造函数和析构函数。根据类的构造函数、析构函数和赋值运算符的作用，这可能是理想的，也可能不是理想的。

回复收藏 0 原文

隔纱相望 2024-07-30 07:01:12

它们都是相同的，通过查看编译器的作用（即使没有将优化设置为高），您可以通过以下方式找到答案：

看看编译器（gcc 4.0）对您的简单示例做了什么：

1.c：

main(){ int var; while(int i < 100) { var = 4; } }

gcc -S 1.c

1.s:

_main:
    pushl   %ebp
    movl    %esp, %ebp
    subl    $24, %esp
    movl    $0, -16(%ebp)
    jmp L2
L3:
    movl    $4, -12(%ebp)
L2:
    cmpl    $99, -16(%ebp)
    jle L3
    leave
    ret

2.c

main() { while(int i < 100) { int var = 4; } }

gcc -S 2.c

2.s:

_main:
        pushl   %ebp
        movl    %esp, %ebp
        subl    $24, %esp
        movl    $0, -16(%ebp)
        jmp     L2
L3:
        movl    $4, -12(%ebp)
L2:
        cmpl    $99, -16(%ebp)
        jle     L3
        leave
        ret

从这些中，您可以看到两件事：首先，两者的代码是相同的。

其次， var 的存储是在循环外部分配的：

         subl    $24, %esp

最后，循环中唯一的事情是赋值和条件检查：

L3:
        movl    $4, -12(%ebp)
L2:
        cmpl    $99, -16(%ebp)
        jle     L3

这在不完全删除循环的情况下尽可能高效。

They are both the same, and here's how you can find out, by looking at what the compiler does (even without optimisation set to high):

Look at what the compiler (gcc 4.0) does to your simple examples:

1.c:

main(){ int var; while(int i < 100) { var = 4; } }

gcc -S 1.c

1.s:

_main:
    pushl   %ebp
    movl    %esp, %ebp
    subl    $24, %esp
    movl    $0, -16(%ebp)
    jmp L2
L3:
    movl    $4, -12(%ebp)
L2:
    cmpl    $99, -16(%ebp)
    jle L3
    leave
    ret

2.c

main() { while(int i < 100) { int var = 4; } }

gcc -S 2.c

2.s:

_main:
        pushl   %ebp
        movl    %esp, %ebp
        subl    $24, %esp
        movl    $0, -16(%ebp)
        jmp     L2
L3:
        movl    $4, -12(%ebp)
L2:
        cmpl    $99, -16(%ebp)
        jle     L3
        leave
        ret

From these, you can see two things: firstly, the code is the same in both.

Secondly, the storage for var is allocated outside the loop:

         subl    $24, %esp

And finally the only thing in the loop is the assignment and condition check:

L3:
        movl    $4, -12(%ebp)
L2:
        cmpl    $99, -16(%ebp)
        jle     L3

Which is about as efficient as you can be without removing the loop entirely.

回复收藏 0 原文

天涯沦落人 2024-07-30 07:01:12

如今，最好在循环内声明它，除非它是常量，因为编译器将能够更好地优化代码（减少变量范围）。

编辑：这个答案现在基本上已经过时了。随着后经典编译器的兴起，编译器无法弄清楚的情况越来越少。我仍然可以构建它们，但大多数人会将这种构建归类为糟糕的代码。

回复收藏 0 原文

手心的海 2024-07-30 07:01:12

大多数现代编译器都会为您优化这一点。话虽这么说，我会使用你的第一个例子，因为我发现它更具可读性。

回复收藏 0 原文

笑，眼淚并存 2024-07-30 07:01:12

对于内置类型，两种样式之间可能没有区别（可能就生成的代码而言）。

但是，如果变量是具有重要构造函数/析构函数的类，则运行时成本很可能存在重大差异。我通常会将变量的范围限制在循环内部（以保持范围尽可能小），但如果这对性能产生影响，我会考虑将类变量移到循环范围之外。然而，这样做需要一些额外的分析，因为颂歌路径的语义可能会改变，所以只有在语义允许的情况下才能这样做。

RAII 类可能需要这种行为。例如，管理文件访问生存期的类可能需要在每次循环迭代时创建和销毁，以正确管理文件访问。

假设您有一个 LockMgr 类，该类在构造时获取临界区并在销毁时释放它：

while (i< 100) {
    LockMgr lock( myCriticalSection); // acquires a critical section at start of
                                      //    each loop iteration

    // do stuff...

}   // critical section is released at end of each loop iteration

与以下情况有很大不同：

LockMgr lock( myCriticalSection);
while (i< 100) {

    // do stuff...

}

For a built-in type there will likely be no difference between the 2 styles (probably right down to the generated code).

However, if the variable is a class with a non-trivial constructor/destructor there could well be a major difference in runtime cost. I'd generally scope the variable to inside the loop (to keep the scope as small as possible), but if that turns out to have a perf impact I'd look to moving the class variable outside the loop's scope. However, doing that needs some additional analysis as the semantics of the ode path may change, so this can only be done if the sematics permit it.

An RAII class might need this behavior. For example, a class that manages file access lifetime might need to be created and destroyed on each loop iteration to manage the file access properly.

Suppose you have a LockMgr class that acquires a critical section when it's constructed and releases it when destroyed:

while (i< 100) {
    LockMgr lock( myCriticalSection); // acquires a critical section at start of
                                      //    each loop iteration

    // do stuff...

}   // critical section is released at end of each loop iteration

is quite different from:

LockMgr lock( myCriticalSection);
while (i< 100) {

    // do stuff...

}

回复收藏 0 原文

oО清风挽发oО 2024-07-30 07:01:12

两个循环具有相同的效率。它们都将花费无限长的时间:) 在循环内增加 i 可能是一个好主意。

回复收藏 0 原文

柠檬 2024-07-30 07:01:12

我曾经进行过一些性能测试，令我惊讶的是，情况1实际上更快！我想这可能是因为在循环内声明变量会减少其范围，因此它会更早被释放。然而，那是很久以前的事了，在一个非常古老的编译器上。我确信现代编译器在优化差异方面做得更好，但保持变量范围尽可能短仍然没有坏处。

回复收藏 0 原文

原来是傀儡 2024-07-30 07:01:12

#include <stdio.h>
int main()
{
    for(int i = 0; i < 10; i++)
    {
        int test;
        if(i == 0)
            test = 100;
        printf("%d\n", test);
    }
}

上面的代码总是打印 100 10 次，这意味着每次函数调用循环内的局部变量只分配一次。

#include <stdio.h>
int main()
{
    for(int i = 0; i < 10; i++)
    {
        int test;
        if(i == 0)
            test = 100;
        printf("%d\n", test);
    }
}

Code above always prints 100 10 times which means local variable inside loop is only allocated once per each function call.

回复收藏 0 原文

坚持沉默 2024-07-30 07:01:12

唯一确定的方法就是给它们计时。但是，即使存在差异，差异也将是微小的，因此您将需要一个强大的定时循环。

更重要的是，第一个是更好的风格，因为它初始化变量 var，而另一个则使其未初始化。这以及定义变量应尽可能接近其使用点的准则意味着通常应首选第一种形式。

回复收藏 0 原文

高跟鞋的旋律 2024-07-30 07:01:12

如果只有两个变量，编译器可能会为这两个变量分配一个寄存器。无论如何，这些寄存器都在那里，所以这并不需要时间。无论哪种情况，都有 2 个寄存器写入指令和 1 个寄存器读取指令。

回复收藏 0 原文

热风软妹 2024-07-30 07:01:12

我认为大多数答案都忽略了一个需要考虑的要点：“是否清楚”，显然，通过所有讨论，事实是；不它不是。
我建议在大多数循环代码中，效率几乎不是问题（除非您计算火星着陆器），所以实际上唯一的问题是什么看起来更明智和可读& 可维护 - 在这种情况下，我建议预先声明变量& 在循环之外——这只是让它变得更清晰。然后像你这样的人& 我什至懒得浪费时间在网上检查它是否有效。

回复收藏 0 原文