当前位置：文江博客话题详情

调用 std::vector::size() 与读取变量一样快吗？

发布于 2024-08-31 04:56:24 字数 228 浏览 12 评论 0原文

我对一个大整数向量进行了广泛的计算。计算过程中向量大小不会改变。代码经常访问向量的大小。一般来说，

使用 vector::size() 函数或
使用辅助常量 vectorSize 来存储向量的大小哪个更快？

我知道编译器通常在设置正确的编译器标志时内联 size() 函数，但是，这不能保证。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

盗心人 2024-09-07 04:56:24

有趣的问题。

那么，会发生什么呢？如果你使用 gdb 进行调试，你会看到类似 3 个成员变量的内容（名称不准确）：

_M_begin：指向动态数组第一个元素的指针
_M_end：指针动态数组的最后一个元素过去了一个
_M_capacity：指针过去了可以存储在动态数组中的最后一个元素

vector::size()< 的实现因此， /code> 通常简化为：

return _M_end - _M_begin;  // Note: _Mylast - _Myfirst in VC 2008

现在，在考虑实际可能的优化时需要考虑两件事：

该函数是否会内联？可能：我不是编译器编写者，但这是一个不错的选择，因为函数调用的开销会使这里的实际时间相形见绌，而且由于它是模板化的，我们在翻译单元中拥有所有可用的代码，
结果将被缓存（即有点一个未命名的局部变量）：很可能是这样，但除非你反汇编生成的代码，否则你不会知道

换句话说：

如果你自己存储 size ，那么它很可能会像编译器能够得到的速度尽可能快。
如果不这样做，则取决于编译器是否可以确定没有其他东西正在修改向量；如果不是，则无法缓存该变量，并且每次都需要执行内存读取（L1）。

这是一个微观优化。一般来说，它是不明显的，要么因为性能无关紧要，要么因为编译器无论如何都会执行它。在编译器不应用优化的关键循环中，这可能是一个重大改进。

Interesting question.

So, what's going to happened ? Well if you debug with gdb you'll see something like 3 member variables (names are not accurate):

_M_begin: pointer to the first element of the dynamic array
_M_end: pointer one past the last element of the dynamic array
_M_capacity: pointer one past the last element that could be stored in the dynamic array

The implementation of vector<T,Alloc>::size() is thus usually reduced to:

return _M_end - _M_begin;  // Note: _Mylast - _Myfirst in VC 2008

Now, there are 2 things to consider when regarding the actual optimizations possible:

will this function be inlined ? Probably: I am no compiler writer, but it's a good bet since the overhead of a function call would dwarf the actual time here and since it's templated we have all the code available in the translation unit
will the result be cached (ie sort of having an unnamed local variable): it could well be, but you won't know unless you disassemble the generated code

In other words:

If you store the size yourself, there is a good chance it will be as fast as the compiler could get it.
If you do not, it will depend on whether the compiler can establish that nothing else is modifying the vector; if not, it cannot cache the variable, and will need to perform memory reads (L1) every time.

It's a micro-optimization. In general, it will be unnoticeable, either because the performance does not matter or because the compiler will perform it regardless. In a critical loop where the compiler does not apply the optimization, it can be a significant improvement.

回复收藏 0 原文

剑心龙吟 2024-09-07 04:56:24

据我了解 1998 C++ 规范，vector::size() 需要恒定时间，而不是线性时间。因此，这个问题可能归结为读取局部变量是否比调用执行很少工作的函数更快。

因此，我认为将向量的 size() 存储在局部变量中会稍微加快程序速度，因为您只会调用该函数（并且因此执行一次而不是多次所需的时间量很小。

回复收藏 0 原文

白色秋天 2024-09-07 04:56:24

vector::size() 的性能：是吗？
与读取变量一样快？

可能不会。

重要吗

可能不重要。

除非每次迭代所做的工作很小（例如一两个整数运算），否则开销将微不足道。

回复收藏 0 原文

私野 2024-09-07 04:56:24

在我见过的每个实现中，都看到 vector::size() 执行 end() 和 begin() 的减法，即它不是与读取变量一样快。

实现向量时，实现者必须在 end() 或 size() 之间做出选择，即存储初始化元素的数量或指针/迭代器到最后一个初始化元素之后的元素。
换句话说;使用迭代器进行迭代。

如果您担心 size() 性能，请像这样编写基于索引的 for 循环；

for (size_t i = 0, i_end = container.size(); i < i_end; ++i){
// do something performance critical
}

In every implementation I've, seen vector::size() performs a subtraction of end() and begin(), ie its not as fast as reading a variable.

When implementing a vector, the implementer has to make a choice between which shall be fastest, end() or size(), ie storing the number of initialized elements or the pointer/iterator to the element after the last initialized element.
In other words; iterate by using iterators.

If you are worried of the size() performance, write your index based for loop like this;

for (size_t i = 0, i_end = container.size(); i < i_end; ++i){
// do something performance critical
}

回复收藏 0 原文

池予 2024-09-07 04:56:24

我总是将 vector.size() 保存在局部变量中（如果大小在循环内没有改变！）。
在每次迭代中调用它与将其保存在局部变量中相比可以更快。
至少，我是这么经历的。
我无法给你任何真实的数字，因为我很久以前就测试过了。然而，据我所知，它产生了明显的差异（但可能仅在调试模式下），特别是在嵌套循环时。

对于所有抱怨微优化的人：
这是一行额外的代码，不会带来任何缺点。

回复收藏 0 原文