当前位置：文江博客话题详情

哪种 STL 容器最适合 std::sort？（这还重要吗？）

发布于 2024-07-16 08:05:13 字数 101 浏览 3 评论 0原文

标题不言而喻......

容器的选择是否会以某种方式影响默认 std::sort 算法的速度？例如，如果我使用列表，排序算法只是切换节点指针还是切换节点中的整个数据？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

清风夜微凉 2024-07-23 08:05:13

选择确实会产生影响，但预测哪个容器最有效是非常困难的。最好的方法是使用最适合您的应用程序使用的容器（可能是 std::vector），查看该容器的排序是否足够快，如果是，则坚持使用它。如果没有，请对排序问题进行性能分析，并根据分析数据选择不同的容器。

作为一名前讲师和前培训师，我有时觉得个人对链表具有神秘的性能增强特性这一普遍观点负有责任。听一位知情者的话：链表出现在这么多教科书和教程中的唯一原因是因为对于编写这些书籍和教程的人来说，拥有一个可以说明指针、动态内存管理、递归的数据结构很方便，搜索和排序合二为一——这与效率无关。

回复收藏 0 原文

若沐 2024-07-23 08:05:13

我认为 std::sort 不适用于列表，因为它需要一个 list 未提供的随机访问迭代器。请注意，list<> 提供了 sort 方法，但它与 std::sort 完全分开。

容器的选择很重要。 STL 的 std::sort 依赖迭代器来抽象容器存储数据的方式。它只是使用您提供的迭代器来移动元素。这些迭代器在访问和分配元素方面的工作速度越快，std::sort 的工作速度就越快。

回复收藏 0 原文

爱你是孤单的心事 2024-07-23 08:05:13

std::list 绝对不是 std::sort() 的好（有效）选择，因为 std::sort() 需要随机访问迭代器。 std::map 和朋友也不好，因为元素的位置无法强制执行；也就是说，用户无法通过插入特定位置或交换来强制映射元素在映射中的位置。在标准容器中，我们只剩下 std::vector 和 std::deque。

std::sort() 与其他标准算法类似，它仅通过交换元素值 (*t = *s) 来起作用。因此，即使列表神奇地支持 O(1) 访问，链接也不会被重新组织，而是它们的值会被交换。

由于 std::sort() 不会更改容器的大小，因此无论您使用 std::vector 还是 std:: ，运行时性能都不会产生任何差异。双端队列。原始数组的排序速度也应该很快，甚至可能比标准容器更快——但我不认为速度差异足以证明使用它们的合理性。

回复收藏 0 原文

做个ˇ局外人 2024-07-23 08:05:13

这取决于元素类型。

如果您只是存储指针（或 POD），那么向量将是最快的。如果您要存储对象，那么列表的排序会更快，因为它将交换节点而不是物理元素。

回复收藏 0 原文

挽心 2024-07-23 08:05:13

排序算法对您的容器一无所知。它所知道的只是随机访问迭代器。因此，您可以对 STL 容器中不存在的内容进行排序。因此，它的速度取决于您提供的迭代器，以及取消引用和复制它们所指向的内容的速度。

std::sort 不适用于 std::list，因为排序需要随机访问迭代器。对于这种情况，您应该使用 std::list 的成员函数类型之一。这些成员函数将有效地交换链表指针，而不是复制元素。

回复收藏 0 原文

江挽川 2024-07-23 08:05:13

向量。

始终使用矢量作为默认值。与任何其他容器相比，它具有最低的空间开销和最快的访问速度（以及 C 兼容布局和随机访问迭代器等其他优点）。

现在，问问自己 - 您还用容器做什么？您需要强大的异常保证吗？列表、集合和映射可能是更好的选择（尽管它们都有自己的排序例程）。您需要定期将元素添加到容器的前面吗？考虑双端队列。您的容器是否需要始终进行分类？集合和地图可能更合适。

最后，明确什么是最适合您的，然后选择最合适的容器并衡量它如何满足您的需求。

回复收藏 0 原文

绻影浮沉 2024-07-23 08:05:13

我完全同意上面的人发表的言论。但学习新事物的最佳方法是什么？嘿！！！！当然不是阅读文本并死记硬背，但是......示例：D 最近我沉浸在 STL 指定的容器中，这里是不言自明的快速测试代码，我希望：

#include <iostream>
#include <vector>
#include <deque>
#include <array>
#include <list>
#include <iterator>
#include <cstdlib>
#include <algorithm>
#include "Timer.h"

constexpr int SIZE = 1005000;

using namespace std;

void test();

int main(){
    cout<<"array allocates "<<static_cast<double>(SIZE)/(1024*1024)<<" MB\n";
    test();


    return 0;
}


void test(){
    int values[SIZE];
    int size = 0;

    //init values to sort:
    do{
        values[size++] = rand() % 100000;
    }while(size < SIZE);

    //feed array with values:
    array<int, SIZE> container_1;
    for(int i = 0; i < SIZE; i++)
        container_1.at(i) = values[i];

    //feed vector with values
    vector<int> container_2(begin(values), end(values));
    list<int> container_3(begin(values), end(values)); 
    deque<int> container_4(begin(values), end(values)); 

    //meassure sorting time for containers
    {
       Timer t1("sort array");
       sort(container_1.begin(), container_1.end());
    }

    {
       Timer t2("sort vector");
       sort(container_2.begin(), container_2.end());
    }

    {
       Timer t3("sort list");
       container_3.sort();
    }

    {
       Timer t4("sort deque");
       sort(container_4.begin(), container_4.end());
    }

}

定时器的代码：

#include <chrono>
#include <string>
#include <iostream>

using namespace std;

class Timer{

public:
    Timer(string name = "unnamed") : mName(name){ mStart = chrono::system_clock::now();}
    ~Timer(){cout<<"action "<<mName<<" took: "<<
             chrono::duration_cast<chrono::milliseconds>(
                     chrono::system_clock::now() - mStart).count()<<"ms"<<endl;}
private:
    chrono::system_clock::time_point mStart;
    string mName;
};

这里是不使用优化时的结果（g++ --std=c++11 file.cpp -o a.out）：

数组分配 0.958443 MB
排序数组的操作花费了：183ms
动作排序向量花费：316ms
操作排序列表花费：725ms
操作排序双端队列花费：436ms

并且经过优化（g++ -O3 --std=c++11 file.cpp -o a.out）：

数组分配0.958443 MB< br>
排序数组的操作花费了：55ms
动作排序向量花费：57ms
操作排序列表花费：264ms
操作排序双端队列花费：67ms

请注意，尽管向量和数组在这种情况下排序的时间相似，但数组大小受到限制，因为它应该在堆栈上初始化（默认情况下，不使用自己的分配器等），

因此它还取决于您是否使用编译器优化，如果没有，我们可能会看到明显的差异。

I totally agree with the statements that guys have posted above. But what is the best way to learn new things? Hey!!!! surely not reading the text and learning by heart but.... EXAMPLES :D As recently I immersed in containers specified in STL, here is the quick test code that is self-explanatory, I hope:

#include <iostream>
#include <vector>
#include <deque>
#include <array>
#include <list>
#include <iterator>
#include <cstdlib>
#include <algorithm>
#include "Timer.h"

constexpr int SIZE = 1005000;

using namespace std;

void test();

int main(){
    cout<<"array allocates "<<static_cast<double>(SIZE)/(1024*1024)<<" MB\n";
    test();


    return 0;
}


void test(){
    int values[SIZE];
    int size = 0;

    //init values to sort:
    do{
        values[size++] = rand() % 100000;
    }while(size < SIZE);

    //feed array with values:
    array<int, SIZE> container_1;
    for(int i = 0; i < SIZE; i++)
        container_1.at(i) = values[i];

    //feed vector with values
    vector<int> container_2(begin(values), end(values));
    list<int> container_3(begin(values), end(values)); 
    deque<int> container_4(begin(values), end(values)); 

    //meassure sorting time for containers
    {
       Timer t1("sort array");
       sort(container_1.begin(), container_1.end());
    }

    {
       Timer t2("sort vector");
       sort(container_2.begin(), container_2.end());
    }

    {
       Timer t3("sort list");
       container_3.sort();
    }

    {
       Timer t4("sort deque");
       sort(container_4.begin(), container_4.end());
    }

}

And the code for timer:

#include <chrono>
#include <string>
#include <iostream>

using namespace std;

class Timer{

public:
    Timer(string name = "unnamed") : mName(name){ mStart = chrono::system_clock::now();}
    ~Timer(){cout<<"action "<<mName<<" took: "<<
             chrono::duration_cast<chrono::milliseconds>(
                     chrono::system_clock::now() - mStart).count()<<"ms"<<endl;}
private:
    chrono::system_clock::time_point mStart;
    string mName;
};

Here is the result when no optimization is used (g++ --std=c++11 file.cpp -o a.out):

array allocates 0.958443 MB
action sort array took: 183ms
action sort vector took: 316ms
action sort list took: 725ms
action sort deque took: 436ms

and with optimization (g++ -O3 --std=c++11 file.cpp -o a.out):

array allocates 0.958443 MB
action sort array took: 55ms
action sort vector took: 57ms
action sort list took: 264ms
action sort deque took: 67ms

Notice that although vector and array has similar times sorting for this case, array size is limited as it is supposed to be initialized on stack (by default, not using own allocators etc.)

So it depends also if you use optimization for compiler, if not, we may see noticeable difference.

回复收藏 0 原文

溇涏 2024-07-23 08:05:13

这确实很重要，因为不同的容器有不同的内存访问模式等，这可能会发挥作用。

但是，std::sort 不适用于 std::list<>::iterators，因为它们不是 RandomAccessIterators。此外，尽管可以实现对 std::list<> 的专门化来打乱节点的指针，但它可能会产生奇怪且令人惊讶的语义后果 - 例如。如果向量中的排序范围内有一个迭代器，则其值将在排序后发生变化，而对于此专业化来说，情况并非如此。

回复收藏 0 原文