如何在 C++ 期间监控/显示进度种类

发布于 2024-09-06 18:30:33 字数 670 浏览 4 评论 0原文

我计划编写一个交互式 C++ 几何处理插件，该插件将经常对大量数据进行排序。尽管初步迹象表明排序只需要一两秒，但我更愿意在这段时间内显示进度 - 即我想每秒更新几次进度指示器。这比打开等待光标并使用户留下一个冻结不确定时间长度（即使只有几秒钟）的程序更好。

如果我要使用像 std::sort 这样的东西，我可以使用比较函数时不时地更新进度指示器，但我不知道“完成百分比”。我还可以将排序分解为子排序，更新子排序之间的进度，然后合并。我最好的选择可能是编写自己的排序方法，尽管我不知道需要付出多少努力才能获得与 std::sort 一样好的性能（并确保正确性）。无论如何，该排序方法偶尔会向回调方法发送“完成百分比”。

我想知道其他人是否遇到并解决了这个问题 - 我希望标准库中可能有一种排序方法可以实现我想要的功能，或者是我没有想到的其他技术。

更新：感谢迄今为止的精彩回答。有一些非常好的建议，我将推迟选择已接受的答案，直到我有机会在即将进行的项目中测试这些想法。

更新 2：我完成了我的项目，结果证明这不是问题（至少对于客户而言）。由于他们将销售该软件，他们仍然可能会从客户那里得到反馈改变他们的想法）。选择一个可接受的答案很困难，因为有很多好的答案，但最终我选择的答案指向一篇关于合并排序的维基文章，该文章有一个非常令人回味的动画。因此，如果我需要继续这样做，这就是我会采取的第一个策略）。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

恬淡成诗 2024-09-13 18:30:34

我认为，即使您编写自己的排序，如果您希望进度指示器准确，您也必须进行大量仔细的测量。如果您只想要一个近似的进度指示器，那么您可以使用一些指标，例如“比较元素之间的平均距离”或“与快速排序的平均预期数量相比的比较次数”作为指标，并实现您已经提到的比较想法。

是的，我认为您不是一个十足的白痴，也不打算在每次比较时更新进度指示器。如果你这样做，你会花更多的时间来指示进度而不是排序。

例如，您通常期望进行快速排序的 n log2 n 操作。对涉及多少比较的分析比一般测量更详细并且更准确，但出于本示例的目的，我们仅进行假设。因此，您可以计算比较次数并报告 number_of_comparisons / (n log2 n) 作为您对进度的估计。

由于这只是一个平均指标，我会进行一些实验，看看您的估计偏离了多少，并加入一些模糊因素以使其与平均预期情况相符。您还可以有一个进度条，通过“我认为我将完成的工作”来表示不确定性。指示器和指示器后面的一些空格。

即使你使用自己的排序并想出一个看似更精确的度量，进度条仍然不会平滑更新，效果也会相似。您确定排序将花费多长时间的唯一方法是，如果您使用稍微慢一些但真正可预测的排序，在这种情况下，您可以根据元素数量预测排序将花费多长时间，或者使用非常快的排序排序在特定情况下的行为不太可预测，在这种情况下，没有真正的方法可以拥有完全准确的进度条。

子任务的可预测性和比较总数的可预测性密切相关。所以我真的不认为子任务比比较总数更好。

如果您想使用自己的排序并且可预测性是您的最高目标，请选择堆排序。它仍然是一个 O(n log2 n) 排序，并且接近于最小比较排序（至少我在阅读 Knuth 时记得是这样）。无论输入的数据集是什么，它都需要非常可预测的时间才能完成。它是较慢的 O(n log2 n) 排序之一，但仍然如此。

正如您的一位评论者所提到的，您可能正在解决一个实际上并不存在的问题。首先进行一些实验。不管这个问题有什么用处，它都是一个有趣的智力挑战。 :-)

I think, even if you wrote your own sort, that you would have to do a lot of careful measurement if you wanted the progress indicator to be accurate. If you only want an approximate progress indicator, then you can use some metric like 'average distance between compared elements' or 'number of comparisons as compared to average expected number for quicksort' as your metric and implement the comparison idea you already mentioned.

And yes, I assume that you aren't a complete idiot and do not plan on updating the progress indicator at each and every comparison. If you did that you'd spend much more time indicating progress than sorting.

As an example, you would generally expect about n log2 n operations for quicksort. The analysis of how many comparisons are involved is more detailed and can be more accurate than that general measure, but for the purposes of this example, lets just assume. So you could count comparisons and report number_of_comparisons / (n log2 n) as your estimate of progress.

Since that's just an average indicator I would run a few experiments and see how far your estimate is off, and throw in some fudge factors to make it line up with the average expected case. You could also have a progress bar that indicated the uncertainty by having sort of "This is where I think I'll be done." indicator and some space after the indicator.

Even if you used your own sort and came up with a more seemingly precise measure, the progress bar still wouldn't update smoothly and the effect would be similar. The only way you know for sure how long your sort is going to take is if you use a somewhat slower, but really predictable sort, in which case you can predict how long it will take from the number of elements, or use a really fast sort that has less predictable behavior in specific cases, in which case there is no real way to have a perfectly accurate progress bar.

Predictability of subtasks and predictability of total number of comparisons are strongly linked. So I really don't think that subtasks make a better measure than total number of comparisons.

If you want to use your own sort and predictability is your highest goal, go for heapsort. It's still an O(n log2 n) sort, and it's close to being a minimum comparison sort (or so I remember from reading Knuth). It also takes a very predictable amount of time to complete regardless of the dataset its fed. It's one of the slower O(n log2 n) sorts, but still.

As one of your commenters mentioned though, you might be solving a problem that doesn't actually exist. Run some experiments first. The problem is a fun intellectual challenge regardless of its usefulness though. :-)

回复收藏 0 原文

时光无声 2024-09-13 18:30:34

由于 std::sort 是基于模板的，因此源应该在标头中可用。您可以复制它并插入进度回调。最大的问题是预测你距离完成有多近——大多数排序函数将基于快速排序，它并不总是进行相同数量的比较。

编写您自己的合并排序是可能的；该算法很简单，步骤数也明确定义。

回复收藏 0 原文

捂风挽笑 2024-09-13 18:30:34

我建议您使用第二个选项：使用 std::sort 或其他标准排序函数（如 qsort），并让比较器报告其进度。但不要在每次比较中都更新——这会慢得难以忍受——而是每（比如）100 毫秒更新一次。

回复收藏 0 原文

笑看君怀她人 2024-09-13 18:30:34

我认为您的问题如下：

您希望在单个连续过程中触发离散事件。
这个细分只是为了告诉用户事情正在进行中。

我的建议是：

使用类似 http://ajaxload.info/ 的加载图标，或者如果它不是基于 GUI 的环境，只需说明加载即可。由于事件持续时间不到 2 秒，因此这不会成为问题。如果等待时间超过 10 秒，则会挂断。
编写自己的排序方法确实会带来很多线程安全问题，如果您的代码使用多线程或将来必然会这样做，这可能会导致问题。

3.另一个重要信息是，每次您想要排序时，您都应该考虑数据的无序程度，因此实际上您将测量存在的随机性程度，以及您可能需要执行的预期计算数量。您可以使用此信息作为需要多少交换的指示器，而这反过来又可以在您迭代排序时进行计数。玩转数据。

回复收藏 0 原文

梦里梦着梦中梦 2024-09-13 18:30:34

使用蛮力:)

int elem_num = raw_data.size();
int percentage_delta = 100/(elem_num/20);
int percentage = 0;
int i = 0;
std::multiset<Elem*> sorted_data(&compareElemFunc);
foreach(Elem& elem, raw_data)
{
    sorted_data.insert(&elem);
    if(i%20)
    {
        updateProgressBar(percentage);
        percentage += percentage_delta;
    }
    i++;
}
//now, your data is perfectly sorted, iterate through sorted_data

（如果您不想实现自己的 std::sort() 并且因为我缺乏完整的要求）

use brute force :)

int elem_num = raw_data.size();
int percentage_delta = 100/(elem_num/20);
int percentage = 0;
int i = 0;
std::multiset<Elem*> sorted_data(&compareElemFunc);
foreach(Elem& elem, raw_data)
{
    sorted_data.insert(&elem);
    if(i%20)
    {
        updateProgressBar(percentage);
        percentage += percentage_delta;
    }
    i++;
}
//now, your data is perfectly sorted, iterate through sorted_data

(in case you don't want to implement your own std::sort() and since I'm lacking complete requirements)

回复收藏 0 原文

等风来 2024-09-13 18:30:34

使用观察者模式在每个部分完成时向父级发信号。使用它和需要排序的元素总数，您可以实时更新进度条。

回复收藏 0 原文

樱花细雨 2024-09-13 18:30:34

我不建议尝试破解 std::sort。这通常使用 introsort 实现，并且是一种非常快的 NLogN 操作。构建要排序的容器通常比排序数据更昂贵。

但是，如果您要实现进度条，我建议您将排序放在单独的线程中。通常，多线程应用程序比单线程应用程序更难编写和维护，但您可以采用不适合此进度条情况的方式来实现。您的应用程序仍然可以主要是单线程的，而不执行任何并发操作，除了此进度条和可能的一些事件处理以保持 UI 响应之外。当您准备好对数据进行排序时，只需启动另一个线程来执行此操作，并将主线程置于等待循环中，直到排序线程完成，在此期间休眠并升级进度条。

您可以将这种非侵入式方法推广到任何类型的耗时操作，而不必在整个代码中散布 update_progress_bar() 类型调用或深入研究 std::sort 的实现或尝试重新发明轮子。由于主线程将处于等待/更新进度条状态，因此在某种意义上会阻塞，直到您的工作线程完成为止，因此您不会遇到与多线程相关的任何问题（需要线程同步来访问整个进程中的共享资源）应用程序（进度计数器、竞争条件、死锁等除外）。它也将是您可以实现的最流畅的进度计数器，因为它将同时更新。

如果您担心与锁定进度计数器相关的效率，只需使用原子操作来递增它。

至于确定排序算法的进展程度，有几种方法可以实现。一种是让它根据您拥有的数据大小运行一次，并尝试预测后续运行所需的时间。这是完全非侵入性的，但有点难以做到，但是，如果做得正确，它将比定期增加计数器更准确地监控进度（这忽略了间隔可能不会花费甚至大量时间的事实）。第二种方法更简单，但有点邪恶，是修改比较器谓词以增加进度计数器。使用状态进行谓词通常会令人不悦，但这比仅仅因为想要一个进度计数器而尝试实现自己的内部排序要好得多。

另外，如果你的 introsort 花了这么长时间，我不得不想知道，你的容器是否存储这些三角形对象或指向它们的指针？如果是前者，您可能需要考虑后者，因为它会大大加快速度。

I don't recommend trying to hack apart std::sort. That's generally implemented with introsort and is an extremely fast NLogN operation. Constructing the container you're going to sort will typically be more expensive than sorting the data.

However, if you're going to implement a progress bar, I recommend you put the sorting in a separate thread. Normally multithreaded applications are harder to write and maintain than single-threaded ones, but you can do it in a way in which it isn't for this progress bar case. Your application can still be predominantly single-threaded without any concurrent operations being performed with the exception of this progress bar and probably some event handling to keep the UI responsive. When you're ready to sort the data, simply fire off another thread to do it and put the main thread in a wait loop until the sorting thread is finished, sleeping here and there and upgrating the progress bar in the meantime.

You can generalize this non-intrusive approach to any kind of time-consuming operation without having to sprinkle update_progress_bar() type calls throughout your code or digging into the implementations of std::sort or trying to reinvent wheels. Because the main thread will be in a waiting/updating progress bar state and therefore blocking in a sense until your working thread is finished, you don't have any of the issues associated with multithreading (need for thread synchronization to access shared resources throughout your application with the exception of the progress counter, race conditions, dead locks, etc). It'll also be the smoothest progress counter you can implement since it will be updating concurrently.

If you're worried about the efficiency with associated with locking the progress counter, simply use atomic operations to increment it.

As for determining how much the sort algorithm has progressed, there are a couple ways to do it. One is to let it run once with the size of the data you have and try to predict the amount of time it would take for subsequent runs. That's completely non-intrusive but a little difficult to do, yet, if done right, it will monitor progress more accurately than incrementing a counter at regular intervals (which omits the fact that intervals may not take even amounts of time). The second approach which is simpler but a little bit evil is to modify your comparator predicate to increment a progress counter. Making predicates with state is generally frowned upon, but it's less evil than trying to implement your own introsort just because you want a progress counter.

Also if your introsort is taking so long, I have to wonder, does your container store these triangle objects or pointers to them? If the former, you might want to consider the latter as it should speed things up dramatically.

回复收藏 0 原文

~没有更多了~