当前位置：文江博客话题详情

估计/预测下载完成时间

发布于 2024-08-29 07:41:47 字数 405 浏览 5 评论 0原文

我们都嘲笑“还剩 X 分钟”对话框，这似乎太简单了，但我们如何改进它呢？

实际上，输入是截至当前时间的下载速度集，我们需要使用它来估计完成时间，也许带有确定性指示，例如使用某些 Y% 置信区间的“剩余 20-25 分钟”。

执行此操作的代码可以放入一个小库中并在所有项目中使用，那么这真的那么困难吗？你会怎么做？您对之前的下载速度有何权重？

或者已经有一些开源代码了吗？

编辑：总结：

通过更好的算法/过滤器等改进估计完成时间。
提供间隔而不是单一时间（“1h45-2h30 分钟”），或者只是限制精度（“大约 2 小时”）。
指出进展何时停滞——尽管如果进展持续停滞然后继续，我们应该能够处理这个问题。也许“大约 2 小时，目前停滞不前”

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

岁吢 2024-09-05 07:41:47

更一般地说，我认为您正在寻找一种方法来即时测量传输速度，该速度通常是通过一小段时间的平均值获得的。

问题通常是，为了反应性，周期通常非常小，这会导致溜溜球效应。

我会提出一个非常简单的方案，让我们对其进行建模。

考虑随时间 (x) 变化的曲线速度 (y)。

即时速度，只不过是读取当前 x (x0) 的 y。
平均速度不超过 Integral(f(x), x in [x0-T,x0]) / T
我提出的方案是应用一个过滤器，给最后时刻更多的权重，同时仍然采取考虑到过去的时刻。

它可以轻松地实现为 g(x,x0,T) = 2 * (x - x0) + 2T，这是表面 T 的一个简单三角形。

现在您可以计算 Integral( f(x)*g(x,x0,T), x in [x0-T,x0]) / T，这应该有效，因为两个函数始终为正。

当然，您可以有不同的 g，只要它在给定区间内始终为正，并且它在区间上的积分为 T（因此它自己的平均值恰好为 1）。

这种方法的优点是，因为您对即时事件给予更多的权重，所以即使考虑更大的时间间隔，您也可以保持相当的反应性（这样平均值就更精确，并且不太容易出现问题）。

另外，我很少看到但认为会提供更精确的估计的是将用于计算平均值的时间与估计的剩余时间相关联：

如果我下载 5ko 文件，它将立即加载，无需估计
如果我下载一个 15 Mo 文件，大约需要 2 分钟，所以我想估计......每 5 秒一次？
如果我下载一个 1.5 Go 文件，它将需要......大约 200 分钟（以相同的速度）......也就是说 3 小时 20 分钟......也许每分钟估计就足够了？

因此，下载时间越长，我需要的反应就越少，我可以平均得到的就越多。一般来说，我想说一个窗口可以覆盖总时间的 2%（也许除了少数的初步估计，因为人们喜欢即时反馈）。此外，一次以整个百分比表示进度就足够了。如果任务很长，我还是准备等待。

More generally, I think you are looking for a way to give an instant mesure of the transfer speed, which is generally obtained by an average over a small period.

The problem is generally that in order to be reactive, the period is usually extremely small, which leads to the yoyo effect.

I would propose a very simple scheme, let's model it.

Think of a curve speed (y) over time (x).

the Instant Speed, is no more than reading y for the current x (x0).
the Average Speed, is no more than Integral(f(x), x in [x0-T,x0]) / T
the scheme I propose is to apply a filter, to give more weight to the last moments, while still taking into account the past moments.

It can be easily implement as g(x,x0,T) = 2 * (x - x0) + 2T which is a simple triangle of surface T.

And now you can compute Integral(f(x)*g(x,x0,T), x in [x0-T,x0]) / T, which should work because both functions are always positive.

Of course you could have a different g as long as it's always positive in the given interval and that its integral on the interval is T (so that its own average is exactly 1).

The advantage of this method is that because you give more weight to immediate events, you can remain pretty reactive even if you consider larger time intervals (so that the average is more precise, and less susceptible to hiccups).

Also, what I have rarely seen but think would provide more precise estimates would be to correlate the time used for computing the average to the estimated remaining time:

if I download a 5ko file, it's going to be loaded in an instant, no need to estimate
if I download a 15 Mo file, it's going to take between 2 minutes roughly, so I would like estimates say... every 5 seconds ?
if I download a 1.5 Go file, it's going to take... well around 200 minutes (with the same speed)... which is to say 3h20m... perhaps that an estimates every minute would be sufficient ?

So, the longer the download is going to take, the less reactive I need to be, and the more I can average out. In general, I would say that a window could cover 2% of the total time (perhaps except for the few first estimates, because people appreciate immediate feedback). Also, indicating progress by whole % at a time is sufficient. If the task is long, I was prepared to wait anyway.

回复收藏 0 原文