对文件进行分区以进行并行下载

发布于 2024-12-29 16:38:55 字数 221 浏览 3 评论 0原文

我想制作一个多线程下载器（用Python），我需要告诉每个线程从哪里开始以及下载多少字节。为此，我获取远程文件大小并将其除以（例如）2。现在，假设远程文件大小为 5：当我将该数字除以 2 时，得到结果 2。现在我可以开始下载，但我会丢失一个字节（因为 2*2=4，而不是 5）。我无法使用浮点数，因为我无法下载半个字节。例如，我如何除以该数字并获得带有 [2, 3] 的列表？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

夜访吸血鬼 2025-01-05 16:38:55

使用 divmod：

>>> divmod(5, 2)
(2, 1)
>>>

这告诉你，5 除以除以 2 为 2，余数为 1，因此最后一块将是 2 + 1 = 3。

>>> divmod(12345, 6)
(2057, 3)

在这里，您将在 2057 处有 5 个块，在 2057+3 处有最后一个切片。

该算法也适用于除法没有余数的情况：

>>> divmod(12345, 5)
(2469, 0)

在这里，您将在 2469 处有 4 个块，加上 2469+0 处的最后一个切片。

因此，您的块大小可以计算为：

def chunk_sizes(filesize, num_chunks):
    d, r = divmod(filesize, num_chunks)
    result = [d] * num_chunks
    result[-1] += r
    return result

Use divmod:

>>> divmod(5, 2)
(2, 1)
>>>

This tells you, that 5 divided by 2 is 2, remainder 1, so the last piece will be 2 + 1 = 3.

>>> divmod(12345, 6)
(2057, 3)

Here, you'll have 5 chunks at 2057 and a last slice at 2057+3.

This algorithm will also work for cases, where division is without remainder:

>>> divmod(12345, 5)
(2469, 0)

Here, you'll have 4 chunks at 2469 plus a last slice at 2469+0.

So, your chunk sizes could be computed as:

def chunk_sizes(filesize, num_chunks):
    d, r = divmod(filesize, num_chunks)
    result = [d] * num_chunks
    result[-1] += r
    return result

回复收藏 0 原文

朕就是辣么酷 2025-01-05 16:38:55

如果你想获得每个块的大小，你可以简单地将除法的余数添加到最后一个元素：

>>> file_size = 11
>>> no_of_chunks = 3
>>> chunks = [file_size / no_of_chunks] * no_of_chunks
>>> chunks[-1] += file_size % no_of_chunks
>>> chunks
[3, 3, 5]

你也可以修改它以将余数分布到所有块中，以便块的大小最多偏差1 :

>>> for i in range(file_size % no_of_chunks):
>>>    chunks[i] += 1
>>> chunks
[4, 4, 3]

If you want to get the size of each chunk, you can simply add the remainder of the division to the last element:

>>> file_size = 11
>>> no_of_chunks = 3
>>> chunks = [file_size / no_of_chunks] * no_of_chunks
>>> chunks[-1] += file_size % no_of_chunks
>>> chunks
[3, 3, 5]

You can also modify that to distribute the remainder across all chunks, so that the size of the chunks deviates by at most 1:

>>> for i in range(file_size % no_of_chunks):
>>>    chunks[i] += 1
>>> chunks
[4, 4, 3]

回复收藏 0 原文

哥，最终变帅啦 2025-01-05 16:38:55

最后一个线程的特殊情况——分配它以获取剩余的字节数。

回复收藏 0 原文

~没有更多了~

关于作者

薄暮涼年

暂无简介

文章

26 人气

关注发私信

友情链接

文江博客

对文件进行分区以进行并行下载

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

对文件进行分区以进行并行下载

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。