Checking HTTP POST headers in Python without downloading the body

Posted 2024-11-11 18:01:42


A web server responds to a POST request with a file to download (the response has a Content-Disposition header). When using a urllib or mechanize opener, at what point is the response body downloaded?

import mechanize
from mechanize import HTTPRefererProcessor, HTTPEquivProcessor, HTTPRefreshProcessor

opener = mechanize.build_opener(HTTPRefererProcessor, HTTPEquivProcessor, HTTPRefreshProcessor)
r = make_post_request()  # makes the Request object to send
res = opener.open(r)
info = res.info()  # response headers
content_disp = info.getheader('content-disposition')
filename = content_disp.split('=')[1]
content = res.read()  # or skip based on filename

I was under the impression that the body is not downloaded until read() is called, which would be useful for skipping certain downloads (such as files that have already been downloaded), but I am not seeing a great deal of performance improvement.
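
As a side note on the filename step in the snippet above: splitting the header on `=` keeps any surrounding quotes and breaks when further parameters follow the filename. A minimal sketch of a more tolerant parser, using only the standard library's `email.message.Message`, is shown here. The helper name `filename_from_content_disposition` is hypothetical and not part of urllib or mechanize.

```python
from email.message import Message


def filename_from_content_disposition(header_value):
    """Return the filename parameter of a Content-Disposition value, or None.

    Hypothetical helper for illustration; it is not a urllib or mechanize API.
    """
    if not header_value:
        # Header missing or empty: there is nothing to parse.
        return None
    msg = Message()
    msg["Content-Disposition"] = header_value
    # get_filename() looks up the "filename" parameter and strips quotes.
    return msg.get_filename()
```

It could be called with the value returned by `info.getheader('content-disposition')`, and the result compared against the files already on disk before deciding whether to call `read()`.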


小鸟爱天空丶 2024-11-18 18:01:42

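HTTP is a stateless protocol, which means no channel is established through which the server could write data in several steps. So when a POST or GET request is sent to the server, it has to answer with the complete response, because it cannot know whether this is the first or the second request. Cookies, AJAX, and Comet help to simulate something like a channel, but no real one exists yet. That is why there is the HEAD request: with it, the browser can determine whether a resource has to be loaded.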
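
A minimal sketch of the HEAD approach with the standard library's `http.client` follows. The function name `head_headers` and its parameters are assumptions made for illustration. It also only helps if the server answers HEAD for that URL with the same headers it would send otherwise; an endpoint that produces the file only in response to a POST may not do so.

```python
import http.client


def head_headers(host, path, port=80, timeout=10.0):
    """Send a HEAD request and return (status, headers) without a body.

    Hypothetical helper; header names are lower-cased for easy lookup.
    """
    conn = http.client.HTTPConnection(host, port, timeout=timeout)
    try:
        conn.request("HEAD", path)
        resp = conn.getresponse()
        resp.read()  # a HEAD response carries no body, so this returns b""
        headers = {name.lower(): value for name, value in resp.getheaders()}
        return resp.status, headers
    finally:
        conn.close()
```

The returned dictionary can then be checked for `content-disposition` or `content-length` before deciding whether to issue the real request.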
