Convert gzip data fetched by urllib2 to HTML
I currently use mechanize to read gzipped web pages, as below:
br = mechanize.Browser()
br.set_handle_gzip(True)
response = br.open(url)
data = response.read()
How can I decompress gzipped data fetched by urllib2 into HTML text?
req = urllib2.Request(url)
opener = urllib2.build_opener()
response = opener.open(req)
data = response.read()
if response.info().get('Content-Encoding') == 'gzip':
    pass  # HOW TO DECOMPRESS DATA TO HTML
Comments (2)
Try this:
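The snippet itself is not preserved in this text, so the following is only a minimal sketch of one common way to do it, not necessarily the answerer's exact code. It assumes `data` holds the raw gzip-compressed bytes returned by `response.read()`. The helper name `gunzip_body` and the in-memory sample page are made up for illustration, and `io.BytesIO` is used so the sketch runs without a network connection.

```python
import gzip
import io


def gunzip_body(data):
    """Decompress a gzip-encoded body (bytes) and return the decompressed bytes."""
    # GzipFile needs a file-like object, so wrap the bytes in an in-memory buffer.
    gzipper = gzip.GzipFile(fileobj=io.BytesIO(data))
    try:
        return gzipper.read()
    finally:
        gzipper.close()


# Stand-in for `data = response.read()`: compress a tiny page in memory.
sample = io.BytesIO()
writer = gzip.GzipFile(fileobj=sample, mode="wb")
writer.write(b"<html><body>hello</body></html>")
writer.close()
data = sample.getvalue()

html = gunzip_body(data)
print(html)
```

In the question's code, the call would presumably go inside the `Content-Encoding` check, in place of the placeholder line.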
html should now hold the HTML (print it to see). See here for more info.