Using an HTTP proxy - Python



I'm familiar with the fact that I should set the HTTP_PROXY environment variable to the proxy address.

Generally, urllib works fine; the problem is dealing with urllib2.
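
For background, both modules are supposed to pick the proxy up from the environment. A minimal sketch (Python 2, with a hypothetical proxy address) to verify what each of them actually sees:

import os
import urllib
import urllib2

# Hypothetical proxy address, set before any request is made
os.environ["http_proxy"] = "http://61.233.25.166:80"

# Both urllib and urllib2 consult the environment via getproxies()
print urllib.getproxies()

# urllib2's default opener also builds a ProxyHandler from these settings
print urllib2.urlopen("http://www.google.com").read()[:100]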

>>> urllib2.urlopen("http://www.google.com").read()

returns

urllib2.URLError: <urlopen error [Errno 10061] No connection could be made because the target machine actively refused it>

or

urllib2.URLError: <urlopen error [Errno 11004] getaddrinfo failed>

Extra info:

urllib.urlopen(....) works fine! It is just urllib2 that is playing tricks...

I tried @Fenikso's answer, but now I'm getting this error:

URLError: <urlopen error [Errno 10060] A connection attempt failed because the 
connected party did not properly respond after a period of time, or established
connection failed because connected host has failed to respond>      

Any ideas?


5 Answers

奈何桥上唱咆哮 2024-11-07 16:38:38


You can do it even without the HTTP_PROXY environment variable. Try this sample:

import urllib2

# Route HTTP traffic through the proxy and install it as the default opener
proxy_support = urllib2.ProxyHandler({"http": "http://61.233.25.166:80"})
opener = urllib2.build_opener(proxy_support)
urllib2.install_opener(opener)

html = urllib2.urlopen("http://www.google.com").read()
print html

In your case it really seems that the proxy server is refusing the connection.
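
Errno 10061 means the TCP connection itself was refused, so a plain socket test (a quick sketch, using the same hypothetical proxy address) shows whether the proxy is reachable at all, independent of urllib2:

import socket

# Hypothetical proxy host/port; raises socket.error (e.g. errno 10061)
# if nothing is listening there
s = socket.create_connection(("61.233.25.166", 80), timeout=5)
print "proxy is reachable"
s.close()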


Something more to try:

import urllib2

# proxy = "61.233.25.166:80"
proxy = "YOUR_PROXY_GOES_HERE"

proxies = {"http": "http://%s" % proxy}
url = "http://www.google.com/search?q=test"
headers = {'User-agent': 'Mozilla/5.0'}

# debuglevel=1 makes the HTTP handler print the request/response exchange,
# which shows exactly what the proxy is doing
proxy_support = urllib2.ProxyHandler(proxies)
opener = urllib2.build_opener(proxy_support, urllib2.HTTPHandler(debuglevel=1))
urllib2.install_opener(opener)

req = urllib2.Request(url, None, headers)
html = urllib2.urlopen(req).read()
print html

Edit 2014:
This seems to be a popular question/answer. However, today I would use the third-party requests module instead.

For a single request, just do:

import requests

r = requests.get("http://www.google.com", 
                 proxies={"http": "http://61.233.25.166:80"})
print(r.text)

For multiple requests, use a Session object so you do not have to pass the proxies parameter with every request:

import requests

s = requests.Session()
s.proxies = {"http": "http://61.233.25.166:80"}

r = s.get("http://www.google.com")
print(r.text)
一人独醉 2024-11-07 16:38:38


I recommend you just use the requests module.

It is much easier than the built-in HTTP clients:
http://docs.python-requests.org/en/latest/index.html

Sample usage:

import requests

r = requests.get('http://www.thepage.com', proxies={"http": "http://myproxy:3129"})
thedata = r.content
过期情话 2024-11-07 16:38:38


Just wanted to mention that you may also have to set the https_proxy OS environment variable in case HTTPS URLs need to be accessed.
In my case it was not obvious, and it took me hours to discover this.

My use case: Win 7, jython-standalone-2.5.3.jar, setuptools installation via ez_setup.py
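
For illustration, a minimal sketch of setting both variables from within Python before any request is made (the proxy address is hypothetical):

import os

# Hypothetical proxy address; http_proxy alone does not cover https:// URLs
os.environ["http_proxy"] = "http://127.0.0.1:8080"
os.environ["https_proxy"] = "http://127.0.0.1:8080"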

悲歌长辞 2024-11-07 16:38:38

Python 3:

import urllib.request

# url was undefined in the original snippet; any target URL works.
# FancyURLopener's first positional argument is the proxies dict.
url = "http://www.google.com"
htmlsource = urllib.request.FancyURLopener({"http": "http://127.0.0.1:8080"}).open(url).read().decode("utf-8")

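
Note that FancyURLopener has been deprecated since Python 3.3; a sketch of the same request through the non-deprecated opener machinery (proxy address hypothetical):

import urllib.request

# Equivalent request via ProxyHandler / build_opener
proxy_support = urllib.request.ProxyHandler({"http": "http://127.0.0.1:8080"})
opener = urllib.request.build_opener(proxy_support)
htmlsource = opener.open("http://www.google.com").read().decode("utf-8")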
我喜欢麦丽素 2024-11-07 16:38:38


I encountered this on a Jython client.
The server was only talking TLS while the client was using an SSL context:

javax.net.ssl.SSLContext.getInstance("SSL")

Once the client was switched to TLS, things started working.
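
For reference, a minimal sketch of that fix as it would look from Jython (assuming the context is created in Python code):

from javax.net.ssl import SSLContext

# Request a TLS context instead of the legacy "SSL" protocol
ctx = SSLContext.getInstance("TLS")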
