Barnes and Noble website does not treat HeadlessChrome as a real user
I'm using Chrome-Headless because it acts like a real browser, but when I direct chrome-headless + Selenium to this Barnes and Noble link:
I get this response, without navigating to any other page:
<html><head>
<title>Access Denied</title>
</head><body>
<h1>Access Denied</h1>
You don't have permission to access "https://www.barnesandnoble.com/w/the-woman-they-could-not-silence-kate-moore/1138489968?ean=9781728242576" on this server.<p>
Reference #
</p></body></html>
I understand that I would need to add headers and so on, but how is this different from a regular GET request + headers?
What else is giving Chrome-headless away to Barnes and Noble in particular?
What am I doing wrong?
What am I missing?
Comments (1)
You are seeing the Access Denied error page because the default user agent of headless Chrome contains the keyword Headless.
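To make the giveaway concrete, here is a small sketch of the kind of check a server-side filter could apply. The sample user-agent strings are illustrative (the version numbers are placeholders), and looks_headless is a hypothetical helper, not Barnes and Noble's actual logic, which is not public.

```python
# Illustrative user-agent strings; the version numbers are placeholders.
HEADLESS_UA = (
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) HeadlessChrome/90.0.0.0 Safari/537.36"
)
REGULAR_UA = (
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/90.0.0.0 Safari/537.36"
)


def looks_headless(user_agent: str) -> bool:
    """Hypothetical filter: flag any user agent that contains 'Headless'."""
    return "headless" in user_agent.lower()


print(looks_headless(HEADLESS_UA))  # True
print(looks_headless(REGULAR_UA))   # False
```

The only difference between the two strings is the HeadlessChrome product token, which is what a keyword filter of this kind would match on.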
Solution
You can override the default user agent with a regular, non-headless user agent as follows:
Code Block:
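A minimal sketch of the override, assuming the Python Selenium bindings and a locally installed Chrome with a matching driver. The user-agent string is a placeholder, and build_chrome_arguments and fetch_page_source are hypothetical helper names introduced for illustration.

```python
from typing import List

# Placeholder for a regular desktop Chrome user agent (no "Headless" token).
REGULAR_UA = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/90.0.0.0 Safari/537.36"
)


def build_chrome_arguments(user_agent: str) -> List[str]:
    """Return Chrome switches: headless mode plus a custom user agent."""
    return ["--headless", f"--user-agent={user_agent}"]


def fetch_page_source(url: str, user_agent: str = REGULAR_UA) -> str:
    """Load url in headless Chrome with the overridden user agent."""
    # Imported here so build_chrome_arguments works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    for argument in build_chrome_arguments(user_agent):
        options.add_argument(argument)

    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        return driver.page_source
    finally:
        # Always close the browser, even if the page load fails.
        driver.quit()


# Example call (needs Selenium, Chrome, and network access):
# html = fetch_page_source("https://www.barnesandnoble.com/")
```

Overriding the user agent removes the Headless keyword, but it may not be enough on its own if the site also relies on other signals.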