Bots and 301 redirects
I changed the URL structure of my site more than 6 months ago. I detect requests for legacy URLs and redirect them to the new URLs with a 301 status code. I verified with Fiddler that the status code is returned correctly for those requests. But bots (Yahoo Slurp, Googlebot, etc.) are still hitting the old URLs. Is there something I am missing?
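The check described in the question can also be scripted. The following is a minimal sketch, not the asker's actual setup: it requests a legacy path without following redirects and reports the status code and `Location` header, sending a crawler-style `User-Agent` so you can confirm bots get the same answer as a browser. The path mapping `LEGACY_TO_NEW`, the demo handler, and the helper names are all hypothetical; the small local server exists only so the example runs on its own.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical mapping of legacy paths to new paths (not from the original post).
LEGACY_TO_NEW = {"/old/page": "/new/page"}


class RedirectHandler(BaseHTTPRequestHandler):
    """Demo server: answers legacy paths with a 301, everything else with 200."""

    def do_GET(self):
        target = LEGACY_TO_NEW.get(self.path)
        if target is not None:
            self.send_response(301)
            self.send_header("Location", target)
            self.send_header("Content-Length", "0")
            self.end_headers()
        else:
            body = b"ok"
            self.send_response(200)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    def log_message(self, *args):
        # Keep the demo quiet.
        pass


def start_demo_server():
    """Start the demo server on a free local port and return it."""
    server = HTTPServer(("127.0.0.1", 0), RedirectHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def check_redirect(host, port, path, user_agent):
    """Return (status, Location header) for one GET; redirects are not followed."""
    conn = http.client.HTTPConnection(host, port, timeout=5)
    try:
        conn.request("GET", path, headers={"User-Agent": user_agent})
        resp = conn.getresponse()
        resp.read()
        return resp.status, resp.getheader("Location")
    finally:
        conn.close()
```

Against a real site you would call `check_redirect` with your own host, port 80, and one of the old paths, and expect a 301 with the new URL in `Location`.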
Comments (4)
No, it just takes a very, very long time for crawlers to get the message. I have bots crawling addresses that have not existed since 2005. When people harp on about addresses being permanent, they really are.
Additionally, depending on how your URLs are structured, you can disallow the old addresses with robots.txt.
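To see what a robots.txt rule of that kind would do, here is a small sketch using Python's standard `urllib.robotparser`. It assumes, purely for illustration, that every legacy URL lived under a common `/old/` prefix; the rules, the host name, and the helper `is_allowed` are hypothetical and not taken from the asker's site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: assumes all legacy URLs share the /old/ prefix.
ROBOTS_TXT = """\
User-agent: *
Disallow: /old/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in ("http://www.example.com/old/page",
                "http://www.example.com/new/page"):
        print(url, is_allowed(ROBOTS_TXT, "Googlebot", url))
```

This only works if the old URLs share a prefix or pattern that the new ones do not. Note also that a crawler blocked this way never requests the old URL, so it never sees the 301 either.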
Try this; it will redirect only the bots.
If external sites have linked to your old pages and those links are still reachable by bots, the bots will keep coming back and trying to access that content.
Put your site address here:
http://www.your-main-site.com/
That is what we use to transfer a domain, and sometimes for black-hat SEO.