Google不索引网站 - 说'由Robots.txt' - 但是robots.txt允许所有爬行者 - 两个不同的托管服务的相同问题
我已经构建并发布了很多网站,但从未遇到以下问题:
Google没有索引我的网站。每当我提交页面(在Google搜索控制台中)时,它都会说“ 被robots.txt 封锁” 用户代理: *
和允许:/
)。可以通过myDomain.com/robots.txt
访问robots.txt,并且可以通过mydomain.com/sitemap
访问站点的站点。
我已经尝试了两个不同的托管提供商: dreamhost.com 和 fastcomet.com 。但是,这个问题仍然存在,我看不出原因。这些域已在 namecheap.com 上注册,从那以后我一直在许多其他网站上使用。
我使用 grav cms - 一个了不起的扁平flile cms-通常是完美无缺的,我认为我不认为CMS引起了问题。
以下是Google搜索控制台中Google错误消息的屏幕截图。显然, robots.txt不能是罪魁祸首,因为允许爬网访问。
最后,甚至在Google的搜索结果中都没有出现域名。通常,Google会显示一个没有附带描述等的域,如果不允许爬网。
I have built and published quite a few websites and never had the following issue:
Google is not indexing my website. Whenever I submit the page (in Google Search Console) it says "blocked by robots.txt" although the robots.txt allows every crawler (User-agent: *
and Allow: /
). The robots.txt is accessible via mydomain.com/robots.txt
and the site's sitemap is accessible via mydomain.com/sitemap
.
I have tried it with two different hosting providers: Dreamhost.com and Fastcomet.com. The issue persists however, and I cannot see why. The domains are registered with Namecheap.com which I have been using for many other sites since forever.
I use Grav CMS -- a terrific flat-file CMS -- which usually works flawlessly and I don't think that the CMS causes the problem.
Here below is a screenshot of Google's error message inside Google Search Console. Obviously, the robots.txt cannot be the culprit, since crawlers are allowed access.
Lastly, not even the domain is coming up in Google's search results. Usually, Google displays a domain without the accompanying description etc., if it is not allowed to crawl that domain.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论