Robots.txt - MDN Web Docs Glossary: Definitions of Web-related terms 编辑
Robots.txt is a file which is usually placed in the root of any website. It decides whether crawlers are permitted or forbidden access to the web site.
For example, the site admin can forbid crawlers to visit a certain folder (and all the files therein contained) or to crawl a specific file, usually to prevent those files being indexed by other search engines.
Learn more
General knowledge
- Robots.txt on Wikipedia
- https://developers.google.com/search/reference/robots_txt
- Standard specification draft: https://tools.ietf.org/html/draft-rep-wg-topic
- https://www.robotstxt.org/
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论