404 处理程序和真正不存在的动态页面...对 SEO 不利?
我们有一个 IIS 404 asp.net 处理程序,可以在找不到 html 页面时呈现页面。它使用页面的 URL 来查询我们的数据库并动态构建丰富的相关内容。根据我在 IIS 日志中看到的内容以及通过 Web 浏览器工具分析页面,没有迹象表明该页面实际上不存在并且是动态生成的。
在这些情况下,IIS 是否确实向客户端发送 404?是否确实发生了任何类型的重定向?搜索引擎会因此惩罚我吗?
两个月过去了,Google 已经对所有内容建立了索引,但 Bing 和 Yahoo 还没有对任何动态内容建立索引,尽管我提交了各种目录页面、站点地图和 Feed 以及所有链接。我的主页已在所有搜索引擎上建立索引并包含我的所有链接。当我在这些链接中搜索非常独特的关键字时,我可以看到 bing 和 yahoo 确实在我的主页链接上看到了它们 - 但仅限于此。
我可以运行或检查什么来确保我的动态页面不会被搜索引擎视为不良页面吗?有什么方法可以检查 404(无论 404 对于客户端来说实际上是什么,除了另一个页面)是否返回给爬虫?
非常感谢。
We have an IIS 404 asp.net handler that renders pages when an html page is not found. It uses the page's URL to query our Databases and builds rich relevant content on the fly. From what I can tell in the IIS logs and anaylyzing the pages from web browser tools there is NO indication the page does not actually exist and was dynamically generated.
In these cases is IIS actually sending a 404 to the client? Is there a redirect of any kind actually happening? Will Search engines punish me for this?
It's been 2 months and Google has indexed everything, but Bing and Yahoo have not indexed anything dynamic dispite my submitting various Directory pages, Sitemaps and Feeds with all my links. My home page is indexed on all search engines and has all my links. When I search very unique keywords in those links, I can see that bing and yahoo do see them on my Home Page links - but only there.
Is there anything I can run or check to make sure my dynamic pages are not somehow viewed as bad by Search engines? Any way to check if a 404 (whatever a 404 actually is to a client besides just another page) is returned to crawlers?
Many Thanks.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
动态页面就好了。互联网上的大部分内容都是动态生成的。搜索引擎并不关心内容是否是动态的,事实上,他们通常不知道内容是动态的,因为他们看到的只是 URL 和由该 URL 生成的 HTML。
使用 Firebug 等工具或 Chrome 中的内置开发人员工具来查看 HTTP 标头。爬虫看到的标头与浏览器看到的标头相同,因此这是一种简单的方法来判断您的页面正在发送哪些标头。
Dynamic pages are just fine. Most of the content on the Internet is dynamically produced. The search engines don't care if content is dynamic and, in fact, they usually do not know content is dynamic as all they see if the URL and the HTML that is produced by that URL.
Use a tool like Firebug or the built in developer tools in Chrome to view your HTTP headers. Crawlers see the same headers a browser would see so that is an easy way to tell what headers your pages are sending out.