SEO, content ordering and duplication
I run a specialized news site and am trying to apply a little bit of SEO sauce to it. One of the most important things I hear is to avoid duplication of content. I've covered all the basics but I'm stuck with ordering of content.
As an example, the archive of the site can be ordered by date, views, and rating. Since we don't have that many news items, an archive page for a particular day usually has only a couple of items, so the following URLs all have the same content, albeit in a different order:
- /news/archive/2010/05/16/
- /news/archive/2010/05/16/?o=views
- /news/archive/2010/05/16/?o=rating
Do search engines penalize this particular kind of content duplication? And if yes, what's the best way to avoid said penalty? <link rel="canonical" />? Telling Google & Co. to ignore the o parameter? Marking the ordering links with nofollow? Only allowing the date-ordered archive pages to be indexed through robots.txt (not sure if this is even possible)?
Comments (3)
http://googlewebmastercentral.blogspot.com/2008/09/dynamic-urls-vs-static-urls.html
Basically, Google doesn't care about that. Googlebot is smart enough to handle this issue for you.
I always use a canonical tag, seems cleaner to me.
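For the canonical-tag approach, here is a minimal sketch in Python of how a page could compute the URL to put in the tag. It assumes that o is the only parameter that merely reorders the page; the function names and the example.com host are hypothetical and not taken from the question.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

SORT_PARAM = "o"  # the ordering parameter named in the question


def canonical_url(url: str) -> str:
    """Return the URL with the ordering parameter removed (hypothetical helper)."""
    parts = urlsplit(url)
    # Keep every query parameter except the one that only changes the order.
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != SORT_PARAM
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link rel="canonical"> element for the page at the given URL."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'


if __name__ == "__main__":
    print(canonical_link_tag("https://example.com/news/archive/2010/05/16/?o=views"))
```

With this, all three archive URLs from the question would carry the same canonical tag, pointing at the date-ordered page.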
I don't know whether it affects search engine rankings or not (it probably does). In Google Webmaster Tools you can specify which query string parameters to ignore (see Site Configuration/Settings). You can also add a rule for the parameter to robots.txt, which will block the crawler even if the URL has other query parameters.
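As a rough sketch of how a robots.txt rule for the o parameter might look and behave, the Python below checks a few URLs against two wildcard patterns. The patterns are assumptions, not the rule this answer originally showed, and the matcher only approximates the wildcard semantics Google documents (`*` matches any run of characters, a trailing `$` anchors the end). The function names are hypothetical.

```python
import re

# Hypothetical robots.txt content (an assumption, not the answer's original rule).
ROBOTS_TXT = """\
User-agent: *
Disallow: /*?o=
Disallow: /*&o=
"""

# Collect the Disallow patterns from the text above.
DISALLOW_PATTERNS = [
    line.split(":", 1)[1].strip()
    for line in ROBOTS_TXT.splitlines()
    if line.lower().startswith("disallow:")
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' becomes '.*'; every other character is matched literally.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(path_and_query: str) -> bool:
    """True if any Disallow pattern matches the path plus query string."""
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOW_PATTERNS)


if __name__ == "__main__":
    for url in (
        "/news/archive/2010/05/16/",
        "/news/archive/2010/05/16/?o=views",
        "/news/archive/2010/05/16/?page=2&o=rating",
    ):
        print(url, is_blocked(url))
```

Two patterns are used rather than a single `/*o=` so that an unrelated parameter whose name merely ends in "o" (for example `?foo=1`) is not blocked by accident.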
Go to Google Webmaster Tools > Configuration > URL Parameters and set up your "o" parameter as one that sorts content.