谷歌如何确定帖子的发布日期?
当您在谷歌搜索时,搜索术语时,您可以点击页面左侧的“讨论”。这将引导您进入您可以选择的基于论坛的讨论。我正在为用户组设计一个讨论板,我希望谷歌能够用发布时间索引我的数据。
您可以按“任何时间”-“过去一小时”-“过去 24 小时”-“过去一周”- 等过滤结果。
确保将发布日期传达给 Google 的最佳方法是什么?主题的 RSS 提要?具有特定 id 的特殊 HTML 标签?或者其他方法?
When you search in google, when searching for a term, you can click "Discussion" on the left hand side of the page. This will lead you to forum based discussions which you can select. I was in the process of designing a discussion board for a usergroup and I would like for google to index my data with post time.
You can filter the results by "Any Time" - "Past Hour" - "Past 24 Hours" - "Past Week" - etc.
What is the best way to ensure that the post date is communicated to google? RSS feed for thread? Special HTML label tag with particular id? Or some other method?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
谷歌不断改进他们的启发式方法,因此,我认为您所描述的内容没有任何(众所周知的)规则。事实上,我刚刚自己进行了一次讨论搜索,发现结果页面的布局差异很大,而且并非所有页面都有 RSS 提要或使用标准论坛软件。我只是猜测谷歌会寻找常见的指标,例如帖子编号、作者、日期。
基于时间的过滤主要基于Google索引您的页面和识别新内容的频率(尽管讨论页面也可以根据单个发布日期进行过滤,这又完全取决于Google)。只是猜测,但将“Last-Modified”标头添加到您的页面也可能有所帮助。
Google continually improves their heuristics and as such, I don't think there are any (publicly known) rules for what you describe. In fact, I just did a discussion search myself and found the resulting pages to have wildly differing layouts, and not all of them have RSS feeds or use standard forum software. I would just guess that Google looks for common indicators such as Post #, Author, Date.
Time-based filtering is mostly based on how frequently Google indexes your page and identifies new content (although discussion pages could also be filtered based on individual post dates, which is once again totally up to Google). Just guessing, but it might also help to add Last-Modified headers to your pages.
我相信谷歌只会看内容何时出现。无需在那里进行解析,也不需要您进行特殊处理。
I believe Google will simply look at when the content appeared. No need for parsing there, and no special treatment required on your end.
我曾经读过谷歌搜索者的一篇论文(遗憾的是我再也找不到这篇论文了,如果有人找到它,请给我一张便条),其中有大纲。很多公式等等,但底线是:谷歌已经分析了网络上顶级论坛系统的结构。它不使用页面隐喻来分析它,而是将论坛分解为主题、线程和帖子。
所以基本上,如果您使用标准的、流行的论坛系统,谷歌就会知道它是一个论坛,并将您放入讨论部分。如果您构建自己的论坛软件,最好使用现有的、已建立的论坛约定(主题、主题、帖子、作者......)。
i once read a paper from a googler (a paper i sadly can't find anymore, if somebody finds it, please give me a note) where it was outlines. a lot of formulas and so on, but the bottom line was: google has analyzed the structure of the top forum systems on the web. it does not use a page metaphor to analyse it, but breaks the forum down into topics, threads and posts.
so basically, if you use a standard, popular forum system, google knows that it is a forum and puts you into the discussion segment. if you build your own forum software it is probably best to use existing, established forum conventions (topics, threads, posts, authors....).