新的供稿项目未显示在 Google 阅读器中
有一个由 Wordpress 提供支持的 博客,它具有有效的 RSS 提要(在 Safari 中可以正常打开),但没有不在 Google 阅读器中显示新帖子。事实上,Google Reader 的最新文章是 2010 年 7 月 21 日,而博客上的最新文章是 2010 年 8 月 19 日。
我应该如何处理 RSS feed(转义字符?修改 XML 还是什么?)使用 Google 阅读器?
这是一个重新提出的问题,因为我发现的原始问题已迁移给超级用户,然后在那里关闭,因为它最适合 stackoverflow,所以没有提供解决方案,也没有机会这样做。 请给它一个机会得到答复。
更新: Google Reader 会以 10 篇为一组提取新文章,而且不是最新的。例如,如果 Google 阅读器中未显示 12 篇(或 13 或 11 篇)新文章,则添加下一篇文章时,这些文章中最早的 10 篇(正好 10 篇)会出现在 Google 阅读器中,并且 Google 阅读器中显示的日期为每篇文章都是平等的,就好像所有 10 篇文章都在同一秒内发布——它们出现在 Google Reader 上的那一秒。这个问题在我尝试过的其他聚合器中并未出现。
更新2: 文章开始定期出现,所以问题暂时解决了。我不知道为什么会发生这种情况,也许是因为更多的读者订阅了(出于测试目的),或者是因为我最近添加了 PubSubHubBub 插件。在事情变得明朗之前,这个问题在接下来的 3 天里仍然悬而未决。
There is a blog, powered by Wordpress, which has valid RSS feed (opens up fine in Safari), but doesn't show new posts in Google Reader. In fact, the latest article from Google Reader is from Jul 21, 2010, while the latest article on the blog dates to Aug 19, 2010.
What should I do about the RSS feed (escape characters? modify XML or what?) for it to work on Google Reader?
This is a reopened question, because the original question I found was migrated to superuser, then closed there because it is best fitted on stackoverflow, so no solution was ever provided, and no chance was given to do so. Please give it a chance to get answered.
Update:
Google Reader pulls new articles, in groups of 10, and not the latest. For example if 12 (or 13, or 11) new articles are not shown in Google Reader, when the next one is added, the oldest 10 (exactly 10) of these articles appear on Google Reader, and the date shown in Google Reader is equal for each article, as if all 10 were published in the same second - the second they appeared on Google Reader. This problem doesn't manifest itself in other aggregators that I've tried.
Update 2:
Articles started showing up regularly, so the problem is solved, temporarily. Why did it happen I don't know, maybe it's because more readers subscribed (for testing purposes), or it's because of the PubSubHubBub plugin that I've added recently. Until it becomes clear, and for 3 more days, this question remains open.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
感谢您的回复,我也提出了一些可能的解决方案(谢谢您)。
我不知道这是否是我所做的,或者与此无关,但从昨天(当你回答这个问题时)开始,提要开始正常显示。
也许是因为您的帮助,博客在 Google Reader 上获得了更多订阅者,并且更新率出现了反弹(就像@Bermo 建议的那样)。
或者,也许 PubSubHubBub 插件的引入改变了一些东西。但这是第一个变体(订户数量)。尽管为什么其他极不受欢迎的博客在 Google Reader 中为我提供定期文章仍然是一个谜。
现在我只会投票赞成好的答案,直到一切变得清晰(无法真正确定确切的原因)或直到这个赏金的最后一天。
Thank you for your responses, I too have come up with some possible solutions (thanks to you).
I don't know whether It's something I did, or independent of that, but as from yesterday (when you answered to this question), feeds started showing up normally.
Maybe it is due to the fact that thanks to you the blog got more subscribers on Google Reader and the Update Rate bounced (just like @Bermo suggested).
Or, maybe the introduction of the PubSubHubBub plugin changed something. But it's rather the first variant (number of subscribers). Though it is still a mystery why other extremely unpopular blogs give me regular articles in Google Reader.
For now I will only upvote good answers, until everything becomes clear (can't really determine the exact cause) or until the last day of this bounty.
我刚刚将博客添加到我的 Google 阅读器中并进行了一些阅读。我注意到你观察到的相同行为,我错过了 5 个最新帖子,其中大约 10 个帖子都有相同的日期:
在网络上进行了一些搜索后,我发现 这篇文章解释了如何通过右侧的工具提示实际查看发布日期:
然后,一旦我单击顶部的 Google Reader 中的“刷新”按钮,新帖子就会显示:
我相信 Google 蜘蛛雷达上的大量博客每隔几个小时就会被索引一次,因此所有帖子的接收日期都非常接近发布日期,因此没有人注意到/关心它是实际上显示接收日期。
然而,对于低容量的博客,缓存的更新频率似乎要低得多。 Google 提供了一些尝试更新的提示 - 阅读器中的提要未更新。也许我对博客的订阅更新了缓存,但由于蜘蛛有延迟,我直到按“刷新”才看到更新。或者,按下“刷新”按钮的行为可能会触发它立即查找新帖子。
最后,我将博客订阅到了我妻子的 Google 阅读器帐户,这次 5 篇最新帖子立即出现,并带有匹配的接收时间,这大约翻译回我按下“刷新”按钮时的时间(或者可能是当我添加喂养)。
我感受到你的痛苦 - 我同意这对于低容量 RSS 提要来说似乎有点麻烦......
I just added the blog to my Google Reader and had a bit of a play. I noticed the same behaviour you observed where I was missing the 5 most recent posts and a bunch of about 10 of them all had the same date:
After doing a bit of a search on the web, I found this post which explains how you can actually view the Published date via a tooltip on the right-hand side:
Then once I click the "Refresh" button from Google Reader at the top, the new posts showed up:
I believe that high volume blogs that are on the Google spiders' radar would be indexed every few hours and therefore all posts would have their Received date very close to the Published date so nobody notices/cares that it is actually displaying the Recevied date.
For low volume blogs however, it seems the cache is updated much less frequently. Google has some tips to try to get it to update - Feed not updating in Reader. Maybe my subscription to the blog updated the cache, but as the spider has a delay I didn't see the updates till pressing "Refresh". Or maybe the act of pressing the "Refresh" button triggered it to look for new posts immediately.
Lastly I subscribed the blog to my wife's Google Reader account and this time the 5 latest posts came up straight away with matching Received times which translated back to about the time when I pressed the "Refresh" button (or maybe it was when I added the feed).
I feel your pain - I agree that it all seems a bit cumbersome for a low volume RSS feed ...
您还可以咨询博客作者/托管公司,看看他们是否拒绝了 Google 索引率。 Google 可以在网站上创造大量流量。降低索引率(抓取率)会有所帮助,但它会妨碍 Google Reader。
正如其他发帖者所提到的,这也可能是受欢迎程度低/页面排名低/其他原因导致 Googlebot 无法足够频繁地抓取博客的因素。
Google Reader 显示依赖于 Google 抓取博客以获取最新内容。实际上,您需要通过客户端拉取 RSS 提要来获取最新数据,这样您就不必依赖 Google 来抓取网站。 Outlook 2010、Firefox 以及许多其他工具都存在。客户端软件将直接从博客中提取更新的 RSS 提要,并在发布到 RSS 提要时捕获帖子。
You may also check with the blog author / hosting company and see if they have turned down the Google indexing rate. Google can create high volumes of traffic on a site. Turning down the indexing rate (crawl rate) will help with that but it b0rks Google Reader.
As other posters have mentioned, it could also be a factor of low popularity / low page rank / something else causing Googlebot to fail to crawl the blog frequently enough.
Google Reader display is dependent on Google crawling the blog to pick up the latest content. Realistically, you'll want a client side pull of the RSS feed to get the latest data so you aren't dependent on Google crawling the website. Outlook 2010, Firefox, many others exist. The client side software will directly pull the updated RSS feed from the blog, capturing the posts as they are published to the RSS feed.