Google 如何计算和估计搜索结果的数量?
Google 如何计算和估计搜索结果的数量?例如,当我搜索“stackoverflow”时,它会计算出 2,910,000 个结果。
How does Google count and estimate the number of a search results? For example when I search "stackoverflow," it counts 2,910,000 results.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
可能是这方面最权威的文章:
http://searchengineland.com/why -google-cant-count-results-properly-53559
因为有趣的是,取出东西有时实际上可以提供更多结果(!!)。
来自 Google 开发人员(网络垃圾邮件团队负责人 Matt Cutts):
“我们试图非常清楚地表明,我们的结果估计只是估计。理论上,我们可以在系统的这方面花费周期,但实际上我们还有很多其他事情要做,而且更准确的结果估计比许多其他事情在列表中的位置要低”
Probably the most definitive article on this:
http://searchengineland.com/why-google-cant-count-results-properly-53559
as it's interesting to note that taking stuff out can actually sometimes give more results(!!).
From a Google developer (Matt Cutts, head of the web spam team):
"We try to be very clear that our results estimates are just that--estimates. In theory we could spend cycles on that aspect of our system, but in practice we have a lot of other things to work on, and more accurate results estimates is lower on the list than lots of other things"
我认为这个问题是技术性的而不是概念性的。
这就是 MEMCACHE。
据我所知,
memcache
可以在标准时间内执行任何操作。 记录数量无关紧要。分布式 RAM
架构。如需类似的实时实现,请阅读分片计数器文章。他们保留包含大约 10-1000 个 2-4 GB RAM 的集群,在其中进行临时数学计算,当他们保存最终值时,他们只对每个 RAM 中的总和进行求和。我对该实现的使用表明它会更快(而且也更准确)。
它现已开源,请访问 memcached。
说到公司“估计”价值......
我的观点是大公司(包括谷歌)会分散人们对真相的注意力。很多东西都倾销给人们,这样人们只会使用他们所提供的东西,至少在不久的将来不会成为公司的竞争对手。人们意味着公司、投资者、有潜力的技术人员、每个人。
如果他们进行估计,他们会做很多事情,而这些事情是“估计”不可能实现的。比如钱包api、adsense发票等。
I believe this question is technical than conceptual.
And that is MEMCACHE.
To my knowledge,
memcache
, does any operation in a standard amount of time. immaterial of the amount of records. Adistributed RAM
architecture. For a similar realtime implementation, read Sharding Counters article. They keep clusters containing of about 10-1000 2-4 gb RAMs, They do temporory math in them, When they persist the final value, they only sum up the sums in each of the RAM.My use of that implementation tells it will be faster (and accurate too).
It is open source now, visit memcached.
Coming to the point that says the company 'estimates' values...
My view is big companies (including google) divert people from truth. A lot many things are dumped on the people so that people will only use what they are offered, and don't become competitors to companies in the near future at the least.. People means companies, investors, talent potential techies, everyone.
If they estimate, there are so many things they do which are not possible on 'estimation'. Like wallet api, adsense invoicing, etc.