Why does a Mongo hint make a query run up to 10 times faster?

Posted on 2024-12-09 10:36:14

If I run a Mongo query from the shell with explain(), note the name of the index it used, and then run the same query again with hint() specifying that same index, the "millis" field in the explain plan decreases significantly.

for example

no hint provided:

>>db.event.find({ "type" : "X", "active" : true, "timestamp" : { "$gte" : NumberLong("1317498259000") }, "count" : { "$gte" : 0 } }).limit(3).sort({"timestamp" : -1 }).explain();

{
    "cursor" : "BtreeCursor my_super_index",
    "nscanned" : 599,
    "nscannedObjects" : 587,
    "n" : 3,
    "millis" : 24,
    "nYields" : 0,
    "nChunkSkips" : 0,
    "isMultiKey" : true,
    "indexOnly" : false,
    "indexBounds" : { ... }
}

hint provided:

>>db.event.find({ "type" : "X", "active" : true, "timestamp" : { "$gte" : NumberLong("1317498259000") }, "count" : { "$gte" : 0 } }).limit(3).sort({"timestamp" : -1 }).hint("my_super_index").explain();

{
    "cursor" : "BtreeCursor my_super_index",
    "nscanned" : 599,
    "nscannedObjects" : 587,
    "n" : 3,
    "millis" : 2,
    "nYields" : 0,
    "nChunkSkips" : 0,
    "isMultiKey" : true,
    "indexOnly" : false,
    "indexBounds" : { ... }
}

The only difference is the "millis" field.

Does anyone know why that is?

UPDATE: "Selecting which index to use" doesn't explain it because, as far as I know, Mongo re-selects the index only once every X (100?) runs, so the next (X-1) runs without a hint should be as fast as runs with one.


风透绣罗衣 2024-12-16 10:36:14

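Mongo uses an algorithm to decide which index to use when no hint is provided, and then caches that index choice for similar queries for the next 1000 calls.

However, whenever you explain a Mongo query, it always runs the index selection algorithm, so explain() with a hint will always take less time than explain() without a hint.

A similar question was answered here:
Understanding mongo db explain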
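The behaviour described in this answer can be sketched with a toy model. This is only an illustration of the idea (selection is cached for normal queries, re-run for every unhinted explain, and skipped when a hint is given); it is not MongoDB's actual optimizer, and `makePlanner`, `selectPlan`, the `cost` numbers, and the `idx_a` index are all hypothetical names invented for the sketch.

```javascript
// Toy model only: NOT MongoDB's real query optimizer.
// Every name and number here is hypothetical, made up for illustration.
function makePlanner(candidates, cacheLifetime) {
  let cached = null; // name of the cached winning index
  let usesLeft = 0;  // how many more calls may reuse the cached winner
  let trials = 0;    // how many times the selection step has run

  // "Trial" every candidate index and keep the cheapest one.
  function selectPlan() {
    trials += 1;
    let best = candidates[0];
    for (const c of candidates) {
      if (c.cost < best.cost) best = c;
    }
    return best.name;
  }

  return {
    // Normal query: reuse the cached choice while it is still valid.
    run(hint) {
      if (hint) return hint; // a hint skips selection entirely
      if (cached === null || usesLeft === 0) {
        cached = selectPlan();
        usesLeft = cacheLifetime;
      } else {
        usesLeft -= 1;
      }
      return cached;
    },
    // explain(): in this model, selection always re-runs unless hinted.
    explain(hint) {
      return hint ? hint : selectPlan();
    },
    trialCount() {
      return trials;
    },
  };
}

const planner = makePlanner(
  [{ name: "idx_a", cost: 50 }, { name: "my_super_index", cost: 5 }],
  1000
);
planner.run();                       // first call pays for selection
planner.run();                       // served from the cached choice
planner.explain();                   // pays for selection again
planner.explain("my_super_index");   // hinted: no selection at all
```

In this model, repeated calls to `planner.run()` trigger selection once, while every unhinted `planner.explain()` triggers it again, which is the asymmetry the answer attributes to the "millis" gap.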


同展鸳鸯锦 2024-12-16 10:36:14

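As you can see from the number of scanned objects, Mongo did the same search both times. You can also see that the index used was the same (look at the "cursor" entry): both runs already used your my_super_index index.

"hint" only tells Mongo to use that specific index, which it had already chosen automatically in the first query.

The second search was simply faster because all the data was probably already in the cache.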
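One way to check whether a warm cache, rather than the hint, explains the gap is to alternate the two variants over several rounds and compare medians. The sketch below is a generic harness under that assumption; `median` and `compareAlternating` are hypothetical helper names, and the measuring functions are supplied by the caller.

```javascript
// Hypothetical helpers for an order-balanced timing comparison.
function median(values) {
  const sorted = [...values].sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1
    ? sorted[mid]
    : (sorted[mid - 1] + sorted[mid]) / 2;
}

// measureA and measureB are functions that each return one timing (a number).
function compareAlternating(measureA, measureB, rounds) {
  const a = [];
  const b = [];
  for (let i = 0; i < rounds; i++) {
    // Swap the order every round so neither variant always runs second
    // (and therefore always benefits from data the other one loaded).
    if (i % 2 === 0) {
      a.push(measureA());
      b.push(measureB());
    } else {
      b.push(measureB());
      a.push(measureA());
    }
  }
  return { medianA: median(a), medianB: median(b) };
}

// Possible use in the shell, assuming `query` holds the filter from the
// question and explain() returns the old-format output with a "millis" field:
//
// compareAlternating(
//   function () { return db.event.find(query).sort({ timestamp: -1 }).limit(3).explain().millis; },
//   function () { return db.event.find(query).sort({ timestamp: -1 }).limit(3).hint("my_super_index").explain().millis; },
//   10
// );
```

If the hinted variant stays faster even when it runs first in half of the rounds, caching of the data alone is unlikely to be the whole explanation.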


花开浅夏 2024-12-16 10:36:14

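I struggled to find the reason for the same thing. I found that when a collection has many indexes, Mongo does take more time without a hint than with one. Mongo basically spends a lot of time deciding which index to use. Consider a scenario where you have 40 indexes and run a query. The first thing Mongo needs to do is work out which index is best suited to that particular query. That implies Mongo has to scan the keys and do some computation on each scan to estimate how well each index would perform. A hint will definitely speed things up, because that index key scanning is saved.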


獨角戲 2024-12-16 10:36:14

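I'll show you how to check which is faster.
1) Without an index
Mongo pulls every document into memory to produce the result.
2) With an index
If the collection has many indexes, it takes the index from the cache.
3) With .hint(_index)
It uses the specific index you name in hint().

Run the query both with hint() and without hint(), calling .explain("executionStats") each time.
With hint(), check the totalKeysExamined value; it will match totalDocsExamined.
Without hint(), you may see that totalKeysExamined is greater than totalDocsExamined.

In most cases totalDocsExamined will match the result count exactly.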
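The comparison described above can be made less error-prone with a small helper that pulls the relevant counters out of an explain("executionStats") result. This is a minimal sketch: `summarizeExplain` is a hypothetical name, and the sample object is hand-written, reusing the question's old-format numbers (nscanned, nscannedObjects, n) under the assumption that they correspond to totalKeysExamined, totalDocsExamined, and nReturned.

```javascript
// Hypothetical helper: extract the counters worth comparing from an
// explain("executionStats") result object.
function summarizeExplain(explainOutput) {
  const stats = explainOutput.executionStats;
  return {
    keys: stats.totalKeysExamined,
    docs: stats.totalDocsExamined,
    returned: stats.nReturned,
    millis: stats.executionTimeMillis,
    // Ratio of index keys examined to documents fetched; null when no
    // documents were examined, to avoid dividing by zero.
    keysPerDoc:
      stats.totalDocsExamined === 0
        ? null
        : stats.totalKeysExamined / stats.totalDocsExamined,
  };
}

// Hand-written sample, NOT real server output. The numbers are borrowed
// from the question and mapped onto the newer field names as an assumption.
const sample = {
  executionStats: {
    totalKeysExamined: 599,
    totalDocsExamined: 587,
    nReturned: 3,
    executionTimeMillis: 2,
  },
};
const summary = summarizeExplain(sample);
```

In the shell, the argument would be the value returned by calling .explain("executionStats") on the hinted and unhinted queries, so the two summaries can be compared field by field.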