PostgreSQL: query runs faster with enable_nestloop=false. Why is the planner not doing the right thing?

Posted on 2024-12-11 11:31:05

I have a query that runs much slower (~5 minutes) with the default enable_nestloop = true than with enable_nestloop = false (~10 seconds).

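EXPLAIN ANALYZE results for both cases:

Machine A, nestloop=true - http://explain.depesz.com/s/nkj0 (~5 minutes)
Machine A, nestloop=false - http://explain.depesz.com/s/wBM (~10 seconds)

On a different, slightly slower machine, copying the database over and leaving the default enable_nestloop=true takes ~20 seconds.

Machine B, nestloop=true - (~20 seconds)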

In all of the cases above, I made sure to run ANALYZE before running the query. No other queries were running in parallel.

Both machines run Postgres 8.4. Machine A runs Ubuntu 10.04 32-bit, while Machine B runs Ubuntu 8.04 32-bit.

The actual query can be found here. It is a reporting query with many joins, because the database is used mainly for transaction processing.

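Without resorting to something like materialized views, what can I do to make the planner do what I achieved by setting enable_nestloop=false?
From the research I have done, the planner seems to choose the apparently suboptimal plan because of the huge difference between estimated and actual rows. How can I bring these figures closer together?
If I should rewrite the query, what should I change?
Why does the planner seem to do the right thing on Machine B? What should I compare between the two machines?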

删除→记忆 2024-12-18 11:31:05

If the query planner chooses suboptimal query plans, then chances are it has incomplete or misleading information to work with.

See this PostgreSQL Wiki page on server tuning. Especially pay attention to the chapters on random_page_cost and default_statistics_target.
Also read the corresponding chapters in the manual on Statistics Used by the Planner and Planner Cost Constants.
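
As a rough sketch of how the settings mentioned above could be inspected and tried out, the following statements compare planner-related settings between the two machines and change one of them for a single session. The trial value 2.0 is an arbitrary assumption for experimentation, not a recommendation, and the query from the question is not reproduced here.

```sql
-- Run on both machines and compare the output; a difference here
-- can explain why the same query gets different plans.
SELECT name, setting, source
FROM   pg_settings
WHERE  name IN ('random_page_cost', 'seq_page_cost',
                'effective_cache_size', 'default_statistics_target',
                'work_mem');

-- Try a different cost setting for the current session only.
-- 2.0 is an arbitrary trial value (assumption).
SET random_page_cost = 2.0;

-- Re-run the report query under EXPLAIN ANALYZE here and compare
-- estimated rows against actual rows in the resulting plan.

-- Return to the configured value afterwards.
RESET random_page_cost;
```

Because SET only affects the current session, this kind of experiment does not change the behaviour of other connections.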

More specifically, it might help to increase the statistics target for the following columns:

ALTER TABLE postgres.products ALTER COLUMN id SET STATISTICS 1000;
ALTER TABLE postgres.sales_orders ALTER COLUMN retailer_id SET STATISTICS 1000;
ALTER TABLE postgres.sales_orders ALTER COLUMN company_id SET STATISTICS 1000;

ALTER TABLE goods_return_notes ALTER COLUMN retailer_id SET STATISTICS 1000;
ALTER TABLE goods_return_notes ALTER COLUMN company_id SET STATISTICS 1000;

ALTER TABLE retailer_category_leaf_nodes ALTER COLUMN tree_left SET STATISTICS 1000;
ALTER TABLE channels ALTER COLUMN principal_id SET STATISTICS 1000;

These columns are involved in the filters that result in the "huge difference between the estimated and actual rows" mentioned in the question.

There are more. Check every column where the planner's estimate deviates a lot from the actual row count. The default statistics target is just 100. Raising it only makes sense for tables with far more than 1000 rows. Experiment with the setting. Run ANALYZE on the tables afterwards for the changes to take effect.
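
A minimal sketch of the follow-up steps for one of the tables, assuming the table and column names from the statements above. It refreshes the statistics and then looks at what the planner has recorded. On PostgreSQL versions of this era, an attstattarget of -1 means the column falls back to default_statistics_target.

```sql
-- Refresh statistics so the new per-column targets take effect.
ANALYZE postgres.sales_orders;

-- Confirm the per-column statistics target
-- (-1 means "use default_statistics_target").
SELECT attname, attstattarget
FROM   pg_attribute
WHERE  attrelid = 'postgres.sales_orders'::regclass
AND    attname IN ('retailer_id', 'company_id');

-- Inspect what the planner now knows about these columns.
SELECT attname, null_frac, n_distinct
FROM   pg_stats
WHERE  schemaname = 'postgres'
AND    tablename  = 'sales_orders'
AND    attname IN ('retailer_id', 'company_id');
```

After this, re-running EXPLAIN ANALYZE on the query shows whether the estimated row counts have moved closer to the actual ones.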

It might also help to create a partial index on postgres.sales_orders (retailer_id) WHERE retailer_id IS NOT NULL (depending on how common NULL values are).
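
A sketch of what that could look like. The index name is made up for illustration, and whether the index pays off depends on the share of NULL values, which the first statement estimates from the collected statistics.

```sql
-- Estimate how common NULLs are in retailer_id
-- (null_frac is the fraction of NULL entries seen by ANALYZE).
SELECT null_frac
FROM   pg_stats
WHERE  schemaname = 'postgres'
AND    tablename  = 'sales_orders'
AND    attname    = 'retailer_id';

-- Partial index covering only rows with a retailer_id.
-- The index name is hypothetical.
CREATE INDEX sales_orders_retailer_id_notnull_idx
ON     postgres.sales_orders (retailer_id)
WHERE  retailer_id IS NOT NULL;
```

The planner can only use a partial index when the query's WHERE clause implies the index predicate; a join or equality condition on retailer_id does so, since it can never match NULL.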

Another thing that may help you is to upgrade to the latest version, 9.1 at the time of writing. There have been a number of substantial improvements in this area.
