为什么后GreesQl的解释分析估计估计有正确的统计数据?
总结,
我正在加入一个大表(〜600k行),其中一个较小的(〜11K行)在PostgreSQL数据库上加入,我需要通过描述性 text> text
字段过滤结果。
通过 bigint
较小表的字段过滤时,优化器会正确估算产生的行数,但是当通过 text> text
小型表过滤时,优化器低估了结果。即使两者之间存在1-1的关系,排数千次。
我不了解这种行为。
可以通过在
环境
select version();
|version |
|---------------------------------------------------------------------------------------------------|
|PostgreSQL 13.6 on x86_64-pc-linux-gnu, compiled by gcc (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0, 64-bit|
坐在Azure灵活服务器上的
。结构和数据
parent 表(11228行):
create table parent_tb as
select id, md5(random()::text) descr
from generate_series(1::bigint,11228::bigint) as a(id);
alter table parent_tb add primary key (id);
create index idx_parent_tb_desc on parent_tb(descr);
示例数据:
select *
from parent_tb
limit 3;
|id |descr |
|---|--------------------------------|
|1 |a34ea09794a959c333116a83b3b77700|
|2 |6c40f8248d28dc541724e86d17f96775|
|3 |99f218398cd5125c803edf7eccc9c832|
child 表(597057行):
create table child_tb (
id_tenant bigint,
id bigint,
descr text,
id_ref bigint references parent_tb(id) null,
primary key (id_tenant, id)
) partition by list(id_tenant);
child> child_tb
表可为每个到
35
的
create table child_tb_ten_1 partition of child_tb for values in ('1');
-- ...
create table child_tb_ten_1 partition of child_tb for values in ('35');
> id_tenant
select *
from child_tb
limit 3;
|id_tenant|id |descr |id_ref|
|---------|---|--------------------------------|------|
|1 |1 |638d2b2aa799d4871e0e2fa73ae607de|1 |
|1 |2 |f06668f8df7eaee3c539d2a1ba613604|1 |
|1 |3 |f588557239d37ec301bae79ab9a61742|1 |
从 1
set default_statistics_target=10000;
vacuum analyze parent_tb;
vacuum analyze child_tb;
值 id_tenant
:
select count(1) n_keys, id_tenant, count(distinct id_ref) n_le, array_agg(distinct (id_ref,p.descr))
from child_tb c join parent_tb p on (c.id_ref=p.id)
group by id_tenant
order by 1 desc
limit 3;
|n_keys|id_tenant|n_le|array_agg |
|------|---------|----|------------------------------------------|
|475759|6 |1 |{"(53,6ea8c6d951f3c8371662509ff8a5e37e)"} |
|18352 |14 |1 |{"(4,a0b360d0344c7018aa98d511392aa26f)"} |
|17102 |2 |1 |{"(17,43595b7e1b092d120bbc8a94afca9583)"} |
租户 6
显然是全球级别的异常值。
统计信息认识到,第一个最常见的值和最常见的频率与租户 6
保留了约80%的数据:
select
(most_common_vals::text::numeric[])[1] mcv_1, (most_common_freqs::text::numeric[])[1] mcf_1,
(most_common_vals::text::numeric[])[2] mcv_2, (most_common_freqs::text::numeric[])[2] mcf_2,
*
from pg_stats where tablename in ('child_tb','child_tb_ten_6');
|mcv_1|mcf_1 |mcv_2|mcf_2 |schemaname|tablename |attname |
|-----|---------|-----|-----------|----------|--------------|---------|
|6 |0.7968402|14 |0.030737434|argodb |child_tb |id_tenant|
| | | | |argodb |child_tb |id |
| | | | |argodb |child_tb |descr |
|53 |0.7968402|4 |0.030737434|argodb |child_tb |id_ref |
|6 |1 | | |argodb |child_tb_ten_6|id_tenant|
| | | | |argodb |child_tb_ten_6|id |
| | | | |argodb |child_tb_ten_6|descr |
|53 |1 | | |argodb |child_tb_ten_6|id_ref |
获取租户 6
的数据用于过滤最终查询:
select distinct c.id_ref,p.descr
from child_tb c join parent_tb p on (c.id_ref=p.id)
where id_tenant=6;
|id_ref|descr |
|------|--------------------------------|
|53 |6ea8c6d951f3c8371662509ff8a5e37e|
行为
请注意:此处显示的每个执行计划都是 dixply Analyze
的结果,因此表示实际情况。
首次尝试 - 访问父母的ID。
结果确定 - 优化器正确识别加入产生的行数。
explain analyze
select * from child_tb c join parent_tb p on (c.id_ref=p.id)
where p.id=53 --fill with the appropriate value at point (1)
and c.id_tenant=6;
|QUERY PLAN |
|---------------------------------------------------------------------------------------------------------------------------------|
|Nested Loop (cost=0.29..17305.28 rows=475759 width=98) (actual time=0.025..88.812 rows=475759 loops=1) |
| -> Index Scan using parent_tb_pkey on parent_tb p (cost=0.29..4.30 rows=1 width=41) (actual time=0.012..0.014 rows=1 loops=1)|
| Index Cond: (id = 53) |
| -> Seq Scan on child_tb_ten_6 c (cost=0.00..12543.38 rows=475759 width=57) (actual time=0.011..52.590 rows=475759 loops=1) |
| Filter: ((id_ref = 53) AND (id_tenant = 6)) |
|Planning Time: 0.183 ms |
|Execution Time: 103.176 ms |
第二次尝试 - 由父母的描述性字段访问。
结果KO - 优化器低估了1000倍加入的结果记录!
explain analyze
select * from child_tb c join parent_tb p on (c.id_ref=p.id)
where p.descr='6ea8c6d951f3c8371662509ff8a5e37e' --fill with the appropriate value at point (1)
and c.id_tenant=6;
|QUERY PLAN |
|-------------------------------------------------------------------------------------------------------------------------------------------------|
|Gather (cost=1004.32..9413.99 rows=42 width=98) (actual time=0.409..119.637 rows=475759 loops=1) |
| Workers Planned: 2 |
| Workers Launched: 2 |
| -> Hash Join (cost=4.32..8409.79 rows=18 width=98) (actual time=0.074..44.309 rows=158586 loops=3) |
| Hash Cond: (c.id_ref = p.id) |
| -> Parallel Seq Scan on child_tb_ten_6 c (cost=0.00..7884.91 rows=198233 width=57) (actual time=0.007..16.501 rows=158586 loops=3) |
| Filter: (id_tenant = 6) |
| -> Hash (cost=4.30..4.30 rows=1 width=41) (actual time=0.020..0.021 rows=1 loops=3) |
| Buckets: 1024 Batches: 1 Memory Usage: 9kB |
| -> Index Scan using idx_parent_tb_desc on parent_tb p (cost=0.29..4.30 rows=1 width=41) (actual time=0.016..0.017 rows=1 loops=3)|
| Index Cond: (descr = '6ea8c6d951f3c8371662509ff8a5e37e'::text) |
|Planning Time: 0.620 ms |
|Execution Time: 134.484 ms |
其他尝试 - 用实现的CTE混淆父母的访问值。
优化器假设平坦的数据分布将租户的行总数 6
分区到不同 id_ref
s的全局数量。期望这种行为。
select
(select count(1) from child_tb where id_tenant=6) cnt_child_6,
(select count(distinct id_ref) from child_tb) cnt_child_dist_refs,
(select count(1) from child_tb where id_tenant=6)/(select count(distinct id_ref) from child_tb) cnt_flat_6_distr
|cnt_child_6|cnt_child_dist_refs|cnt_flat_6_distr|
|-----------|-------------------|----------------|
|475759 |61 |7799 |
结果确定 - 优化器正确地假设联接产生的行数。
explain analyze
with par as materialized (
select *
from parent_tb
where descr='6ea8c6d951f3c8371662509ff8a5e37e' --fill with the appropriate value at point (1)
)
select * from child_tb c join par p on (c.id_ref=p.id)
where c.id_tenant=6;
|QUERY PLAN |
|-------------------------------------------------------------------------------------------------------------------------------------|
|Hash Join (cost=4.33..13220.41 rows=7799 width=97) (actual time=0.031..124.799 rows=475759 loops=1) |
| Hash Cond: (c.id_ref = p.id) |
| CTE par |
| -> Index Scan using idx_parent_tb_desc on parent_tb (cost=0.29..4.30 rows=1 width=41) (actual time=0.014..0.015 rows=1 loops=1)|
| Index Cond: (descr = '6ea8c6d951f3c8371662509ff8a5e37e'::text) |
| -> Seq Scan on child_tb_ten_6 c (cost=0.00..11353.99 rows=475759 width=57) (actual time=0.009..45.301 rows=475759 loops=1) |
| Filter: (id_tenant = 6) |
| -> Hash (cost=0.02..0.02 rows=1 width=40) (actual time=0.017..0.018 rows=1 loops=1) |
| Buckets: 1024 Batches: 1 Memory Usage: 9kB |
| -> CTE Scan on par p (cost=0.00..0.02 rows=1 width=40) (actual time=0.015..0.015 rows=1 loops=1) |
|Planning Time: 0.150 ms |
|Execution Time: 138.972 ms |
结论
我无法理解 42
在第二次尝试中估计的行估计行。我希望它能实现英国与PK之间的1-1关系,并使用该关系,例如首次尝试。此外, 42
值对我来说看起来是任意的,因为我无法从 7799
估算 附加尝试的估计不同,这对我来说是任意的。 。
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论