【发布时间】:2019-03-29 14:41:03
【问题描述】:
在具有相同数据库的类似 Amazon RDS PostgreSQL 服务器版本 9.6.11 上,我为一个 SQL 查询获得不同的执行计划。
我尝试重新创建索引并运行ANALYZE 和VACUUM。没有任何帮助。
我的查询:
SELECT "users_employee"."id",
(
SELECT U0."created"
FROM "surveys_surveyrequest" U0
WHERE (U0."confirmed" IS NULL
AND U0."skipped" IS NULL
AND U0."from_member_id" = ("users_employee"."id"))
ORDER BY U0."created" ASC
LIMIT 1) AS "earliest_request_date"
FROM "users_employee"
ORDER BY "users_employee"."id" ASC;
问题表信息:
create table surveys_surveyrequest
(
id integer default nextval('public.surveys_surveyrequest_id_seq'::regclass) not null
constraint surveys_surveyrequest_pkey
primary key,
created timestamp with time zone not null,
skipped timestamp with time zone,
from_member_id integer
constraint surveys_surveyreques_from_member_id_81f0e82e_fk_users_emp
references users_employee
deferrable initially deferred,
confirmed timestamp with time zone
);
create index surveys_sur_confirm_48bfa6_idx
on surveys_surveyrequest (confirmed);
create index surveys_sur_created_099976_idx
on surveys_surveyrequest (created);
create index surveys_surveyrequest_70b76ad7
on surveys_surveyrequest (from_member_id);
计划: 答:
QUERY PLAN
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Index Only Scan using auth_user_pkey on users_employee (cost=0.28..991903.69 rows=1478 width=12) (actual time=139.054..195486.465 rows=1478 loops=1)
Heap Fetches: 51
Buffers: shared hit=296637323
SubPlan 1
-> Limit (cost=0.42..671.07 rows=1 width=8) (actual time=132.258..132.259 rows=1 loops=1478)
Buffers: shared hit=296637288
-> Index Scan using surveys_sur_created_099976_idx on surveys_surveyrequest u0 (cost=0.42..24143.63 rows=36 width=8) (actual time=132.256..132.256 rows=1 loops=1478)
Filter: ((confirmed IS NULL) AND (skipped IS NULL) AND (from_member_id = users_employee.id))
Rows Removed by Filter: 405780
Buffers: shared hit=296637288
Planning time: 0.188 ms
Execution time: 195487.356 ms
(12 rows)
乙:
QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Index Only Scan using auth_user_pkey on users_employee (cost=0.28..886476.74 rows=1578 width=12) (actual time=0.977..1043.414 rows=1578 loops=1)
Heap Fetches: 0
Buffers: shared hit=98270 read=8
SubPlan 1
-> Limit (cost=561.74..561.74 rows=1 width=8) (actual time=0.658..0.659 rows=1 loops=1578)
Buffers: shared hit=98266 read=5
-> Sort (cost=561.74..561.79 rows=22 width=8) (actual time=0.658..0.658 rows=1 loops=1578)
Sort Key: u0.created
Sort Method: quicksort Memory: 25kB
Buffers: shared hit=98266 read=5
-> Bitmap Heap Scan on surveys_surveyrequest u0 (cost=474.19..561.63 rows=22 width=8) (actual time=0.646..0.652 rows=13 loops=1578)
Recheck Cond: ((from_member_id = users_employee.id) AND (confirmed IS NULL))
Filter: (skipped IS NULL)
Rows Removed by Filter: 3
Heap Blocks: exact=9707
Buffers: shared hit=98266 read=5
-> BitmapAnd (cost=474.19..474.19 rows=23 width=0) (actual time=0.641..0.641 rows=0 loops=1578)
Buffers: shared hit=88562 read=2
-> Bitmap Index Scan on surveys_surveyrequest_70b76ad7 (cost=0.00..11.29 rows=382 width=0) (actual time=0.023..0.023 rows=258 loops=1578)
Index Cond: (from_member_id = users_employee.id)
Buffers: shared hit=5847 read=2
-> Bitmap Index Scan on surveys_sur_confirm_48bfa6_idx (cost=0.00..462.64 rows=24829 width=0) (actual time=0.826..0.826 rows=24756 loops=1165)
Index Cond: (confirmed IS NULL)
Buffers: shared hit=82715
Planning time: 0.234 ms
Execution time: 1043.680 ms
(26 rows)
Time: 1044,547 ms (00:01,045)
我希望生成相同的查询计划,但这不会发生。 可能是什么原因? B计划如何实现执行?
【问题讨论】:
-
您是否尝试禁用 GEQO ? (SET geqo=false) 这样做后你还有不同的QP吗?
-
不。仅当涉及的表超过
geqo_threshold时才会生效。 -
不知道它是否会对您的问题产生影响,但看起来
DISTINCT ON对您的查询来说会更简单。它应该允许您消除子查询并只拥有一个JOIN。 -
@jpmc26 感谢您的想法。据我所知,连接比子查询更快、更可取。真是个好主意。我会测试!
-
您可能还需要考虑更具体的索引。照原样,即使您更快的查询计划也必须位图和两个索引,这可能是一个代价高昂的过程,然后再执行另一个过滤器。如果您在这两列是
NULL(例如CREATE INDEX ON surveys_surveyrequest (from_member_id, created) WHERE confirmed IS NULL AND skipped IS NULL)的条件下有一个过滤索引,那么引擎应该能够更快地找到匹配的行。我也会尝试在该索引中包含和省略created。
标签: sql postgresql query-performance