Slow query optimisation in Postgres

SELECT "jobs".* FROM "jobs" WHERE "jobs"."status" IN (1, 2, 3, 4) ORDER BY "jobs"."due_date" ASC LIMIT 5;

Limit  (cost=0.42..1844.98 rows=5 width=2642) (actual time=16927.150..18151.643 rows=1 loops=1)
  ->  Index Scan using index_jobs_on_due_date on jobs  (cost=0.42..1278647.41 rows=3466 width=2642) (actual time=16927.148..18151.641 rows=1 loops=1)
        Filter: (status = ANY ('{1,2,3,4}'::integer[]))
        Rows Removed by Filter: 595627
Planning time: 0.205 ms
Execution time: 18151.684 ms

CREATE INDEX index_jobs_on_due_date ON public.jobs USING btree (due_date)
CREATE INDEX index_jobs_on_due_date_and_status ON public.jobs USING btree (due_date, status)
CREATE INDEX index_jobs_on_status ON public.jobs USING btree (status)
CREATE UNIQUE INDEX jobs_pkey ON public.jobs USING btree (id)

For this query:

SELECT  j.*
FROM "jobs" j
WHERE j."status" IN (1, 2, 3, 4)
ORDER BY j."due_date" ASC
LIMIT 5;

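The "obvious" index is on (status), but that alone may not help, because the matching rows still have to be sorted by due_date. The goal is to get rid of the sorting. For that you need an index on jobs(status, due_date), with status first (the existing index on (due_date, status) has the columns in the opposite order), and a rewritten query: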

select j.*
from ((select j.*
       from jobs j
       where j.status = 1
       order by j.due_date asc
       limit 5
      ) union all
      (select j.*
       from jobs j
       where j.status = 2
       order by j.due_date asc
       limit 5
      ) union all
      (select j.*
       from jobs j
       where j.status = 3
       order by j.due_date asc
       limit 5
      ) union all
      (select j.*
       from jobs j
       where j.status = 4
       order by j.due_date asc
       limit 5
      )
     ) j
order by due_date
limit 5;

The subqueries should each use the composite index. The final sort is then on at most 20 rows, which should be fast.
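
For reference, the composite index these subqueries rely on could be created roughly as follows. This is a minimal sketch: the index name is made up here, following the naming pattern of the existing indexes.

```sql
-- Hypothetical index name. status comes first so that each subquery
-- (status = N) can read its rows already ordered by due_date and stop
-- after 5 of them.
CREATE INDEX index_jobs_on_status_and_due_date
    ON public.jobs USING btree (status, due_date);
```

On a busy table, CREATE INDEX CONCURRENTLY avoids blocking writes while the index is built, at the cost of a slower build.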

EDIT:

Here is a related idea, with the same index:

SELECT j.*
FROM (SELECT  j.*,
              ROW_NUMBER() OVER (PARTITION BY j.status ORDER BY j.due_date ASC) as seqnum
      FROM "jobs" j
     ) j
WHERE j.status in (1, 2, 3, 4) AND seqnum <= 5
ORDER BY j.due_date ASC
LIMIT 5;

This can use the index for the ROW_NUMBER() calculation. It might still require a scan of the whole table, but the final sort is limited to at most 20 rows, so its cost is negligible.
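
Which rewrite wins depends on the data, so it is worth checking the plan rather than guessing. A sketch, assuming an index on jobs(status, due_date) already exists; note that EXPLAIN with ANALYZE actually executes the query:

```sql
-- Compare this output with the original plan: look at which index is
-- scanned, whether a Sort node remains, and the total execution time.
EXPLAIN (ANALYZE, BUFFERS)
SELECT j.*
FROM (SELECT j.*,
             ROW_NUMBER() OVER (PARTITION BY j.status ORDER BY j.due_date ASC) AS seqnum
      FROM jobs j
     ) j
WHERE j.status IN (1, 2, 3, 4) AND seqnum <= 5
ORDER BY j.due_date ASC
LIMIT 5;
```

The same wrapper can be put in front of the UNION ALL version to compare the two side by side.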
