Query plan on indexed partitioned table. Avoid sequential scan

Tags:

postgresql

In a postgres 10.1 server I have a very big table partitioned by list value and a view which only filterthe table by the partition column.

When using the view, the planner is not giving me the best possible plan, i mean, scanning only the selected children tables. Instead it always scans all partitions of the parent table.

I have created a index by the partition column and a constraint tool. The DDL:


                                  Table "parted_mob_matrix"
    Column    |         Type          | Collation | Nullable | Default | Storage  | Stats target | Description 
--------------+-----------------------+-----------+----------+---------+----------+--------------+-------------
 id           | integer               |           | not null |         | plain    |              | 
 delivery_id  | integer               |           |          |         | Partition key: LIST (delivery_id)
Partitions: parted_mob_matrix_delivery_0 FOR VALUES IN (0),
            parted_mob_matrix_delivery_1 FOR VALUES IN (1),
            parted_mob_matrix_delivery_10 FOR VALUES IN (10),
            ....
            parted_mob_matrix_delivery_10 FOR VALUES IN (620),


                            Table "parted_mob_matrix_delivery_620"
    Column    |         Type          | Collation | Nullable | Default | Storage  | Stats target | Description 
--------------+-----------------------+-----------+----------+---------+----------+--------------+-------------
 id           | integer               |           | not null |         | plain    |              | 
 delivery_id  | integer               |           |          |         | plain    |              | 
Partition of: parted_mob_matrix FOR VALUES IN (620)
Partition constraint: ((delivery_id IS NOT NULL) AND (delivery_id = ANY (ARRAY[620])))
Indexes:
    "parted_mob_matrix_delivery_620_delivery_id_idx" btree (delivery_id)
Check constraints:
    "parted_mob_matrix_delivery_620_check_delivery" CHECK (delivery_id = 620)

Mi view code:

EXPLAIN SELECT
  parted_mob_matrix.*
FROM
  parted_mob_matrix
1) where parted_mob_matrix.delivery_id in (620)
2) where parted_mob_matrix.delivery_id in (select 620)

I need to use the 2 version here simplified (It's a real query to another very little table) but it plans very different and worse.

QUERY PLAN 1 (good on efficency):

Append  (cost=0.00..78308.11 rows=758031 width=738)

  ->  Seq Scan on parted_mob_matrix_delivery_620  (cost=0.00..78308.11 rows=758031 width=738)

        Filter: (delivery_id = 620)

QUERY PLAN 2 (rowset, slow):


Hash Semi Join  (cost=0.01..25077311.20 rows=7539693 width=860)

  Hash Cond: (parted_mob_matrix_delivery_0.delivery_id = (620))

  ->  Append  (cost=0.00..24942162.20 rows=211111399 width=859)

        ->  Seq Scan on parted_mob_matrix_delivery_0  (cost=0.00..10.75 rows=250 width=294)

        ->  Seq Scan on parted_mob_matrix_delivery_1  (cost=0.00..10.75 rows=250 width=294)

 -- All the child tables

        ->  Seq Scan on parted_mob_matrix_delivery_620  (cost=0.00..77929.09 rows=758031 width=738)

 -- All the child tables are scanned

How can I use the plan 1 on a query which a where like 2?

457

asked Jul 17 '19 17:07

1 Answers

You can solve your problem in PostgreSQL v10 wrapping the input of WHERE condition as an IMMUTABLE plpgsql function which returns an ARRAY of integers. By definition, an IMMUTABLE plpgsql function "(...) allows the optimizer to pre-evaluate the function when a query calls it with constant arguments (...)" (https://www.postgresql.org/docs/10/xfunc-volatility.html).

This solution should work.

Example:

SELECT
  parted_mob_matrix.*
FROM
  parted_mob_matrix
WHERE parted_mob_matrix.delivery_id = ANY(get_deliveries('cod_011'))

The function you could use:

CREATE OR REPLACE FUNCTION get_deliveries(
    high_level_id TEXT
)
RETURNS INTEGER[]
AS $BODY$
DECLARE
    _delivery_ids INTEGER[];
BEGIN
  EXECUTE format(
    $$
    SELECT ARRAY_AGG(delivery_id)
    FROM
        your_table_with_all_delivery_ids
    WHERE
        high_level_id = '%1$s'
    ;
    $$, high_level_id
  ) INTO _delivery_ids;
  RETURN _delivery_ids;
END;
$BODY$
LANGUAGE plpgsql IMMUTABLE;

answered Oct 21 '22 11:10

cayetano benavent

Related questions
                            
                                Group by Concat Teradata
                            
                                Find the 3rd Maximum Salary for each department based on table data
                            
                                MySQL command output too wide in command-line client [duplicate]
                            
                                What is off page in Mysql?
                            
                                Comparing empty string with null value - SQL Server
                            
                                Selecting latest consecutive records that match a condition with PostgreSQL
                            
                                Postgres GROUP BY Array Column
                            
                                Add a new column in table with a sequence - Oracle
                            
                                extract the date from a timestamp value variable in Impala
                            
                                How to do a Select in another Select with Postgresql
                            
                                How to decode BASE64 in Standard SQL?
                            
                                Insert a pandas dataframe into a SQLite table
                            
                                how to select only by date from timestamp column in postgres?
                            
                                Invalid POLYGON bigQuery
                            
                                Updating rows in jOOQ with joins
                            
                                How to change a UNION to a IN clause?
                            
                                How to remove garbage data from array output
                            
                                How to query and iterate over array of structures in Athena (Presto)?
                            
                                In Oracle, what does [select * from table()] mean?
                            
                                Fetch dynamic table name in trigger

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Query plan on indexed partitioned table. Avoid sequential scan

Tags:

sql

postgresql

Pablo Caro

People also ask

1 Answers

cayetano benavent

Recent Activity

Donate For Us