I have situation, where running a query that filters by an indexed column in a partitioned table, performs a full table scan. Apparently , this is a known issue in postgresql, and it's explained in detail here. Is there a more elegant way around this other than performing a query on each partition, and then performing a UNION on all of the results?

Indexes work just fine to do a scan only of the relevant partitions in PostgreSQL. But, you have to set everything up properly for it to work, and it's easy to miss a step in the long list of things documented at http://www.postgresql.org/docs/current/static/ddl-partitioning.html The main thing to realize is that in order to avoid a sequential scan, you have to provide enough information to PostgreSQL so it can prove some partitions cannot have the data you're looking for; then they are skipped as potential sources for the query results. The article you link to points this out as a solution to the seq scan problem: "If you add range constraints to the date field of each partition, this query can be optimized into a loop where you query the “latest” partition first and work backwards until you find a single value that is higher than the range of all the remaining partitions."--but doesn't show the improved plan you'd see after that change. Some common mistakes you might have made: -The constraint_exclusion parameter in the postgresql.conf file is off by default. With that default, you won't get what you expect. -Didn't create non-overlapping partitions using CHECK, which keeps the planner from knowing what's inside each of them. It's possible to miss this step but still get your data into the right partitions properly, the planner just won't know that. -Did not put an index on each partition, only created one on the master table. This will give you a sequential scan just on the relevant partition, so not as bad as the above but not good either. There's some work to make this all easier in upcoming PostgreSQL releases (setting constraint_partition is fairly automatic in 8.4 and some sort of partition setup automation is being worked in). Right now, if you follow the instructions carefully and avoid all these problems, it should work.

How can I use an index on a partitioned table in postgresql 8.3.7

1 Answers

Indexes work just fine to do a scan only of the relevant partitions in PostgreSQL. But, you have to set everything up properly for it to work, and it's easy to miss a step in the long list of things documented at http://www.postgresql.org/docs/current/static/ddl-partitioning.html

The main thing to realize is that in order to avoid a sequential scan, you have to provide enough information to PostgreSQL so it can prove some partitions cannot have the data you're looking for; then they are skipped as potential sources for the query results. The article you link to points this out as a solution to the seq scan problem: "If you add range constraints to the date field of each partition, this query can be optimized into a loop where you query the “latest” partition first and work backwards until you find a single value that is higher than the range of all the remaining partitions."--but doesn't show the improved plan you'd see after that change.

Some common mistakes you might have made:

-The constraint_exclusion parameter in the postgresql.conf file is off by default. With that default, you won't get what you expect.

-Didn't create non-overlapping partitions using CHECK, which keeps the planner from knowing what's inside each of them. It's possible to miss this step but still get your data into the right partitions properly, the planner just won't know that.

-Did not put an index on each partition, only created one on the master table. This will give you a sequential scan just on the relevant partition, so not as bad as the above but not good either.

There's some work to make this all easier in upcoming PostgreSQL releases (setting constraint_partition is fairly automatic in 8.4 and some sort of partition setup automation is being worked in). Right now, if you follow the instructions carefully and avoid all these problems, it should work.

answered Nov 15 '22 07:11

Greg Smith

Related questions
                            
                                DBLINK vs Postgres_FDW, which one may provide better performance?
                            
                                High Sierra + Python + Postgresql error: Illegal instruction: 4
                            
                                How does postgresql lock tables when inserting and selecting?
                            
                                How to create buckets and groups within those buckets using PostgresQL
                            
                                PostgreSQL - create an auto-increment column for non-primary key
                            
                                How to compare numeric in PostgreSQL JSONB
                            
                                Django ORM raw delete query not deleting records
                            
                                Create a timestamp with time zone in PostgreSQL from Liquibase XML
                            
                                Reset identity column with last value of table's identity in postgres
                            
                                Spring Docker container cannot access Postgres Docker container
                            
                                operator does not exist: json @> unknown
                            
                                Performance impact of adding unique constraint to existing postgres index
                            
                                How to change Postgresql max_connections config via Kubernetes statefulset environment variable?
                            
                                org.postgresql.util.PGobject not available in org.postgresql
                            
                                Permission denied when trying to load into Postgres RDS from S3 with a path that contains the equals sign
                            
                                Pgbouncer: how to run within a kubernetes cluster properly
                            
                                Error installing PostGIS for PostgreSQL on Mac
                            
                                Postgres ON CONFLICT DO UPDATE only non null values in python
                            
                                ERROR: cannot alter type of a column used by a view or rule DETAIL: rule _RETURN on view depends on column "status"
                            
                                Helm Postgres password authentication failed

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I use an index on a partitioned table in postgresql 8.3.7

Tags:

indexing

postgresql

partitioning

Tom Feiner

People also ask

1 Answers

Greg Smith

Recent Activity

Donate For Us