Multiple indexes vs single index on multiple columns in postgresql

Tags:

postgresql

I could not reach any conclusive answers reading some of the existing posts on this topic.

I have certain data at 100 locations the for past 10 years. The table has about 800 million rows. I need to primarily generate yearly statistics for each location. Some times I need to generate monthly variation statistics and hourly variation statistics as well. I'm wondering if I should generate two indexes - one for location and another for year or generate one index on both location and year. My primary key currently is a serial number (Probably I could use location and timestamp as the primary key).

Thanks.

504

asked Sep 02 '16 16:09

let_there_be_light

1 Answers

Regardless of how many indices have you created on relation, only one of them will be used in a certain query (which one depends on query, statistics etc). So in your case you wouldn't get a cumulative advantage from creating two single column indices. To get most performance from index I would suggest to use composite index on (location, timestamp).

Note, that queries like ... WHERE timestamp BETWEEN smth AND smth will not use the index above while queries like ... WHERE location = 'smth' or ... WHERE location = 'smth' AND timestamp BETWEEN smth AND smth will. It's because the first attribute in index is crucial for searching and sorting.

Don't forget to perform

ANALYZE;

after index creation in order to collect statistics.

Update: As @MondKin mentioned in comments certain queries can actually use several indexes on the same relation. For example, query with OR clauses like a = 123 OR b = 456 (assuming that there are indexes for both columns). In this case postgres would perform bitmap index scans for both indexes, build a union of resulting bitmaps and use it for bitmap heap scan. In certain conditions the same scheme may be used for AND queries but instead of union there would be an intersection.

147

answered Oct 04 '22 04:10

Ildar Musin

Related questions
                            
                                Django: What are the best practices to migrate a project from sqlite to PostgreSQL
                            
                                Unable to install psycopg2 (pip install psycopg2)
                            
                                Rails Migrations: tried to change the type of column from string to integer
                            
                                postgres update after select
                            
                                Getting the last word from a Postgres string, declaratively
                            
                                Adding months to a date in PostgreSQL shows syntax error
                            
                                PG::InvalidParameterValue: ERROR: invalid value for parameter "client_min_messages": "panic"
                            
                                How to take backup of functions only in Postgres
                            
                                TypeError: rxjs_1.lastValueFrom is not a function
                            
                                How does pgBouncer help to speed up Django
                            
                                Postgresql gem install pg 0.18.4 passes, bundle install fails
                            
                                Postgres usage of btree indexes vs MySQL B+trees
                            
                                PostgreSQL error: could not connect to database template1: could not connect to server: No such file or directory
                            
                                What's the cause of "PGError: FATAL: terminating connection due to administrator command" on heroku?
                            
                                Recover postgreSQL databases from raw physical files
                            
                                Best way to learn PostgreSQL stored procedures? [closed]
                            
                                OFFSET vs. ROW_NUMBER()
                            
                                How to determine what type of index to use in Postgres?
                            
                                SQL function return-type: TABLE vs SETOF records
                            
                                Why is it a vacuum not needed with Mysql compared to the PostgreSQL?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With