PostgreSQL: GIN or GiST indexes?

Q: Which index is best in PostgreSQL?

B-tree indexes B-tree is the default index in Postgres and is best used for specific value searches, scanning ranges, data sorting or pattern matching.

Q: What is GiST in Postgres?

GiST stands for Generalized Search Tree. It is a balanced, tree-structured access method, that acts as a base template in which to implement arbitrary indexing schemes.

Q: What is GIN index in PostgreSQL?

GIN stands for Generalized Inverted Index. GIN is designed for handling cases where the items to be indexed are composite values, and the queries to be handled by the index need to search for element values that appear within the composite items.

Q: What are GiST indexes?

A GiST index is lossy, meaning that the index might produce false matches, and it is necessary to check the actual table row to eliminate such false matches. (PostgreSQL does this automatically when needed.) GiST indexes are lossy because each document is represented in the index by a fixed-length signature.

Tags:

indexing

postgresql

gwt-gin

gist-index

From what information I could find, they both solve the same problems - more esoteric operations like array containment and intersection (&&, @>, <@, etc). However I would be interested in advice about when to use one or the other (or neither possibly).
The PostgreSQL documentation has some information about this:

GIN index lookups are about three times faster than GiST
GIN indexes take about three times longer to build than GiST
GIN indexes are about ten times slower to update than GiST
GIN indexes are two-to-three times larger than GiST

However I would be particularly interested to know if there is a performance impact when the memory to index size ration starts getting small (ie. the index size becomes much bigger than the available memory)? I've been told on the #postgresql IRC channel that GIN needs to keep all the index in memory, otherwise it won't be effective, because, unlike B-Tree, it doesn't know which part to read in from disk for a particular query? The question would be: is this true (because I've also been told the opposite of this)? Does GiST have the same restrictions? Are there other restrictions I should be aware of while using one of these indexing algorithms?

845

asked Aug 22 '08 05:08

Grey Panther

1 Answers

First of all, do you need to use them for text search indexing? GIN and GiST are index specialized for some data types. If you need to index simple char or integer values then the normal B-Tree index is the best.
Anyway, PostgreSQL documentation has a chapter on GIST and one on GIN, where you can find more info.
And, last but not least, the best way to find which is best is to generate sample data (as much as you need to be a real scenario) and then create a GIST index, measuring how much time is needed to create the index, insert a new value, execute a sample query. Then drop the index and do the same with a GIN index. Compare the values and you will have the answer you need, based on your data.

141

answered Oct 25 '22 18:10

Andrea Bertani

Related questions
                            
                                Solution for speeding up a slow SELECT DISTINCT query in Postgres
                            
                                How to change DATABASE_URL for a heroku application
                            
                                How do I speed up counting rows in a PostgreSQL table?
                            
                                pq: could not resize shared memory segment. No space left on device
                            
                                PostgreSQL - next serial value in a table
                            
                                How to write a postgresql query for getting only the date part of timestamp field, from a table
                            
                                How do I reset the postgresql 9.2 default user (usually 'postgres') password on mac os x 10.8.2?
                            
                                Django: How to write query to sort using multiple columns, display via template
                            
                                Setting a connect timeout with PDO
                            
                                How can I move postgresql data to another directory on Ubuntu over Amazon EC2?
                            
                                Refactor a PL/pgSQL function to return the output of various SELECT queries
                            
                                PostgreSQL how to create a copy of a database or schema?
                            
                                Fatal error: Call to undefined function pg_connect()
                            
                                How to reset the sequence for IDs on PostgreSQL tables
                            
                                How can I create a constraint to check if an email is valid in postgres?
                            
                                Determining the OID of a table in Postgres 9.1?
                            
                                postgresql- restoring .dump file
                            
                                Permission denied when trying to import a CSV file from PGAdmin
                            
                                Convert PostgreSQL array to PHP array
                            
                                Show table structure and list of tables in PostgreSQL [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With