I have a table with a datetime field "updated_at". A lot of my queries will be range queries on this field, such as selecting rows with updated_at > a certain date.
I already added an index on updated_at, but most of my queries are still very slow, even when I add a limit to the number of rows returned.
What else can I do to optimize queries that query on datetime fields?
Usually database optimizers won't choose to use indexes for open-ended ranges, such as updated_at > somedate.
But in many cases the datetime column won't exceed "now", so you can preserve the semantics of > somedate by converting the condition to a closed range using between, like this:
where updated_at between somedate and current_timestamp
A between predicate is much more likely to cause the optimizer to choose an index.
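As a sketch, assuming a table named events with an updated_at column (both names are hypothetical placeholders for your own schema):

```sql
-- Hypothetical table and index; adjust the names to your schema.
CREATE INDEX idx_events_updated_at ON events (updated_at);

-- Open-ended range: the planner may fall back to a sequential scan.
SELECT * FROM events WHERE updated_at > '2012-01-01';

-- Closed range: returns the same rows as long as updated_at never
-- exceeds the current time, and is more likely to use the index.
SELECT * FROM events
WHERE updated_at BETWEEN '2012-01-01' AND current_timestamp;
```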
Please post if this approach improved your query’s performance.
For any given query, the use of an index depends on the cost of using that index compared to a sequential scan.
Frequently developers think that because there is an index, a query should run faster, and if a query runs slowly, an index is the solution. This is usually true when the query will return few tuples. But as the number of tuples in the result increases, the cost of using an index can increase until it exceeds the cost of a sequential scan.
You are using Postgres. Postgres does not maintain clustering on a given attribute (the CLUSTER command orders the table once, but the ordering is not kept up to date as rows change). That means that Postgres, when confronted with a range query (of the type att > a and att < b), needs to compute an estimate of the number of tuples in the result (make sure you VACUUM and ANALYZE your database frequently so these statistics stay accurate) and the cost of using an index compared to doing a sequential scan. It will then decide which method to use.
You can inspect this decision by running
EXPLAIN ANALYZE <query>;
in psql. It will tell you if it uses an index or not.
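For example, using the hypothetical events table from above (the plan output is abbreviated and illustrative; the exact node types and numbers depend on your data and statistics):

```sql
EXPLAIN ANALYZE
SELECT * FROM events
WHERE updated_at BETWEEN '2012-01-01' AND current_timestamp;

-- In the resulting plan, look for a line like one of these:
--   Index Scan using idx_events_updated_at on events ...
--   Seq Scan on events ...  Filter: (updated_at >= ...)
```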
If you really, really want to use the indexes instead of a sequential scan (sometimes it is needed) and you really really know what you are doing, you can change the cost of a sequential scan in the planner constants or disable sequential scans in favor of any other method. See this page for the details:
http://www.postgresql.org/docs/9.1/static/runtime-config-query.html
Make sure you browse the correct version of the documentation.
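A minimal sketch of such an experiment, again using the hypothetical events table (enable_seqscan and random_page_cost are real planner settings; the query and values are illustrative):

```sql
-- Session-scoped experiment only; do not change these globally
-- without measuring. enable_seqscan = off does not forbid sequential
-- scans, it just makes the planner penalize them heavily.
SET enable_seqscan = off;
EXPLAIN ANALYZE SELECT * FROM events WHERE updated_at > '2012-01-01';
RESET enable_seqscan;

-- Alternatively, lower random_page_cost (e.g. on SSDs) so index
-- scans look relatively cheaper to the cost model:
SET random_page_cost = 1.1;
```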
--dmg