My queries get very slow when I add a LIMIT 1.
I have a table object_values with timestamped values for objects:

 timestamp  | objectID | value
------------+----------+--------
 2014-01-27 |      234 | ksghdf
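For reference, a minimal definition that matches the sample row (the column types are assumptions; only the data is shown above):

-- Hypothetical table definition; types assumed from the sample row
CREATE TABLE object_values (
    timestamp  timestamptz,
    objectID   integer,
    value      text
);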
Per object I want to get the latest value:
SELECT * FROM object_values WHERE (objectID = 53708) ORDER BY timestamp DESC LIMIT 1;
This query is very slow when there are no values for the given objectID (I cancelled it after more than 10 minutes); it is fast if there are results. If I remove the limit, it tells me almost instantly that there are no results:
SELECT * FROM object_values WHERE (objectID = 53708) ORDER BY timestamp DESC;
...
Time: 0.463 ms
An EXPLAIN shows me that the query without the limit uses the index, whereas the query with LIMIT 1 does not make use of the index:
Slow query:
explain SELECT * FROM object_values WHERE (objectID = 53708) ORDER BY timestamp DESC limit 1;

                                                       QUERY PLAN
----------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=0.00..2350.44 rows=1 width=126)
   ->  Index Scan Backward using object_values_timestamp on object_values  (cost=0.00..3995743.59 rows=1700 width=126)
         Filter: (objectID = 53708)
Fast query:
explain SELECT * FROM object_values WHERE (objectID = 53708) ORDER BY timestamp DESC;

                                                  QUERY PLAN
--------------------------------------------------------------------------------------------------------------
 Sort  (cost=6540.86..6545.11 rows=1700 width=126)
   Sort Key: timestamp
   ->  Index Scan using object_values_objectID on object_values  (cost=0.00..6449.65 rows=1700 width=126)
         Index Cond: (objectID = 53708)
The table contains 44,884,559 rows and 66,762 distinct objectIDs. I have separate indexes on both fields: timestamp and objectID.
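Assuming the index names that appear in the EXPLAIN output above, the two indexes would have been created roughly like this:

-- Two single-column indexes, one per field (names taken from the query plans)
CREATE INDEX object_values_timestamp ON object_values (timestamp);
CREATE INDEX object_values_objectID  ON object_values (objectID);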
I have run a VACUUM ANALYZE on the table and I have reindexed it.
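For reference, those maintenance commands are:

VACUUM ANALYZE object_values;
REINDEX TABLE object_values;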
Additionally, the slow query becomes fast when I set the limit to 3 or higher:
explain SELECT * FROM object_values WHERE (objectID = 53708) ORDER BY timestamp DESC limit 3;

                                                     QUERY PLAN
--------------------------------------------------------------------------------------------------------------------
 Limit  (cost=6471.62..6471.63 rows=3 width=126)
   ->  Sort  (cost=6471.62..6475.87 rows=1700 width=126)
         Sort Key: timestamp
         ->  Index Scan using object_values_objectID on object_values  (cost=0.00..6449.65 rows=1700 width=126)
               Index Cond: (objectID = 53708)
In general, I assume this has to do with the planner making wrong assumptions about the execution costs and therefore choosing a slower execution plan.
Is this the real reason? Is there a solution for this?
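One way to see which row counts the planner is working with is to compare its estimate against the column statistics it is derived from (a sketch; pg_stats is the standard statistics view, and the SET STATISTICS value is only an example):

-- Row estimate the planner uses for this objectID (compare against the real count)
EXPLAIN SELECT * FROM object_values WHERE objectID = 53708;

-- Per-column statistics the estimate is based on
SELECT attname, n_distinct, null_frac, correlation
FROM pg_stats
WHERE tablename = 'object_values';

-- If the estimates are far off, more detailed statistics sometimes help
ALTER TABLE object_values ALTER COLUMN objectID SET STATISTICS 1000;
ANALYZE object_values;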
PostgreSQL attempts to do a lot of its work in memory and spreads writes to disk out over time to minimize bottlenecks, but on an overloaded system with heavy writing, heavy reads and writes can slow the whole system down as it catches up on the demand.
The PostgreSQL LIMIT clause is used to get a subset of the rows generated by a query. It is an optional clause of the SELECT statement. The LIMIT clause can be combined with the OFFSET clause to skip a specific number of rows before returning the limited result set.
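For example (the values here are illustrative only):

-- Skip the first 20 rows, then return the next 10
SELECT *
FROM object_values
ORDER BY timestamp DESC
LIMIT 10 OFFSET 20;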
A more traditional way to attack slow queries is to make use of PostgreSQL's slow query log. The idea: if a query takes longer than a certain amount of time, a line is written to the log, so slow queries can easily be spotted and developers and administrators know where to look. In a default configuration the slow query log is not active.

When a query takes too long for whatever reason, auto_explain can be used as well. The idea here: if a query exceeds a certain threshold, PostgreSQL writes its execution plan to the logfile for later inspection. The LOAD command loads the auto_explain module into a database connection.
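A minimal configuration sketch for both (the thresholds are arbitrary examples; ALTER SYSTEM and LOAD require superuser rights):

-- Slow query log: log every statement that runs longer than 1 second
ALTER SYSTEM SET log_min_duration_statement = '1s';
SELECT pg_reload_conf();

-- auto_explain for the current connection: log the plan of anything over 500 ms
LOAD 'auto_explain';
SET auto_explain.log_min_duration = '500ms';
SET auto_explain.log_analyze = on;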
You can avoid this issue by adding an unneeded second column to the ORDER BY clause. Because the requested ordering then no longer matches the timestamp index exactly, the planner stops choosing the backward scan of that index and falls back to the objectID index:

SELECT * FROM object_values WHERE (objectID = 53708) ORDER BY timestamp DESC, objectID LIMIT 1;
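Another common fix for this access pattern (not part of the answer above, and the index name is just an example) is a composite index that covers both the filter and the sort, so the original LIMIT 1 query can be served directly:

-- Composite index matching: WHERE objectID = ? ORDER BY timestamp DESC
CREATE INDEX object_values_objectid_ts
    ON object_values (objectID, timestamp DESC);

SELECT *
FROM object_values
WHERE objectID = 53708
ORDER BY timestamp DESC
LIMIT 1;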