I want to improve this slow query, I thinking to add an index, but I don't know what index type is better to my case. <pre class="prettyprint"><code>SELECT COUNT(*) ct FROM events WHERE dtt AT TIME ZONE 'America/Santiago' >= date(now() AT TIME ZONE 'America/Santiago') + interval '1s' </code></pre> Query Plan: <pre class="prettyprint"><code>"Aggregate (cost=128032.03..128032.04 rows=1 width=0) (actual time=3929.083..3929.083 rows=1 loops=1)" " -> Seq Scan on events (cost=0.00..125937.68 rows=837742 width=0) (actual time=113.080..3926.972 rows=25849 loops=1)" " Filter: (timezone('America/Santiago'::text, dtt) >= (date(timezone('America/Santiago'::text, now())) + '00:00:01'::interval))" " Rows Removed by Filter: 2487386" "Planning time: 0.179 ms" "Execution time: 3929.136 ms" </code></pre> <ul> <li>The query gets the count of events of the day.</li> <li>dtt is a timestamp with time zone column.</li> <li>I'm using Postgresql 9.4.</li> </ul> Note: With the Erwin advices the query run a little faster but still I think isn't fast enough. <pre class="prettyprint"><code>"Aggregate (cost=119667.76..119667.77 rows=1 width=0) (actual time=3687.151..3687.152 rows=1 loops=1)" " -> Seq Scan on vehicle_events (cost=0.00..119667.14 rows=250 width=0) (actual time=104.635..3687.068 rows=469 loops=1)" " Filter: (dtt >= timezone('America/Santiago'::text, date_trunc('day'::text, timezone('America/Santiago'::text, now()))))" " Rows Removed by Filter: 2513337" "Planning time: 0.164 ms" "Execution time: 3687.204 ms" </code></pre>

First, fix your query to make the predicate sargable: <pre class="prettyprint"><code>SELECT count(*) AS ct FROM events WHERE dtt >= date_trunc('day', now() AT TIME ZONE 'America/Santiago') AT TIME ZONE 'America/Santiago' </code></pre> Use the column value as is and move all calculations to the parameter. That's right, after deriving the local start of the day, apply <code>AT TIME ZONE</code> a second time to convert the <code>timestamp</code> back to <code>timestamptz</code> again. Details: <ul> <li>Ignoring timezones altogether in Rails and PostgreSQL</li> </ul> <h3>Explanation step-by-step</h3> <ol> <li><code>now()</code> .. is the Postgres implementation for the SQL standard <code>CURRENT_TIMESTAMP</code>. Both are 100 % equivalent, you can use either. It returns the current point in time as <code>timestamptz</code> - the display of the value takes the time zone of the current session into consideration, but that's irrelevant for the value.</li> <li><code>now()</code> <code>AT TIME ZONE 'America/Santiago'</code> .. computes the local time for the given time zone. The resulting data type is <code>timestamp</code>. We do this to allow for:</li> <li><code>date_trunc(</code> <code>now() AT TIME ZONE 'America/Santiago'</code> <code>)</code> .. truncates the time component to get the local start of the day in 'America/Santiago', independent of the current time zone setting.</li> <li><code>date_trunc('day', now() AT TIME ZONE 'America/Santiago')</code> <code>AT TIME ZONE 'America/Santiago'</code> .. feeding the <code>timestamp</code> to the <code>AT TIME ZONE</code> construct we get the corresponding <code>timestamptz</code> value (UTC internally) to compare the <code>timestamptz</code> value <code>dtt</code> to.</li> </ol> I removed the <code>+ interval '1s'</code>, suspecting you have just been abusing that to convert the <code>date</code> to <code>timestamp</code>. Use <code>date_trunc()</code> instead to produce a <code>timestamp</code> value. Now, a plain (default) btree index on <code>dtt</code> will do. Of course, the index will only be used, if the predicate is selective enough. <pre class="prettyprint"><code>CREATE INDEX events_dtt_idx ON events (dtt); </code></pre> If your important queries only consider recent rows, a partial index might help some more. Details: <ul> <li>Get latest child per parent from big table - query is too slow</li> </ul>

Add an index to a timestamp with time zone

Tags:

timezone

sql

indexing

postgresql

count

I want to improve this slow query, I thinking to add an index, but I don't know what index type is better to my case.

SELECT COUNT(*) ct FROM events
WHERE dtt AT TIME ZONE 'America/Santiago'
   >= date(now() AT TIME ZONE 'America/Santiago') + interval '1s'

Query Plan:

"Aggregate  (cost=128032.03..128032.04 rows=1 width=0) (actual time=3929.083..3929.083 rows=1 loops=1)"
"  ->  Seq Scan on events  (cost=0.00..125937.68 rows=837742 width=0) (actual time=113.080..3926.972 rows=25849 loops=1)"
"        Filter: (timezone('America/Santiago'::text, dtt) >= (date(timezone('America/Santiago'::text, now())) + '00:00:01'::interval))"
"        Rows Removed by Filter: 2487386"
"Planning time: 0.179 ms"
"Execution time: 3929.136 ms"

The query gets the count of events of the day.
dtt is a timestamp with time zone column.
I'm using Postgresql 9.4.

Note: With the Erwin advices the query run a little faster but still I think isn't fast enough.

"Aggregate  (cost=119667.76..119667.77 rows=1 width=0) (actual time=3687.151..3687.152 rows=1 loops=1)"
"  ->  Seq Scan on vehicle_events  (cost=0.00..119667.14 rows=250 width=0) (actual time=104.635..3687.068 rows=469 loops=1)"
"        Filter: (dtt >= timezone('America/Santiago'::text, date_trunc('day'::text, timezone('America/Santiago'::text, now()))))"
"        Rows Removed by Filter: 2513337"
"Planning time: 0.164 ms"
"Execution time: 3687.204 ms"

613

asked Aug 17 '15 02:08

Goku

1 Answers

First, fix your query to make the predicate sargable:

SELECT count(*) AS ct
FROM   events
WHERE  dtt >= date_trunc('day', now() AT TIME ZONE 'America/Santiago')
                                      AT TIME ZONE 'America/Santiago'

Use the column value as is and move all calculations to the parameter.

That's right, after deriving the local start of the day, apply AT TIME ZONE a second time to convert the timestamp back to timestamptz again. Details:

Ignoring timezones altogether in Rails and PostgreSQL

Explanation step-by-step

now()
.. is the Postgres implementation for the SQL standard CURRENT_TIMESTAMP. Both are 100 % equivalent, you can use either. It returns the current point in time as timestamptz - the display of the value takes the time zone of the current session into consideration, but that's irrelevant for the value.
now() AT TIME ZONE 'America/Santiago'
.. computes the local time for the given time zone. The resulting data type is timestamp. We do this to allow for:
date_trunc( now() AT TIME ZONE 'America/Santiago' )
.. truncates the time component to get the local start of the day in 'America/Santiago', independent of the current time zone setting.
date_trunc('day', now() AT TIME ZONE 'America/Santiago') AT TIME ZONE 'America/Santiago'
.. feeding the timestamp to the AT TIME ZONE construct we get the corresponding timestamptz value (UTC internally) to compare the timestamptz value dtt to.

I removed the + interval '1s', suspecting you have just been abusing that to convert the date to timestamp. Use date_trunc() instead to produce a timestamp value.

Now, a plain (default) btree index on dtt will do. Of course, the index will only be used, if the predicate is selective enough.

CREATE INDEX events_dtt_idx ON events (dtt);

If your important queries only consider recent rows, a partial index might help some more. Details:

Get latest child per parent from big table - query is too slow

178

answered Sep 18 '22 20:09

Erwin Brandstetter

Related questions
                            
                                Handle NULL value in UNPIVOT
                            
                                Backup SQL Server database using WITH FORMAT
                            
                                Week of year calculation differences among Java and multi SQL RDBMS
                            
                                Left join with dynamic table name derived from column
                            
                                Materialized View: How to automatically refresh it upon table data changes?
                            
                                SqlAlchemy Reflection of Oracle Table Not Owned
                            
                                Object with id was not of the specified subclass
                            
                                linq to sql startwith performance indexed columns
                            
                                Error: "Multiple columns are specified in an aggregated expression containing an outer reference."
                            
                                Is 'length() IS NULL' equivalent and faster than 'IS NULL' for BLOBs?
                            
                                Set EXECUTE sp_executesql result into a variable in sql
                            
                                How to check progress of long running insertions in oracle
                            
                                adding a value to a column from data in next row sql
                            
                                How to deal with Unicode replacement character � (0xFFFD / 65533) in SQL
                            
                                Why are both SELECT count(PK) and SELECT count(*) so slow?
                            
                                SQL INSERT INTO WITH SELECT query
                            
                                NULL defaults to empty string in mysql?
                            
                                Is null-checking on Linq queries idiomatic?
                            
                                Updating the database using php
                            
                                SQL's `case when ...` code conversion using data.table package in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With