Creating an index on a timestamp to optimize query

I have a query of the following form:

SELECT * FROM MyTable WHERE Timestamp > [SomeTime] AND Timestamp < [SomeOtherTime]

I would like to optimize this query, and I am thinking about putting an index on Timestamp, but I am not sure whether this would help. Ideally I would like to make Timestamp a clustered index, but MySQL does not support clustered indexes, except for primary keys.

MyTable has 4 million+ rows.
Timestamp is actually of type INT.
Once a row has been inserted, it is never changed.
The number of rows with any given Timestamp is on average about 20, but could be as high as 200.
Newly inserted rows have a Timestamp that is greater than most of the existing rows, but could be less than some of the more recent rows.
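
For reference, the clustered layout mentioned above could be sketched roughly as follows. This assumes the InnoDB engine, which stores rows in primary-key order. Because Timestamp is not unique, the primary key needs a tie-breaker column. The table name MyTableClustered and the id and payload columns are hypothetical and not part of the real table.

```sql
-- Sketch only: hypothetical table illustrating a primary key that
-- clusters rows by Timestamp (InnoDB orders rows by primary key).
CREATE TABLE MyTableClustered (
    id        INT UNSIGNED NOT NULL AUTO_INCREMENT,  -- hypothetical tie-breaker
    Timestamp INT NOT NULL,
    payload   VARCHAR(255),                          -- hypothetical data column
    PRIMARY KEY (Timestamp, id),
    KEY idx_id (id)  -- InnoDB needs the AUTO_INCREMENT column to lead some index
) ENGINE=InnoDB;
```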

Would an index on Timestamp help me to optimize this query?

asked Jan 31 '12 22:01

DanielGibbs

3 Answers

No question about it. Without the index, your query has to look at every row in the table. With the index, the query will be pretty much instantaneous as far as locating the right rows goes. The price you'll pay is a slight performance decrease in inserts; but that really will be slight.
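
As a rough illustration, creating the index and running the range query might look like this. The index name idx_timestamp and the integer bounds are placeholders chosen for the example; the table and column names follow the question.

```sql
-- Hypothetical index name; table and column names are taken from the question.
CREATE INDEX idx_timestamp ON MyTable (Timestamp);

-- The range query from the question with placeholder bounds.
-- Timestamp is stored as an INT, so the bounds are plain integers.
SELECT *
FROM MyTable
WHERE Timestamp > 1327960800
  AND Timestamp < 1328047200;
```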

answered Oct 06 '22 10:10

Chris Nash

You should definitely use an index. MySQL has no clue what order those timestamps are in, and in order to find a record for a given timestamp (or timestamp range) it needs to look through every single record. And with 4 million of them, that's quite a bit of time! Indexes are your way of telling MySQL about your data -- "I'm going to look at this field quite often, so keep a list of where I can find the records for each value."

Indexes in general are a good idea for regularly queried fields. The main downsides are that they use extra storage space and add a little work to each write, so unless you're really tight on space, you should use them. If an index doesn't apply to a query, MySQL will simply ignore it.
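
One way to check both points, whether MySQL actually picks an index for a query and how much space the indexes take, is sketched below. The integer bounds are placeholders, and the sizes MySQL reports are approximations.

```sql
-- Ask the optimizer how it plans to run the query. The `key` column of the
-- output names the index it chose, or is NULL if it chose none.
EXPLAIN
SELECT *
FROM MyTable
WHERE Timestamp > 1327960800
  AND Timestamp < 1328047200;

-- Approximate size of the table's data and of its indexes, in MiB,
-- for the table in the currently selected database.
SELECT table_name,
       ROUND(data_length  / 1024 / 1024, 1) AS data_mib,
       ROUND(index_length / 1024 / 1024, 1) AS index_mib
FROM information_schema.tables
WHERE table_schema = DATABASE()
  AND table_name = 'MyTable';
```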

answered Oct 06 '22 11:10

kitti

I don't disagree with the importance of indexing to improve select query times, but if you can index on other keys (and form your queries to use those indexes), an index on timestamp may not be needed.

For example, if you have a table with timestamp, category, and userId, it may be better to create an index on userId instead. In a table with many different users this will considerably reduce the remaining set of rows in which to search by timestamp.

...and if I'm not mistaken, the advantage of this would be to avoid the overhead of updating the timestamp index on each insertion -- in a table with high insertion rates and highly unique timestamps this could be an important consideration.
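
A minimal sketch of that idea, assuming a hypothetical table named Events with the three columns from the example (the table name, index name, and literal values are made up for illustration):

```sql
-- Index on userId alone, as suggested above.
CREATE INDEX idx_user ON Events (userId);

-- Query shaped to use it: the userId equality narrows the candidate rows
-- through the index, and the timestamp range is then checked on those rows.
SELECT *
FROM Events
WHERE userId = 42
  AND timestamp > 1327960800
  AND timestamp < 1328047200;
```

A composite index on (userId, timestamp) could serve both conditions at once, but it would bring back the cost of maintaining timestamp values in an index on every insert.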

I'm struggling with the same problems of indexing based on timestamps and other keys. I still have testing to do so I can put proof behind what I say here. I'll try to post back based on my results.

A scenario for better explanation:

timestamp 99% unique
userId 80% unique
category 25% unique
- Indexing on timestamp will quickly reduce query results to 1% of the table size
- Indexing on userId will quickly reduce query results to 20% of the table size
- Indexing on category will quickly reduce query results to 75% of the table size
- Insertion with an index on timestamp will have high overhead **
- Although we know that our insertions will mostly have incrementing timestamps, I haven't seen any discussion of MySQL optimisation based on incremental keys.
- Insertion with an index on userId will have reasonably high overhead.
- Insertion with an index on category will have reasonably low overhead.

** I'm sorry, I don't know the calculated overhead of insertion with indexing.
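
The uniqueness percentages in a scenario like this can be measured rather than guessed. A rough sketch, again assuming a hypothetical Events table with these three columns:

```sql
-- Fraction of distinct values per column: values near 1 mean almost every
-- row has its own value, values near 0 mean many rows share each value.
SELECT COUNT(*)                             AS total_rows,
       COUNT(DISTINCT timestamp) / COUNT(*) AS timestamp_uniqueness,
       COUNT(DISTINCT userId)    / COUNT(*) AS userid_uniqueness,
       COUNT(DISTINCT category)  / COUNT(*) AS category_uniqueness
FROM Events;
```

On a large table this scans every row, so it is probably best run once against a copy or during a quiet period.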

answered Oct 06 '22 10:10

blackstrype

Tags: timestamp, optimization, indexing, mysql