Mysql How do you create a clustered index?

Tags:

I'm reading all about how clustered indexes work, and think they would be beneficial to my app. I understand that primary keys are automatically clustered indexes, but how would you add a clustered index to a non-primary key column?

I.e. a datastore for user posts. Each post has a ID, but also has a user-id, but since users can post multiple times, the user-id is not a primary key. How would you add a clustered index to the user-id, and is that even a good idea?

562

asked Dec 08 '12 15:12

Wiz

3 Answers

According to Clustered and Secondary Indexes, you can have only one clustered index per table.

All indexes other than the clustered index are known as secondary indexes.

If a table has no primary index but another unique index, this is used as the clustered index.

If you do not define a PRIMARY KEY for your table, MySQL locates the first UNIQUE index where all the key columns are NOT NULL and InnoDB uses it as the clustered index.

So, I would conclude, that you don't add a clustered index yourself, but MySQL chooses either the primary or the first unique index of a table as the clustered index.

If you haven't defined a primary or unique index, MySQL creates an index itself

If the table has no PRIMARY KEY or suitable UNIQUE index, InnoDB internally generates a hidden clustered index named GEN_CLUST_INDEX on a synthetic column containing row ID values. The rows are ordered by the ID that InnoDB assigns to the rows in such a table. The row ID is a 6-byte field that increases monotonically as new rows are inserted. Thus, the rows ordered by the row ID are physically in insertion order.

177

answered Oct 17 '22 04:10

Olaf Dietsche

As @Olaf describes, InnoDB chooses which column or column combination will be the clustered index (the primary key, or the first unique index if there is not a primary key, or a hidden column if there is none of the two).

If you want to have a non-unique column as the clustered index, you could define the post_id as a unique key and make the combination of user_id and post_id the primary key which will be chosen as the clustered index:

CREATE TABLE Post
( post_id INT NOT NULL AUTO_INCREMENT
, user_id INT NOT NULL
--- other columns
, CONSTRAINT Post_PK
    PRIMARY KEY (user_id, post_id)     -- your clustered index
, CONSTRAINT post_id_UQ
    UNIQUE (post_id)             -- you still want uniqueness for the `post_id`
) ENGINE = InnoDB ;

Whether this is a good idea or not depends on your application, the data volumes and the queries you have. In general the best properties of a clustered key are unique, narrow, static and ever-increasing. That's why auto-incrementing columns are best. Read about it in Kimberly L. Tripp's blog articles: Ever-increasing clustering key - the Clustered Index Debate..........again! and The Clustered Index Debate Continues... (don't stop because they are for SQL-Server, the same issues apply 99% to InnoDB's clustered indexing)

A clustered key like the (user_id, post_id) has the first 3 properties but it is not ever-increasing. This will result in fragmentation of the CI and possibly slower insertions into the table.

It will however result in more efficient queries that have WHERE user_id = ? conditions or range conditions WHERE user_id BETWEEN ? AND ? or GROUP BY user_id groupings as the wanted data will be found in the clustered index in one place and in the required order.

I suggest you make tests to choose which is best in your case.

There is also a MySQL variant, TokuDB, that allows multiple clustered indexes in a table. Details in their article: Introducing Multiple Clustering Indexes

answered Oct 17 '22 06:10

ypercubeᵀᴹ

When you define a primary key for an InnoDB table, MySQL uses the primary key as the clustered index.

If you do not have a primary key for a table, MySQL will search for the first UNIQUE index where all the key columns are NOT NULL and use this UNIQUE index as the clustered index.

In case the InnoDB table has no primary key or suitable UNIQUE index, MySQL internally generates a hidden clustered index named GEN_CLUST_INDEX on a synthetic column which contains the row ID values.

answered Oct 17 '22 04:10

Gauravk

Related questions
                            
                                Logging in as a different user on MySQL
                            
                                Doctrine 2 mysql FIELD function in order by
                            
                                MySQL CASE...WHERE...THEN statements
                            
                                Can MySQL INT type be non-zero NULL
                            
                                Is there a reason not to use <=> (null safe equals operator) in mysql instead of =?
                            
                                mysql join with limit 1
                            
                                Cygwin connecting to MySQL: Can't connect to local MySQL server through socket '/var/run/mysql.sock'
                            
                                Find duplicates for combination of two columns in a MySQL database
                            
                                MySQL - Select last record from second table matching with first table
                            
                                select first N columns of MySQL table
                            
                                How do I insert multiple checkbox values into a table?
                            
                                Can't connect to MySQL server on (ip or domain name)
                            
                                SELECT row with MAX id from each GROUP BY (unique_id) and ORDER BY number
                            
                                mysql COUNT results with GROUP BY
                            
                                TRIGGERs that cause INSERTs to fail? Possible?
                            
                                Query output to a file gives access denied error
                            
                                Does an enum in MySQL need to be NOT NULL?
                            
                                Get table names from a database
                            
                                Limiting a left join to returning one result?
                            
                                How to check if a particular database in mysql already exists using java

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Mysql How do you create a clustered index?

Tags:

sql

mysql

clustered-index

Wiz

People also ask

3 Answers

Olaf Dietsche

ypercubeᵀᴹ

Gauravk

Recent Activity

Donate For Us