I need to change the primary key of a table to an identity column, and there's already a number of rows in table. I've got a script to clean up the IDs to ensure they're sequential starting at 1, works fine on my test database. What's the SQL command to alter the column to have an identity property?

In SQL 2005 and above, there's a trick to solve this problem without changing the table's data pages. This is important for large tables where touching every data page can take minutes or hours. The trick also works even if the identity column is a primary key, is part of a clustered or non-clustered index, or other gotchas which can trip up the the simpler "add/remove/rename column" solution. Here's the trick: you can use SQL Server's ALTER TABLE...SWITCH statement to change the schema of a table without changing the data, meaning you can replace a table with an IDENTITY with an identical table schema, but without an IDENTITY column. The same trick works to add IDENTITY to an existing column. Normally, ALTER TABLE...SWITCH is used to efficiently replace a full partition in a partitioned table with a new, empty partition. But it can also be used in non-partitioned tables too. I've used this trick to convert, in under 5 seconds, a column of a of a 2.5 billion row table from IDENTITY to a non-IDENTITY (in order to run a multi-hour query whose query plan worked better for non-IDENTITY columns), and then restored the IDENTITY setting, again in less than 5 seconds. Here's a code sample of how it works. <pre class="prettyprint"><code> CREATE TABLE Test ( id int identity(1,1), somecolumn varchar(10) ); INSERT INTO Test VALUES ('Hello'); INSERT INTO Test VALUES ('World'); -- copy the table. use same schema, but no identity CREATE TABLE Test2 ( id int NOT NULL, somecolumn varchar(10) ); ALTER TABLE Test SWITCH TO Test2; -- drop the original (now empty) table DROP TABLE Test; -- rename new table to old table's name EXEC sp_rename 'Test2','Test'; -- update the identity seed DBCC CHECKIDENT('Test'); -- see same records SELECT * FROM Test; </code></pre> This is obviously more involved than the solutions in other answers, but if your table is large this can be a real life-saver. There are some caveats: <ul> <li>As far as I know, identity is the only thing you can change about your table's columns with this method. Adding/removing columns, changing nullability, etc. isn't allowed. </li> <li>You'll need to drop foriegn keys before you do the switch and restore them after.</li> <li>Same for WITH SCHEMABINDING functions, views, etc. </li> <li>new table's indexes need to match exactly (same columns, same order, etc.)</li> <li>Old and new tables need to be on the same filegroup.</li> <li>Only works on SQL Server 2005 or later</li> <li>I previously believed that this trick only works on the Enterprise or Developer editions of SQL Server (because partitions are only supported in Enterprise and Developer versions), but Mason G. Zhwiti in his comment below says that it also works in SQL Standard Edition too. I assume this means that the restriction to Enterprise or Developer doesn't apply to ALTER TABLE...SWITCH.</li> </ul> There's a good article on TechNet detailing the requirements above. UPDATE - Eric Wu had a comment below that adds important info about this solution. Copying it here to make sure it gets more attention: <blockquote> There's another caveat here that is worth mentioning. Although the new table will happily receive data from the old table, and all the new rows will be inserted following a identity pattern, they will start at 1 and potentially break if the said column is a primary key. Consider running <code>DBCC CHECKIDENT('<newTableName>')</code> immediately after switching. See msdn.microsoft.com/en-us/library/ms176057.aspx for more info. </blockquote> If the table is actively being extended with new rows (meaning you don't have much if any downtime between adding IDENTITY and adding new rows, then instead of <code>DBCC CHECKIDENT</code> you'll want to manually set the identity seed value in the new table schema to be larger than the largest existing ID in the table, e.g. <code>IDENTITY (2435457, 1)</code>. You might be able to include both the <code>ALTER TABLE...SWITCH</code> and the <code>DBCC CHECKIDENT</code> in a transaction (or not-- haven't tested this) but seems like setting the seed value manually will be easier and safer. Obviously, if no new rows are being added to the table (or they're only added occasionally, like a daily ETL process) then this race condition won't happen so <code>DBCC CHECKIDENT</code> is fine.

Adding an identity to an existing column

2 Answers

You can't alter the existing columns for identity.

You have 2 options,

Create a new table with identity & drop the existing table
Create a new column with identity & drop the existing column

Approach 1. (New table) Here you can retain the existing data values on the newly created identity column. Note that you will lose all data if 'if not exists' is not satisfied, so make sure you put the condition on the drop as well!

CREATE TABLE dbo.Tmp_Names     (       Id int NOT NULL              IDENTITY(1, 1),       Name varchar(50) NULL     ) ON  [PRIMARY] go  SET IDENTITY_INSERT dbo.Tmp_Names ON go  IF EXISTS ( SELECT  *             FROM    dbo.Names )      INSERT  INTO dbo.Tmp_Names ( Id, Name )             SELECT  Id,                     Name             FROM    dbo.Names TABLOCKX go  SET IDENTITY_INSERT dbo.Tmp_Names OFF go  DROP TABLE dbo.Names go  Exec sp_rename 'Tmp_Names', 'Names'

Approach 2 (New column) You can’t retain the existing data values on the newly created identity column, The identity column will hold the sequence of number.

Alter Table Names Add Id_new Int Identity(1, 1) Go  Alter Table Names Drop Column ID Go  Exec sp_rename 'Names.Id_new', 'ID', 'Column'

See the following Microsoft SQL Server Forum post for more details:

How to alter column to identity(1,1)

170

answered Sep 28 '22 05:09

John Sansom

In SQL 2005 and above, there's a trick to solve this problem without changing the table's data pages. This is important for large tables where touching every data page can take minutes or hours. The trick also works even if the identity column is a primary key, is part of a clustered or non-clustered index, or other gotchas which can trip up the the simpler "add/remove/rename column" solution.

Here's the trick: you can use SQL Server's ALTER TABLE...SWITCH statement to change the schema of a table without changing the data, meaning you can replace a table with an IDENTITY with an identical table schema, but without an IDENTITY column. The same trick works to add IDENTITY to an existing column.

Normally, ALTER TABLE...SWITCH is used to efficiently replace a full partition in a partitioned table with a new, empty partition. But it can also be used in non-partitioned tables too.

I've used this trick to convert, in under 5 seconds, a column of a of a 2.5 billion row table from IDENTITY to a non-IDENTITY (in order to run a multi-hour query whose query plan worked better for non-IDENTITY columns), and then restored the IDENTITY setting, again in less than 5 seconds.

Here's a code sample of how it works.

 CREATE TABLE Test  (    id int identity(1,1),    somecolumn varchar(10)  );   INSERT INTO Test VALUES ('Hello');  INSERT INTO Test VALUES ('World');   -- copy the table. use same schema, but no identity  CREATE TABLE Test2  (    id int NOT NULL,    somecolumn varchar(10)  );   ALTER TABLE Test SWITCH TO Test2;   -- drop the original (now empty) table  DROP TABLE Test;   -- rename new table to old table's name  EXEC sp_rename 'Test2','Test';   -- update the identity seed  DBCC CHECKIDENT('Test');   -- see same records  SELECT * FROM Test;

This is obviously more involved than the solutions in other answers, but if your table is large this can be a real life-saver. There are some caveats:

As far as I know, identity is the only thing you can change about your table's columns with this method. Adding/removing columns, changing nullability, etc. isn't allowed.
You'll need to drop foriegn keys before you do the switch and restore them after.
Same for WITH SCHEMABINDING functions, views, etc.
new table's indexes need to match exactly (same columns, same order, etc.)
Old and new tables need to be on the same filegroup.
Only works on SQL Server 2005 or later
I previously believed that this trick only works on the Enterprise or Developer editions of SQL Server (because partitions are only supported in Enterprise and Developer versions), but Mason G. Zhwiti in his comment below says that it also works in SQL Standard Edition too. I assume this means that the restriction to Enterprise or Developer doesn't apply to ALTER TABLE...SWITCH.

There's a good article on TechNet detailing the requirements above.

UPDATE - Eric Wu had a comment below that adds important info about this solution. Copying it here to make sure it gets more attention:

There's another caveat here that is worth mentioning. Although the new table will happily receive data from the old table, and all the new rows will be inserted following a identity pattern, they will start at 1 and potentially break if the said column is a primary key. Consider running DBCC CHECKIDENT('<newTableName>') immediately after switching. See msdn.microsoft.com/en-us/library/ms176057.aspx for more info.

If the table is actively being extended with new rows (meaning you don't have much if any downtime between adding IDENTITY and adding new rows, then instead of DBCC CHECKIDENT you'll want to manually set the identity seed value in the new table schema to be larger than the largest existing ID in the table, e.g. IDENTITY (2435457, 1). You might be able to include both the ALTER TABLE...SWITCH and the DBCC CHECKIDENT in a transaction (or not-- haven't tested this) but seems like setting the seed value manually will be easier and safer.

Obviously, if no new rows are being added to the table (or they're only added occasionally, like a daily ETL process) then this race condition won't happen so DBCC CHECKIDENT is fine.

answered Sep 28 '22 05:09

Justin Grant

Related questions
                            
                                Cannot insert explicit value for identity column in table 'table' when IDENTITY_INSERT is set to OFF
                            
                                MySQL error: key specification without a key length
                            
                                Nested select statement in SQL Server
                            
                                SQL query return data from multiple tables
                            
                                how can I Update top 100 records in sql server
                            
                                How to print a query string with parameter values when using Hibernate
                            
                                Using LIMIT within GROUP BY to get N results per group?
                            
                                Delete all Duplicate Rows except for One in MySQL? [duplicate]
                            
                                When to use "ON UPDATE CASCADE"
                            
                                Exporting data In SQL Server as INSERT INTO
                            
                                What is the syntax for an inner join in LINQ to SQL?
                            
                                How to create id with AUTO_INCREMENT on Oracle?
                            
                                How to declare a variable in MySQL?
                            
                                Difference between EXISTS and IN in SQL?
                            
                                How to convert java.util.Date to java.sql.Date?
                            
                                How do I (or can I) SELECT DISTINCT on multiple columns?
                            
                                What is an index in SQL?
                            
                                Is it possible to specify condition in Count()?
                            
                                How to delete duplicate rows in SQL Server?
                            
                                Error related to only_full_group_by when executing a query in MySql

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Adding an identity to an existing column

Tags:

sql

sql-server

alter-table

identity-column

Kirschstein

People also ask

2 Answers

John Sansom

Justin Grant

Recent Activity

Donate For Us