How do you merge tables with autonumber primary keys?

Tags:

I suppose everyone runs into this problem once in a while: you have two tables that have autonumber primary keys that need to be merged. There are many good reasons why autonumber primary keys are used in favour of say application-generated keys, but merging with other tables must be one of the biggest drawbacks.

Some problems that arise are overlapping ids and out of sync foreign keys. I would like to hear your approach for tackling this. I always run into problems, so I'm very curious if anybody has some sort of a general solution.

-- EDIT --

In response to the answers suggesting to use guids or other non-numeric keys, there are situations where in advance it just seems a better idea to use autonumber keys (and you regret this later), or you're taking over someone else's project, or you get some legacy database that you have to work with. So I'm really looking for a solution where you have no control over the database design anymore.

605

asked Sep 29 '10 18:09

Carvellis

2 Answers

Solutions include:

Use GUIDs as primary keys instead of a simpler identity field. Very likely to avoid overlaps, but GUIDs are harder to use and don't play nicely with clustered indexes.
Make the primary key into a multi-column key, the second column resolving overlapping values by identifying the source of the merged data. Portable, works better with clustered indexes, but developers hate multi-column keys.
Use natural keys instead of pseudokeys.
Allocate new primary key values for one of the merged tables, and cascade these changes to any dependent rows. This changes a merge operation into an ETL operation. This is the only solution you can use for legacy data, if you can't change the database design.

I'm not sure there's a one-size-fits-all solution. Choose one of these based on the situation.

195

answered Sep 30 '22 18:09

Bill Karwin

Hm, I'm kind of enthousiastic about the idea that I just put in a comment at AlexKuznetsov's answer, so I'll make a whole answer about it.

Consider the tables to be named table1 and table2, with id1 and id2 as autonumber primary keys. They will be merged to table3 with id3 (a non-autonumber primary key).

Why not:

Remove all foreign key constraints to table1 and table2
For all foreign key fields referring to table1, execute an UPDATE table SET id1 = id1 * 2, and for FK fields referring to table2, execute an UPDATE table SET id2 = (id2) * 2 + 1
Fill table3 by executing an INSERT INTO table3 SELECT id1 * 2 AS id3, ... FROM table1 UNION ALL SELECT id2 * 2 + 1 AS id3 FROM table2
Create new foreign key constraints to table3

It can even work with 3 or more tables, just by using a higher multiplier.

answered Sep 30 '22 18:09

littlegreen

Related questions
                            
                                Disable password policy in Sql Server Docker container
                            
                                How to return multiple tables as one XML?
                            
                                Entity Framework Core 3.0 query causes "SqlException: 'Execution Timeout Expired'" and tempdb become full. Works with EF Core 2.2.6
                            
                                Select products where the category belongs to any category in the hierarchy
                            
                                Why, when I impersonate within a WCF service, can my service not load System.Transactions when I try to run a LINQ to SQL query?
                            
                                How can I run sql server stored procedures in parallel?
                            
                                Log changes made to all fields in a table to another table (SQL Server 2005)
                            
                                Automatically print SSRS report?
                            
                                Select Parent Record With All Children in SQL
                            
                                How do I get Linq to SQL to recognize the result set of a dynamic Stored Procedure?
                            
                                SSIS Multicast - Wait for one fork to finish before executing next fork
                            
                                Is there a way to multithread a SqlDataReader?
                            
                                How can I do a Cascading Delete with the SQL 2008 HierarchyID data type?
                            
                                SQL Error: The multi-part identifier "tableName.ColumnName" could not be bound
                            
                                Is it possible to create a Unique ID in an SQL Server View that will remain the same each time the view is called?
                            
                                MS SQL datetime precision problem
                            
                                What would be the best way to store the questions and responses for a survey where I need to keep the traffic on the database to a minimum?
                            
                                Alter stored procedure if condition is met
                            
                                SQL Server stored procedure return code oddity
                            
                                SQL Server Profiler - Evaluating Reads. What is considered 'good' or 'bad'?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do you merge tables with autonumber primary keys?

Tags:

sql-server

database-design

identity-column

Carvellis

People also ask

2 Answers

Bill Karwin

littlegreen

Recent Activity

Donate For Us