I have a database used by several clients. I don't really want surrogate incremental key values to bleed between clients. I want the numbering to start from 1 and be client specific. I'll use a two-part composite key of the <code>tenant_id</code> as well as an incremental id. What is the best way to create an incremental key per tenant? I am using SQL Server Azure. I'm concerned about locking tables, duplicate keys, etc. I'd typically set the primary key to IDENTITY and move on. Thanks

If you're looking to duplicate the convenience of having an automatically assigned unique INT key upon insert, you could add an <code>INSTEAD OF INSERT</code> trigger that uses MAX of the existing column +1 to determine the next value. If the column with the identity value is the first key in an index, the MAX query will be a simple index seek, very efficient. Transactions will ensure that unique values are assigned but this approach will have different locking semantics than the standard identity column. IIRC, SQL Server can allocate a different identity value for each transaction that requests it in parallel and if a transaction is rolled back, the value(s) allocated to it are discarded. The MAX approach would only allow one transaction to insert rows into the table at a time. A related approach could be to have a dedicated key value table keyed by the table name, tenant ID and current identity value. It would require the same <code>INSTEAD OF INSERT</code> trigger and more boilerplate to query and keep that key table updated. It wouldn't improve parallel operations though; the lock would just be on a different table's record. One possibility to fix the locking bottleneck would be to include the current SPID in the key's value (now the identity key is a combination of sequential int and whatever SPID happened to allocate it and not simply sequential), use the dedicated identity value table and insert records there per SPID as necessary; the identity table PK would be (table name, tenant, SPID) and have a non-key column with the current sequential value. That way, each SPID would have its own dynamically allocated identity pool and would only ever have its own SPID specific records locked. Another downside is maintaining triggers that have to be updated whenever you change the columns in any of the special identity tables.

Best approach for multi-tenant primary keys

2 Answers

Are you planning on using SQL Azure Federations in the future? If so, the current version of SQL Azure Federations does not support the use of IDENTITY as part of a clustered index. See this What alternatives exist to using guid as clustered index on tables in SQL Azure (Federations) for more details.

If you haven't looked at Federations yet, you might want to check it out as it provides an interesting way to both shard the database and for tenant isolation within the database.

Depending upon your end goal, using Federations you might be able to use a GUID as the primary clustered index on the table and also use an incremental INT IDENTITY field on the table. This INT IDENTITY field could be shown to end-users. If you are federating on the TenantID each "Tenant table" effectively becomes a silo (as I understand it at least) so the use of IDENTITY on a field within that table would effectively be an ever increasing auto generated value which increments within a given Tenant.

When \ if data is merged together (combining data from multiple Tenants) you would wind up with collisions on this INT IDENTITY field (hence why IDENTITY isn't supported as a primary key in federations) but as long as you aren't using this field as a unique identifier within the system at large you should be ok.

135

answered Oct 02 '22 14:10

Tim Lentine

If you're looking to duplicate the convenience of having an automatically assigned unique INT key upon insert, you could add an INSTEAD OF INSERT trigger that uses MAX of the existing column +1 to determine the next value.

If the column with the identity value is the first key in an index, the MAX query will be a simple index seek, very efficient.

Transactions will ensure that unique values are assigned but this approach will have different locking semantics than the standard identity column. IIRC, SQL Server can allocate a different identity value for each transaction that requests it in parallel and if a transaction is rolled back, the value(s) allocated to it are discarded. The MAX approach would only allow one transaction to insert rows into the table at a time.

A related approach could be to have a dedicated key value table keyed by the table name, tenant ID and current identity value. It would require the same INSTEAD OF INSERT trigger and more boilerplate to query and keep that key table updated. It wouldn't improve parallel operations though; the lock would just be on a different table's record.

One possibility to fix the locking bottleneck would be to include the current SPID in the key's value (now the identity key is a combination of sequential int and whatever SPID happened to allocate it and not simply sequential), use the dedicated identity value table and insert records there per SPID as necessary; the identity table PK would be (table name, tenant, SPID) and have a non-key column with the current sequential value. That way, each SPID would have its own dynamically allocated identity pool and would only ever have its own SPID specific records locked.

Another downside is maintaining triggers that have to be updated whenever you change the columns in any of the special identity tables.

answered Oct 02 '22 12:10

Chris Smith

Related questions
                            
                                Join to an in-memory list efficiently
                            
                                Sql server 2012 fetch vs old row_number performance. What am I missing? Why is row_number 17x faster?
                            
                                How do I set a root password for a Cloud SQL instance in Google App Engine? ["Instance busy" error message]
                            
                                SQL: WITH clause with parameters?
                            
                                SQL Query Limit for DB2 AS/400 Version 4
                            
                                Postgresql 9.4 query gets progressively slower when joining TSTZRANGE with &&
                            
                                How to code a certain maths algorithm
                            
                                MS Access VBA Data Type Mismatch Error in SQL Query
                            
                                Convert OData to sql string
                            
                                Are there any existing, elegant, patterns for an optional TOP clause?
                            
                                MySQL json_search on numeric values
                            
                                DISTINCT and LAG window function
                            
                                Does MySQL or MariaDB have any kind of in-memory database?
                            
                                How to choose solutions for secure messaging front-end and HIPAA compliant database?
                            
                                In MySQL, what is the most effective query design for joining large tables with many to many relationships between the join predicates?
                            
                                Convert columns to rows in SQL [duplicate]
                            
                                SQL Return Null if One Column is Null (Opposite of COALESCE())
                            
                                Tuple Versioning and composite primary key
                            
                                List and Linq To Sql Performance Issue
                            
                                Difference between DROP USER and deleting a row from the mysql.user table

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Best approach for multi-tenant primary keys

Tags:

sql

primary-key

multi-tenant

Paul Deen

People also ask

2 Answers

Tim Lentine

Chris Smith

Recent Activity

Donate For Us