ID Best Practices for Databases [closed]

Tags:

I was wondering what the best practices were for building and storing IDs. A few years ago, a professor told me about the dangers of a poorly constructed ID system, using the Social Security Number as an example. In particular, because SSNs do not have any error detection... it is impossible to tell the difference between a 9-digit string and a valid SSN. And now government agencies need things like Last Name + SSN or Birthday + SSN to keep track of your data and ensure its verification. Plus, your Social Security number is somewhat predictable based on where you were born.

Now I'm building a User database... and based off of this advice "userid mediumint auto_increment" would be unacceptable. Especially if I plan to use this ID as the primary identification for the user. (for example, if I allow the users to change their username, then the username would be more difficult to keep track than the numerical userid... requiring cascading foreign keys and whatnot.) Emails change, usernames can change, passwords change... but a userid should remain constant forever.

Clearly, auto_increment is only designed for surrogate_keys. That is, its a useful shortcut only when you already have a primary identification mechanism, but it shouldn't be used as an "innate identifier" for the data. Creating random UUID looks interesting, but the randomness turns me off.

And so I ask: whats the best practices for creating a "primary key" identification number?

837

asked Dec 03 '10 22:12

Dragontamer5788

2 Answers

You are confusing internal database functionality with external search criteria.

Auto-increment surrogate keys are useful for internal application use. Never pass those on to the user. Identifying business objects, whether it is a user or an invoice, are done with unique information about the object, like SSN, CCN or DOB. Use as much info as necessary to uniquely identify the object.

I highly recommend that if you must supply some newly invented ID value to each customer, that it NOT be the field you link all the customer data tables on.

176

answered Oct 11 '22 11:10

Bill

The best practice is to use an auto-increment integer. There's no real reason it shouldn't be used as an "innate identifier". It'll provide the most compact usage in foreign keys and fastest searches. Almost any other value can change and is inappropriate for use as a key.

answered Oct 11 '22 13:10

Samuel Neff

Related questions
                            
                                ORA-12899 value too large for column despite of same length
                            
                                SQL doesnt differentiate u and ü although collation is utf8mb4_unicode_ci
                            
                                Postgres: array_agg throws 'cannot accumulate empty arrays' for empty array(s)
                            
                                Postgres upsert using results from select
                            
                                ORA-01735: invalid ALTER TABLE option - Toad
                            
                                Get Value From Json object contain table column using SQL Query
                            
                                How to write multi line sql query (nodejs)
                            
                                Alternatives of array_agg() or string_agg() on redshift
                            
                                1:1 Foreign Key Constraints
                            
                                Performance Tuning SQL - How?
                            
                                Large Text and Images In SQL
                            
                                Store and retrieve a multidimensional array using php and mysql
                            
                                Select Rows with Maximum Column Value group by Another Column
                            
                                SQL Compact (CE) problem with creating foreign key
                            
                                atomic compare and swap in a database
                            
                                MySQL SELECT INTO Equivalent?
                            
                                SQL Bulk Stored Procedure call C#
                            
                                Deleting from table with millions of records
                            
                                Most efficient way to count rows of a query
                            
                                What Java data type corresponds to the Oracle SQL data type NUMERIC?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

ID Best Practices for Databases [closed]

Tags:

language-agnostic

sql

auto-increment

identity

surrogate-key

Dragontamer5788

People also ask

2 Answers

Bill

Samuel Neff

Recent Activity

Donate For Us