Should I obscure primary key values?

I'm building a web application where the front end is a highly specialized search engine. Searching is handled at the main URL, and when the user clicks on a search result they are handed off to a sub-directory for a more detailed display. This hand-off is done as a GET request with the primary key passed in the query string. I seem to recall reading somewhere that exposing primary keys to the user was not a good idea, so I decided to implement reversible encryption.

I'm starting to wonder if I'm just being paranoid. The reversible encryption (base64) is probably easily broken by anybody who cares to try, and it makes the URLs both ugly and longer than they otherwise would be. Should I just drop the encryption and send my primary keys in the clear?

asked Dec 13 '09 05:12

Scott

2 Answers

What you're doing is basically obfuscation. A reversibly encoded primary key is still a primary key, and base64 is an encoding, not encryption: anyone can decode it without a key.
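To see why base64 only obfuscates, here is a minimal sketch using Python's standard `base64` module. The helper names `encode_id` and `decode_id` are made up for illustration and are not from the question.

```python
import base64


def encode_id(pk: int) -> str:
    """Base64-encode an integer key for a URL (obfuscation only)."""
    return base64.urlsafe_b64encode(str(pk).encode("ascii")).decode("ascii")


def decode_id(token: str) -> int:
    """Reverse encode_id. Note that no key or secret is involved."""
    return int(base64.urlsafe_b64decode(token.encode("ascii")).decode("ascii"))


token = encode_id(123)
print(token)             # MTIz
print(decode_id(token))  # 123
```

Anyone who recognises the format can run the same decode step, so the key is exactly as exposed as before.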

What you were reading comes down to this: you generally don't want your primary keys to have any kind of meaning outside the system. A key with no outside meaning is called a technical (or surrogate) primary key, as opposed to a natural primary key. That's why you might use an auto number field for Patient ID rather than SSN, which would be a natural primary key.

Technical primary keys are generally favoured over natural primary keys because things that seem constant do change and this can cause problems. Even countries can come into existence and cease to exist.
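As a rough illustration of the difference, here is a sketch using an in-memory SQLite database. The `patient` and `visit` tables, their columns, and the placeholder SSN values are all assumptions made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# id is the technical key; ssn is ordinary data that happens to be unique.
conn.execute(
    """
    CREATE TABLE patient (
        id   INTEGER PRIMARY KEY AUTOINCREMENT,
        ssn  TEXT NOT NULL UNIQUE,
        name TEXT NOT NULL
    )
    """
)
# Other tables reference the technical key, never the SSN.
conn.execute(
    """
    CREATE TABLE visit (
        id         INTEGER PRIMARY KEY AUTOINCREMENT,
        patient_id INTEGER NOT NULL REFERENCES patient(id),
        reason     TEXT NOT NULL
    )
    """
)

cur = conn.execute(
    "INSERT INTO patient (ssn, name) VALUES (?, ?)",
    ("000-00-0001", "Pat Example"),
)
patient_id = cur.lastrowid
conn.execute(
    "INSERT INTO visit (patient_id, reason) VALUES (?, ?)",
    (patient_id, "checkup"),
)

# The "constant" value turns out to be wrong: only one row has to change,
# and every visit still points at the same patient.
conn.execute(
    "UPDATE patient SET ssn = ? WHERE id = ?", ("000-00-0002", patient_id)
)
row = conn.execute(
    "SELECT p.ssn, v.reason FROM visit v JOIN patient p ON p.id = v.patient_id"
).fetchone()
print(row)  # ('000-00-0002', 'checkup')
```

Had `ssn` been the primary key, the same correction would have needed to cascade through every table that referenced it.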

If you do have technical primary keys, you don't want to make them de facto natural primary keys by giving them meaning they didn't otherwise have. I think it's fine to put a primary key in a URL, but security is a separate topic. If someone can change that URL and get access to something they shouldn't have access to, then it's a security problem and needs to be handled by authentication and authorization.
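A minimal sketch of that idea, assuming a hypothetical in-memory store and a `current_user` value that the surrounding framework has already authenticated. None of these names come from a real library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    id: int
    owner: str
    body: str


class NotFound(Exception):
    """No record exists with the requested id."""


class Forbidden(Exception):
    """The record exists but the caller may not see it."""


# Hypothetical stand-in for the real database.
RECORDS = {
    1: Record(1, "alice", "alice's details"),
    2: Record(2, "bob", "bob's details"),
}


def get_record(record_id: int, current_user: str) -> Record:
    """Look up a record by primary key, then check access on every request."""
    record = RECORDS.get(record_id)
    if record is None:
        raise NotFound(record_id)
    if record.owner != current_user:
        # Knowing or guessing the id is never enough on its own.
        raise Forbidden(record_id)
    return record


print(get_record(1, "alice").body)  # alice's details
```

With a check like this in place, editing the id in the URL only ever reaches records the user was allowed to see anyway, whether or not the id is obscured.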

Some will argue that primary keys should never be seen by users. I don't think you need to go that far.

answered Sep 19 '22 00:09

cletus

On the dangers of exposing your primary key, you'll want to read "autoincrement considered harmful", by Joshua Schachter:

URLs that include an identifier will let you down for three reasons.

The first is that given the URL for some object, you can figure out the URLs for objects that were created around it. This exposes the number of objects in your database to possible competitors or other people you might not want having this information (as famously demonstrated by the Allies guessing German tank production levels by looking at the serial numbers.)

Secondly, at some point some jerk will get the idea to write a shell script with a for-loop and try to fetch every single object from your system; this is definitely no fun.

Finally, in the case of users, it allows people to derive some sort of social hierarchy. Witness the frequent hijacking and/or hacking of high-prestige low-digit ICQ ids.
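To make the first point concrete, here is a small sketch of the classic estimator behind the tank example. It assumes ids start at 1, are handed out sequentially without gaps, and that the observed ids are a random sample; `estimate_total` is a name made up for this illustration.

```python
def estimate_total(observed_ids: list[int]) -> float:
    """Estimate how many objects exist from a few observed sequential ids.

    Uses the standard estimate m + m/k - 1, where m is the largest id seen
    and k is the number of ids seen.
    """
    k = len(observed_ids)
    if k == 0:
        raise ValueError("need at least one observed id")
    m = max(observed_ids)
    return m + m / k - 1


# Four ids picked up from public URLs:
print(estimate_total([19, 40, 42, 60]))  # 74.0
```

A handful of leaked ids is enough for a reasonable guess at the size of the table, which is the information the quote warns about exposing.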

answered Sep 17 '22 00:09

John Wiseman

Tags: language-agnostic, web-applications, primary-key
