SQL Table linking... is it better to have a linking table, or a delimited column?

Tags:

My database has two tables, one contains a list of users, the other a list of roles. Each user will belong to one or more roles, and of course each role will have multiple users in it.

I've come across two ways to link the information. The first is to add a third table which contains the ID's from both tables. A simple join will then return all the users that belong to a role, or all the roles to which a user belongs. However, as the database grows, the datasets returned by these simple queries will grow exponentially.

The second method is to add a column to the users table in which a delimited list of roles is stored. This will eliminate the need for the third linking table, which may have a positive effect on database growth. The downside is that SQL does not have the ability to use delimited lists. The only way I've found to process that information is to use a temporary table and a custom function.

Is viewing my execution plans, the "table scan" event is the one that takes the most resources. It makes sense that eliminating a table from the equation would speed things up. The function takes up less than 1% of the resources.

These tests were done on a database with less than 20 records. As the size of the database grows, the table scans will take longer, so perhaps limiting them is the best choice.

If using the delimited list is a good way to go, why is nobody doing it?

Please tell me which is your preferred method (even if it's different from my two) and why.

Thank you.

828

asked Jan 21 '10 15:01

RichieACC

1 Answers

If you have a delimited list, finding users with a given role is going to become very expensive: effectively, you need to do a FULL scan of that table, and look at all the values for that column in every row, trying to see if it contains a given role.

A separate table (normalized, many to many relation) is the way to go, and with proper indexing you will not have full scans happening.

eg:

User:  UserId, Name, ....
Role:  RoleId, Name, ....
UserRole:  UserRoleId, UserId, RoleId

(UserRoleId is optional, you could alternatively have the PK be UserId+RoleId, I won't get into the discussion here of surrogate vs compound keys here)

You'll want an index on (UserId, RoleId) that is UNIQUE, to enforce no duplicates. This will also help with any queries where you're trying to see if a specific user has a specific role (WHERE userId = x AND roleId = y)

If you are looking up all the roles a user has, you'll want an index on just UserId.

Conversely, if you are looking up all the users a given role has, an index on just roleId will speed that up. If you don't do this query, or do it very rarely, then not having this index will speed up performance slightly for insert/updates, as it is one less thing to do. This is the careful balancing act that is database tuning.

answered Oct 27 '22 11:10

gregmac

Related questions
                            
                                SQL Query to sum fields from different tables
                            
                                Use function as default value for column in Oracle11g
                            
                                PHP/MySQL: Sort by time, then by date
                            
                                SQL select statement with where clause
                            
                                Microsoft OLE DB Provider for SQL Server error '80004005'
                            
                                In SQL, how to delete a row from one table if it doesn't have a corresponding row in another table?
                            
                                How to check for the existence of a DB?
                            
                                SQL: Limit on CASE (number of WHEN, THEN conditions)
                            
                                Three table join with joins other than INNER JOIN
                            
                                sql-server: how do i know who is in my database?
                            
                                How to export a csv without header in SSRS
                            
                                Sending null parameters to Sql Server
                            
                                sql update random between two dates
                            
                                What are "SQL-Hints"?
                            
                                SQL: ORDER BY two columns intermixed, not priority based
                            
                                Last id value in a table. SQL Server
                            
                                How to use LIKE in a t-sql dynamic statement in a stored procedure?
                            
                                COUNT of DISTINCT items in a column
                            
                                Using Excel like solver in Python or SQL
                            
                                How to express a range over multiple columns with hierarchic relation?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SQL Table linking... is it better to have a linking table, or a delimited column?

Tags:

sql

database

tsql

database-design

RichieACC

People also ask

1 Answers

gregmac

Recent Activity

Donate For Us