Many-to-many relationship: use associative table or delimited values in a column?

Tags:

Update 2009.04.24

The main point of my question is not developer confusion and what to do about it.

The point is to understand when delimited values are the right solution.

I've seen delimited data used in commercial product databases (Ektron lol).

SQL Server even has an XML datatype, so that could be used for the same purpose as delimited fields.

/end Update

The application I'm designing has some many-to-many relationships. In the past, I've often used associative tables to represent these in the database. This has caused some confusion to the developers.

Here's an example DB structure:

Document
---------------
ID (PK)
Title
CategoryIDs (varchar(4000))


Category
------------
ID (PK)
Title

There is a many-to-many relationship between Document and Category.

In this implementation, Document.CategoryIDs is a big pipe-delimited list of CategoryIDs.

To me, this is bad because it requires use of substring matching in queries -- which cannot make use of indexes. I think this will be slow and will not scale.

With that model, to get all Documents for a Category, you would need something like the following:

select * from documents where categoryids like '%|' + @targetCategoryId + '|%'

My solution is to create an associative table as follows:

Document_Category
-------------------------------
DocumentID (PK)
CategoryID (PK)

This is confusing to the developers. Is there some elegant alternate solution that I'm missing?

I'm assuming there will be thousands of rows in Document. Category may be like 40 rows or so. The primary concern is query performance. Am I over-engineering this?

Is there a case where it's preferred to store lists of IDs in database columns rather than pushing the data out to an associative table?

Consider also that we may need to create many-to-many relationships among documents. This would suggest an associative table Document_Document. Is that the preferred design or is it better to store the associated Document IDs in a single column?

Thanks.

264

asked Apr 24 '09 17:04

frankadelic

1 Answers

This is confusing to the developers.

Get better developers. That is the right approach.

193

answered Sep 17 '22 22:09

Tom Ritter

Related questions
                            
                                How do you do date math that ignores the year?
                            
                                Index autoincrement for Microsoft SQL Server 2008 R2
                            
                                Get top results for each group (in Oracle)
                            
                                for each in MS SQL SERVER?
                            
                                Why use the BETWEEN operator when we can do without it?
                            
                                Syntax error when running a MySQL CREATE TABLE statement [duplicate]
                            
                                Query the two cities in STATION with the shortest and longest CITY names, [closed]
                            
                                Don't understand error message: Must declare the scalar variable "@Username".
                            
                                Comparing DateTime structs to find free slots
                            
                                How do I select an extra row for each row in the result set in SQL?
                            
                                How do I help a person who wants relational database data in a CSV format?
                            
                                How can I generate a temporary table filled with dates in SQL Server 2000?
                            
                                Extract number from string with Oracle function
                            
                                Pivoting of data using two columns
                            
                                Why 'delete from table' takes a long time when 'truncate table' takes 0 time?
                            
                                SQL query need to get names where count(id) = 2
                            
                                Counting rows for all tables at once
                            
                                MySQL query to select events between start/end date
                            
                                DELETE*FROM table
                            
                                SQL query to find third highest salary in company

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Many-to-many relationship: use associative table or delimited values in a column?

Tags:

sql

database

architecture

database-design

many-to-many

frankadelic

People also ask

1 Answers

Tom Ritter

Recent Activity

Donate For Us