I am trying to stick to the practice of keeping the database normalized, but that leads to the need to run multiple join queries. Is there a performance degradation if many queries use joins vs having a call to a single table that might contain redundant data?
Even though the join order has no impact on the final result, it still affects performance. The optimizer will therefore evaluate the possible join-order permutations and select the best one, which means that merely optimizing a complex statement can itself become a performance problem.
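As a small illustration (the tables and columns here are invented for the sketch), the three-table join below can be executed in several orders without changing the result, but each order can have a very different cost, and the optimizer has to weigh them:

-- Hypothetical schema: customers, orders, order_items.
-- The optimizer may start with whichever table it estimates will shrink the
-- intermediate result fastest; the final rows are the same either way.
SELECT c.name, o.order_date, i.product_id
FROM customers   c
JOIN orders      o ON o.customer_id = c.customer_id
JOIN order_items i ON i.order_id    = o.order_id
WHERE c.country = 'DE';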
If the joining column is UNIQUE and marked as such, both these queries yield the same plan in SQL Server. If it's not, then IN is faster than JOIN on DISTINCT.
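The two queries being compared are not shown in this excerpt; a plausible reconstruction (table and column names assumed) is an IN subquery versus a JOIN followed by DISTINCT:

-- Variant 1: semi-join expressed with IN
SELECT p.*
FROM products p
WHERE p.category_id IN (SELECT c.category_id
                        FROM categories c
                        WHERE c.active = 1);

-- Variant 2: JOIN, with DISTINCT to remove duplicates the join may introduce
SELECT DISTINCT p.*
FROM products p
JOIN categories c ON c.category_id = p.category_id
WHERE c.active = 1;

-- If categories.category_id is declared UNIQUE (e.g. as a primary key),
-- SQL Server can compile both to the same plan; if it is not, the IN form
-- avoids the extra de-duplication step that DISTINCT requires.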
Joins: If your query joins two tables in a way that substantially increases the row count of the result set, your query is likely to be slow. There's an example of this in the subqueries lesson. Aggregations: Combining multiple rows to produce a result requires more computation than simply retrieving those rows.
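For instance (schema invented for illustration), joining a one-row-per-user table to an events table that holds many rows per user multiplies the rows the engine has to carry through the rest of the query:

-- users has one row per user; events may hold thousands of rows per user.
-- The join produces one output row per (user, event) pair, so the
-- intermediate result can be far larger than either input table.
SELECT u.user_id, u.signup_date, e.event_type, e.occurred_at
FROM users  u
JOIN events e ON e.user_id = u.user_id;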
The problem is that joins are relatively slow, especially over very large data sets, and if they are slow your website is slow. It takes a long time to get all those separate bits of information off disk and put them all together again.
Keep the Database normalised UNTIL you have discovered a bottleneck. Then only after careful profiling should you denormalise.
In most instances, having a good covering set of indexes and up-to-date statistics will solve most performance and blocking issues without any denormalisation.
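As a sketch of what a covering index looks like (the index, table, and column names are hypothetical, and the INCLUDE syntax shown is SQL Server's):

-- Covers a query of the form:
--   SELECT order_date, total FROM orders WHERE customer_id = @id;
-- The key column supports the seek, and the INCLUDEd columns let the query
-- be answered from the index alone, without a lookup into the base table.
CREATE NONCLUSTERED INDEX IX_orders_customer
    ON orders (customer_id)
    INCLUDE (order_date, total);

-- Keeping statistics current helps the optimizer estimate row counts accurately.
UPDATE STATISTICS orders;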
Using a single table could lead to worse performance if there are writes as well as reads against it.
Michael Jackson (not that one) is famously believed to have said:
The First Rule of Program Optimization: Don't do it.
The Second Rule of Program Optimization (for experts only!): Don't do it yet.
That was probably before RDBMSs were around, but I think he'd have extended the Rules to include them.
Multi-table SELECTs are almost always needed with a normalised data model; as is often the case with this kind of question, the "correct" answer to the "denormalise?" question depends on several factors.
DBMS platform
The relative performance of multi- vs single-table queries is influenced by the platform on which your application lives: the level of sophistication of the query optimisers can vary. MySQL, for example, in my experience, is screamingly fast on single-table queries but doesn't optimise queries with multiple joins so well. This isn't a real issue with smaller tables (less than 10K rows, say) but really hurts with large (10M+) ones.
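One way to check how MySQL will handle a multi-join query (the query and table names here are assumed for illustration) is to look at the plan it produces:

-- EXPLAIN reports the join order MySQL chose, the indexes it intends to use,
-- and a rough estimate of how many rows it will examine per table.
EXPLAIN
SELECT c.name, o.order_date
FROM customers c
JOIN orders    o ON o.customer_id = c.customer_id
WHERE o.order_date >= '2024-01-01';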
Data volume
Unless you're looking at tables in the 100K+ row region, there pretty much shouldn't be a problem. If you're looking at table sizes in the hundreds of rows, I wouldn't even bother thinking about indexing.
(De-)normalisation
The whole point of normalisation is to minimise duplication, to try to ensure that any field value that must be updated need only be changed in one place. Denormalisation breaks that, which isn't much of a problem if updates to the duplicated data are rare (ideally they should never occur). So think very carefully before duplicating anything but the most static data. Note that your database may grow significantly.
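A minimal sketch of the trade-off (schema invented for illustration): in a normalised design a supplier's phone number lives in exactly one row, while in a denormalised design the same value is copied onto every product row and every copy must be kept in sync:

-- Normalised: the phone number exists once, so one row changes.
UPDATE suppliers
SET    phone = '+44 20 7946 0000'
WHERE  supplier_id = 42;

-- Denormalised: the same fact is duplicated across many product rows, so a
-- single change fans out into a multi-row update (and a chance to miss one).
UPDATE products
SET    supplier_phone = '+44 20 7946 0000'
WHERE  supplier_id = 42;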
Requirements/Constraints
What performance requirements are you trying to meet? Do you have fixed hardware or a budget? Sometimes a performance boost can be most easily - and even most cheaply - achieved by a hardware upgrade. What transaction volumes are you expecting? A small-business accounting system has a very different profile to, say, Twitter.
One last thought strikes me: if you denormalise enough, how is your database different from a flat file? SQL is superb for flexible data and multi-dimensional retrieval, but it can be an order of magnitude (at least) slower than a straight sequential or fairly simply indexed file.