Database design: one huge table or separate tables?

Tags:

Currently I am designing a database for use in our company. We are using SQL Server 2008. The database will hold data gathered from several customers. The goal of the database is to acquire aggregate benchmark numbers over several customers.

Recently, I have become worried with the fact that one table in particular will be getting very big. Each customer has approximately 20.000.000 rows of data, and there will soon be 30 customers in the database (if not more). A lot of queries will be done on this table. I am already noticing performance issues and users being temporarily locked out.

My question, will we be able to handle this table in the future, or is it better to split this table up into smaller tables for each customer?

Update: It has now been about half a year since we first created the tables. Following the advices below, I created a handful of huge tables. Since then, I have been experimenting with indexes and decided on a clustered index on the first two columns (Hospital code and Department code) on which we would have partitioned the table had we had Enterprise Edition. This setup worked fine until recently, as Galwegian predicted, performance issues are springing up. Rebuilding an index takes ages, users lock each other out, queries frequently take longer than they should, and for most queries it pays off to first copy the relevant part of the data into a temp table, create indices on the temp table and run the query. This is not how it should be. Therefore, we are considering to buy Enterprise Edition for use of partitioned tables. If the purchase cannot go through I plan to use a workaround to accomplish partitioning in Standard Edition.

568

asked May 04 '10 14:05

littlegreen

2 Answers

Start out with one large table, and then apply 2008's table partitioning capabilities where appropriate, if performance becomes an issue.

answered Sep 27 '22 17:09

Galwegian

Datawarehouses are supposed to be big (the clue is in the name). Twenty million rows is about medium by warehousing standards, although six hundred million can be considered large.

The thing to bear in mind is that such large tables have a different physics, like black holes. So tuning them takes a different set of techniques. The other thing is, users of a datawarehouse must understand that they are dealing with huge amounts of data, and so they must not expect sub-second response (or indeed sub-minute) for every query.

Partitioning can be useful, especially if you have clear demarcations such as, as in your case, CUSTOMER. You have to be aware that partitioning can degrade the performance of queries which cut across the grain of the partitioning key. So it is not a silver bullet.

answered Sep 27 '22 18:09

APC

Related questions
                            
                                Driver.getConnection hangs using SQLServer driver and Java 1.6.0_29
                            
                                Sharing data between SQL databases
                            
                                Random Weighted Choice in T-SQL
                            
                                Does creating a unique constraint on a column automatically create index?
                            
                                Java JDBC: dates consistently two days off
                            
                                Why does HikariCP recommend fixed size pool for better performance
                            
                                Column, parameter, or variable #10: Cannot find data type
                            
                                SQL Server Error: "%" is not a constraint. Could not drop constraint. See previous errors
                            
                                What resource does a key lock actually lock?
                            
                                Windowed functions can only appear in the SELECT or ORDER BY clauses
                            
                                Pipes and filters at DBMS-level: Splitting the MERGE output stream
                            
                                Where can I find the Northwind database for Microsoft SQL Server 2008?
                            
                                SQL Server 2008 Installation
                            
                                Parse comma-separated string to make IN List of strings in the Where clause
                            
                                Using DateTime in a SqlParameter for Stored Procedure, format error
                            
                                Combine varchar column with int column
                            
                                Is it possible to use a Case statement in a sql From clause
                            
                                how to find "String or binary data would be truncated" error on sql in a big query
                            
                                How to return second newest record in SQL?
                            
                                SQL Server : FOR XML PATH - nesting / grouping

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Database design: one huge table or separate tables?

Tags:

sql-server

sql-server-2008

database-design

data-warehouse

littlegreen

People also ask

2 Answers

Galwegian

APC

Recent Activity

Donate For Us