Resources for high performance SQL Server database design

Tags:

I'd like some suggestions for online resources (blogs, guides, etc - not forums) to help me become good at designing high performance SQL Server databases that operate with large amounts of data and have heavy loads in terms of data turnover and queries per minute.

Suggestions?

EDIT

The load I'm talking about is mainly in terms of data turnover. The main table has up to a million rows, about 30 fields of data of varying size and is updated with about 30-40000 new rows per day and at least 200000 rows are updated with new data every day. These updates happen on a continuing basis throughout the day. On top of this, all changes and updates need to be pulled from the database throughout the day to keep a large Lucene index up to date.

943

asked Mar 20 '10 15:03

Nathan Ridley

4 Answers

Sounds like a fairly manageable load on a moderate server - you haven't said what kind of read operations are happening while these inserts and updates are going on (other than the extractions for Lucene) and the size (byte-wise/data type-wise) of the data (the cardinality you have given seems fine).

At this point, I would recommend just using regular SQL Server best practices - determine a schema which is appropriate (normalize, then denormalize only if necessary), review execution plans, use the index tuning wizard, use the DMVs to find the unused indexes and remove them, choose clustered indexes carefully to manage page splits, carefully choose data types and size and use referential integrity and constraints where possible to give the optimizer as much help as possible. Beyond that is performance counters and ensuring your hardware/software installation is tuned.

In many/most cases, you'll never need to go beyond that to actually re-engineer your architecture.

However, even after all that, if the read load is heavy, the inserts and updates can cause locking issues between reads and writes, and then you are looking at architectural decisions for your application.

Also, the million rows and 200k updates a day wouldn't worry me - but you mention Lucene (i.e. full text indexing), so presumably some of the columns are rather large. Updating large columns and exporting them obviously takes far longer - and far more bandwidth and IO. 30 columns in a narrow million row table with traditional data type columns would be a completely different story. You might want to look at the update profile and see if you need to partition the table vertically to move some columns out of the row (if they are large, they will already be stored out of row) to improve the locking behavior.

So the key thing when you have heavy read load: Inserts and updates need to be as fast as possible, lock as little as possible (avoiding lock escalation), update as few indexes as can be afforded to support the read operation.

If the read load is so heavy (so that the inserts/updates start to conflict) but does not require 100% up to date information (say a 5 minute or 15 minute delay is not noticeable), you can have a read only version of the database which is maintained (either identical through replication, differently indexed for performance, denormalized or differently modeled - like a dimensional model). Perhaps your Lucene indexes can contain additional information so that the expensive read operations all stay in Lucene - i.e. Lucene becomes covering for many large read operations, thereby reducing your read load on the database to essential reads which support the inserts/updates (these are typically small reads) and the transactional part of your app (i.e. say a customer service information screen would use the regular database, while your hourly dashboard would use the secondary database).

answered Nov 15 '22 06:11

Cade Roux

You might try the SQL Server samples on CodePlex or DatabaseAnswers.com.

answered Nov 15 '22 05:11

tvanfosson

Here are some resources about troubleshooting and optimizing performance in SQL Server, that I've found really helpful:

http://updates.sqlservervideos.com/2009/09/power-up-with-sql-server-sql-server-performance.html

In particular, effective use of indexes can be a huge performance booster. I think that most web applications, in most circumstances, do a lot more reading than writing. Also, the sargability of an expression can have a serious impact on performance.

answered Nov 15 '22 07:11

RMorrisey

http://www.amazon.com/s/ref=nb_sb_noss?url=search-alias%3Daps&field-keywords=high+performance+database

This is subject better explored first with books as it is highly technical and complex.

I will point out that the people who created this website include several who work with very large databases. You can learn alot from them. http://lessthandot.com/

answered Nov 15 '22 05:11

HLGEM

Related questions
                            
                                STRING_AGG with line break
                            
                                Using LIKE in SQL with multiple search terms
                            
                                Why does this SQL script work as it does?
                            
                                SQL Server Alerts - Best Practices
                            
                                SQL Server - how to use 'ALTER INDEX' with variables as the parameters
                            
                                Microsoft SQL: CASE WHEN vs ISNULL/NULLIF
                            
                                Cause of SSIS Custom Dataflow Component Error - Cannot be upgraded
                            
                                SQL Server Boolean Expression evaluation
                            
                                SQL Server xml string parsing in varchar field
                            
                                Allow special characters SQL Server 2008
                            
                                Do all parts of a SQL SERVER expression using 'OR' get evaluated?
                            
                                SQL Server CONVERT(NUMERIC(18,0), '') fails but CONVERT(INT, '') succeeds?
                            
                                How to Download A file stored in SQL DB in Binary Format
                            
                                Versioning SQL Server DDL code
                            
                                Fetch the row which has the Max value for a column in SQL Server
                            
                                Question on how to read a SQL Execution plan
                            
                                Get Weeks in SQL
                            
                                Take 'Backup' of SQL Server Compact database
                            
                                ODBC query on MS SQL Server returning first 255 characters only in PHP PDO (FreeTDS)
                            
                                Can I make an identity field span multiple tables in SQL Server?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Resources for high performance SQL Server database design

Tags:

performance

sql-server

database-design