Why & When should I use SPARSE COLUMN? (SQL SERVER 2008)

After going through some tutorials on SQL Server 2008's new "SPARSE COLUMN" feature, I found that a sparse column takes no space when its value is 0 or NULL, but when there is a value it takes 4 times the space a regular (non-sparse) column does.

If my understanding is correct, why would I choose it when designing a database? And in what situations would I use it?

Also, out of curiosity, how is it that no space is reserved when a column is defined as a sparse column? (That is, what is the internal implementation?)

asked Sep 09 '09 09:09

priyanka.sarkar

2 Answers

A sparse column doesn't use 4x the amount of space to store a value; it uses a fixed 4 extra bytes per non-null value. (As you've already stated, a NULL takes 0 space.)

So a non-null value stored in a bit column would be 1 bit + 4 bytes = 4.125 bytes. But if 99% of these are NULL, it is still a net savings.
A non-null value stored in a GUID (UniqueIdentifier) column is 16 bytes + 4 bytes = 20 bytes. So even if only 50% of these are NULL, that's still a net savings.

So the "expected savings" depends strongly on what kind of column we're talking about, and your estimate of what ratio will be null vs non-null. Variable width columns (varchars) are probably a little more difficult to predict accurately.
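The arithmetic above can be sketched in a few lines of Python. This is only a rough model, not how SQL Server accounts for storage internally: it assumes a regular fixed-width column occupies its full width even when NULL, it uses the 4-byte overhead quoted above, and it ignores row headers and the NULL bitmap. The function names are made up for illustration.

```python
# Fixed extra cost per non-NULL sparse value, as quoted in the answer above.
SPARSE_OVERHEAD_BYTES = 4


def average_bytes(value_bytes, null_fraction, sparse):
    """Average storage per row for one fixed-width column (simplified model)."""
    if sparse:
        # NULLs cost nothing; every stored value pays the 4-byte overhead.
        return (1.0 - null_fraction) * (value_bytes + SPARSE_OVERHEAD_BYTES)
    # Assumption: a regular fixed-width column uses its full width even when NULL.
    return value_bytes


def break_even_null_fraction(value_bytes):
    """NULL fraction above which the sparse column is smaller on average."""
    return SPARSE_OVERHEAD_BYTES / (value_bytes + SPARSE_OVERHEAD_BYTES)


if __name__ == "__main__":
    for name, size in [("bit", 0.125), ("int", 4), ("uniqueidentifier", 16)]:
        threshold = break_even_null_fraction(size)
        print(f"{name}: sparse is smaller above {threshold:.1%} NULL")
```

Under this model a uniqueidentifier column breaks even at 20% NULL, while a bit column needs roughly 97% NULL before sparse storage pays off at all. In practice you would want to be comfortably above the break-even point, since sparse columns also carry the other costs listed further down.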

The Books Online page on sparse columns has a table showing what percentage of values of each data type would need to be NULL for you to end up with a benefit.

So when should you use a Sparse Column? When you expect a significant percentage of the rows to have a NULL value. Some examples that come to mind:

An "Order Return Date" column in an order table. You would hope that a very small percentage of sales would result in returned products.
A "4th Address" line in an Address table. Most mailing addresses, even if you need a Department name and a "Care Of", probably don't need 4 separate lines.
A "Suffix" column in a customer table. A fairly low percentage of people have a "Jr." or "III" or "Esquire" after their name.

answered Sep 28 '22 09:09

BradC

Storing a NULL in a sparse column takes up no space at all.
To any external application the column behaves the same as a regular column.
Sparse columns work really well with filtered indexes, as you will usually only want to index the rows where the column is not NULL.
You can create a column set over the sparse columns that returns an XML snippet of all of the non-null data from the columns covered by the set. The column set behaves like a column itself. Note: you can only have one column set per table.
Change Data Capture and transactional replication both work with sparse columns, but neither works with the column sets feature.

Downsides

If a sparse column has data in it, it takes 4 more bytes than a normal column, e.g. even a bit (0.125 bytes normally) becomes 4.125 bytes and a uniqueidentifier rises from 16 bytes to 20 bytes.
Not all data types can be sparse: text, ntext, image, timestamp, user-defined data types, geometry, geography, and varbinary(max) with the FILESTREAM attribute cannot be sparse. (Changed 17/5/2009; thanks Alex for spotting the typo.)
Computed columns can't be sparse (although a sparse column can take part in the calculation of another computed column).
You can't bind rules to sparse columns or give them default values.
Sparse columns cannot form part of a clustered index. If you need to do that, use a computed column based on the sparse column and create the clustered index on that (which sort of defeats the object).
Merge replication doesn't work.
Data compression doesn't work.
Access (read and write) to sparse columns is more expensive, but I haven't been able to find any exact figures on this.

Reference

answered Sep 28 '22 09:09

rahul

Tags: sql, sql-server, sql-server-2008, database-design
