Best strategy for storing documents in SQL Server 2008

Tags:

One of our teams is going to be developing an application to store records in a SQL2008 database and each of these records will have an associated PDF file. There is currently about 340GB of files, with most (70%) being about 100K, but some are several Megabytes in size. Data is mostly inserted and read, but the files are updated on occasion. We are debating between the following options:

Store the files as BLOBs in the database.
Store the files outside the database and store the paths in the database.
Use SQL2008's Filestream feature to store the files.

We have read the Micrsoft best practices regarding filestream data, but since the files vary in size, we are not sure which path to choose. We are leaning toward option 3 (filestream), but have some questions:

Which architecture would you choose given the amount of data and file sizes noted above?
Data access will be done using SQL authentication, not Windows authentication, and the web server will likely not be able to access the files using Windows API. Would this make filstream perform worse than the other two options?
Since the SQL backups include the filestream data, this would lead to very large database backups. How do others handle backing up databases with a large amount of filestream data?

663

asked Sep 30 '10 18:09

DCNYAM

2 Answers

OK, here we go. Option 2 is a really bad idea - you end up with untestable integrity constraints and backups that are not guaranteed to be consistent per definition because you can not take point in time backups. Not a problem in MOST scenarios, it turns into one the moment you have a more complicated (point in time) recovery.

Options 1 and 3 are pretty equal, albeit with some implications.

Filestream can use a lot more disc space. Basically, every version has a guid, if you make updates the old files stay around until the next backup.
OTOH the files do not count as db size (express edition - not against the 10gb limit should you use it) and access is further down possible using a file share. This is added flexibility.
In database has the most limited options regarding access (no way for the web server to just open the file after getting the path from the sql - it has to funnel the complete file through the sql protocol layer) but has advantages in regards of having less files (numbers). Putting the blobs into a separate table and that one a separate set of spindles may be strategically a good idea.

Regarding your questions:

1: I would go with in database storage. Try out both - filestream and not. As you use the same API anyway, this is a simple change in the table definition.

2: Yes, worse than direct file access, but it would be more protected than direct file access. Otherwise I do not think filestream and blob make a significant difference.

3: where do you have a huge backup here? Sorry to ask, but your 340gb is not exactly a large database. And you need to back it up ANYWAY. Better do it in one consistent state, which is what you achieve with db storage. Plus integrity (no one accidentally deleting unused documents without cleaning up the database). The DB is not significantly larger than doing that split, and it is a simple one place backup.

At the end, the question is db integrity and ease of backing things up. Win for SQL Server unless you get large - and this means 360 terabyte of data.

answered Nov 15 '22 23:11

TomTom

Store the files outside the database and store the paths in the database.

because it takes too much space to store files in the database.

answered Nov 15 '22 22:11

Beth

Related questions
                            
                                Select query skips records during concurrent updates
                            
                                Routing to Different SQL Server Instances Running through Docker on Default Port
                            
                                Merging duplicated records together with "Merge" syntax
                            
                                What is the best way to attach existing database to sql localdb?
                            
                                SQL Srv 2016: Login failed for user 'MicrosoftAccount\...'
                            
                                SqlConnection vs Sql Session. Do their lifetimes coincide?
                            
                                Convert Historical Local Time to UTC Time in SQL Server
                            
                                How to implement ASP.NET identity: CREATE DATABASE permission denied in database 'master'
                            
                                SSRS report files (.rdl) how to upgrade to latest?
                            
                                SQL Server 2008 Hierarchy Data Type Performance?
                            
                                Is there a practical way to use the hierarchyID datatype in entity framework 4?
                            
                                Django Model Choices: IntegerField vs CharField
                            
                                Migrating from MySQL to SQL Server, issues with constraints
                            
                                What is the purpose of creating a login from a certificate?
                            
                                Case of using filtered statistics
                            
                                Calculation using Date function in SQL Server 2008
                            
                                Client collation and SQL Server 2005
                            
                                How to write large files to SQL Server FILESTREAM?
                            
                                How do I visually design my database with Entity Framework Core? [closed]
                            
                                SQL Agent job failure with SSIS package to Access DB

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Best strategy for storing documents in SQL Server 2008

Tags:

sql-server

sql-server-2008

blob

filestream

DCNYAM

People also ask

2 Answers

TomTom

Beth

Recent Activity

Donate For Us