Do you put your database static data into source control? How?

Tags:

version-control

sql

I'm using SQL-Server 2008 with Visual Studio Database Edition.

With this setup, keeping your schema in sync is very easy. Basically, there's a 'compare schema' tool that allows me to sync the schema of two databases and/or a database schema with a source-controlled creation script folder.

However, the situation is less clear when it comes to data, which can be of three different kinds:

Static data referenced in the code. Typical example: my users can change their settings, and their configuration is stored on the server. However, there is a system-wide default value for each setting, used when the user has not overridden it. The table containing those default settings grows as more options are added to the program. This means that when a new feature/option is checked in, the system-wide default setting usually has to be created in the database as well.
Static data the code does not reference, e.g. a product list populating a dropdown list. The program doesn't rely on the existence of a specific product in the list to work. This can be, for example, a list of Unicode-encoded products that should be deployed in production when the new "Unicode version" of the program is deployed.
Other data, i.e. everything else (logs, user accounts, user data, etc.).

It seems obvious to me that my third item shouldn't be source-controlled (of course, it should be backed up on a regular basis).

But regarding the static data, I'm wondering what to do.

Should I append the insert scripts to the creation scripts, or use separate scripts?
How do I (as a developer) warn the people doing the deployment that they should execute an insert statement?
Should I differentiate my two kinds of static data? (The first is usually created by a dev, while the second is usually created by a non-dev.)

How do you manage your DB static data?


asked Oct 06 '09 13:10

Brann

1 Answer

I have explained the technique I use in my blog post Version Control and Your Database. I use database metadata (in this case SQL Server extended properties) to store the deployed application version. I only have scripts that upgrade from one version to the next. At startup the application reads the deployed version from the database metadata (missing metadata is interpreted as version 0, i.e. nothing is deployed yet). For each version there is an application function that upgrades to the next version. Usually this function runs a T-SQL script embedded as an internal resource, but it can be something else, like deploying a CLR assembly in the database.

There is no script to deploy the 'current' database schema. New installations iterate through all intermediate versions, from version 1 to the current version.
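The startup upgrade loop described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the answer's actual implementation: it uses Python with an in-memory SQLite database so it runs anywhere, and SQLite's `PRAGMA user_version` stands in for the SQL Server extended property. The names `UPGRADE_STEPS`, `deployed_version`, `upgrade`, and the `default_settings` table are hypothetical.

```python
import sqlite3

# Hypothetical upgrade steps: UPGRADE_STEPS[i] upgrades from version i to i + 1.
UPGRADE_STEPS = [
    # version 0 -> 1: create the table that holds the default settings
    "CREATE TABLE default_settings (name TEXT PRIMARY KEY, value TEXT NOT NULL);",
    # version 1 -> 2: seed static data that the code relies on
    "INSERT INTO default_settings (name, value) VALUES ('theme', 'light');",
]


def deployed_version(conn: sqlite3.Connection) -> int:
    """Read the deployed version; a fresh database reports 0."""
    return conn.execute("PRAGMA user_version").fetchone()[0]


def upgrade(conn: sqlite3.Connection) -> int:
    """Apply every pending step in order and return the resulting version."""
    version = deployed_version(conn)
    while version < len(UPGRADE_STEPS):
        conn.executescript(UPGRADE_STEPS[version])
        version += 1
        # PRAGMA takes no bound parameters; version is an int we control.
        conn.execute(f"PRAGMA user_version = {version}")
    return version


conn = sqlite3.connect(":memory:")
print(upgrade(conn))  # fresh install: runs both steps
print(upgrade(conn))  # already current: nothing left to run
```

A fresh install and an upgrade go through the same `upgrade` function; only the starting version differs. For brevity the sketch does not wrap a step and its version bump in one transaction, which a real deployment would probably want.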

This technique gives me several advantages:

It is easy for me to test a new version. I have a backup of the previous version, I apply the upgrade script, then I can revert to the previous version, change the script, and try again until I'm happy with the result.
My application can be deployed on top of any previous version. Different clients have different versions deployed. When they upgrade, my application supports upgrading from any previous version.
There is no difference between a fresh install and an upgrade. Both run the same code, so I have fewer code paths to maintain and test.
There is no difference between DML and DDL changes (your original question). They are all treated the same way, as scripts run to move from one version to the next. When I need to make a change like the one you describe (changing a default), I increase the schema version even if no DDL change occurs. So at version 5.1 the default was 'foo', in 5.2 the default is 'bar', and that is the only difference between the two versions. The 'upgrade' step is simply an UPDATE statement (followed of course by the version metadata change, i.e. sp_updateextendedproperty).
All changes are in source control, as part of the application sources (mostly T-SQL scripts).
I can easily get to any previous schema version, e.g. to reproduce a customer complaint, simply by running the upgrade sequence and stopping at the version I'm interested in.
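Two of the points above, a data-only version bump and stopping the sequence at an older version, can be illustrated with a small sketch. It is an assumption-laden example, not the author's code: Python with in-memory SQLite replaces SQL Server, `PRAGMA user_version` replaces the extended property, integer versions replace 5.1 and 5.2, and `STEPS`, `upgrade_to`, `greeting`, and the `default_settings` table are hypothetical names.

```python
import sqlite3

# Hypothetical steps; STEPS[i] upgrades from version i to version i + 1.
STEPS = [
    "CREATE TABLE default_settings (name TEXT PRIMARY KEY, value TEXT NOT NULL);",
    "INSERT INTO default_settings (name, value) VALUES ('greeting', 'foo');",
    # Data-only step: no DDL at all, just a new default value.
    "UPDATE default_settings SET value = 'bar' WHERE name = 'greeting';",
]


def upgrade_to(conn: sqlite3.Connection, target: int) -> int:
    """Apply pending steps until the database reaches version `target`."""
    version = conn.execute("PRAGMA user_version").fetchone()[0]
    for step in STEPS[version:target]:
        conn.executescript(step)
        version += 1
        # Record the new version right after the step that produced it.
        conn.execute(f"PRAGMA user_version = {version}")
    return version


def greeting(conn: sqlite3.Connection) -> str:
    """Return the current system-wide default for the 'greeting' setting."""
    return conn.execute(
        "SELECT value FROM default_settings WHERE name = 'greeting'"
    ).fetchone()[0]


old = sqlite3.connect(":memory:")
upgrade_to(old, 2)            # stop early to reproduce an older deployment
new = sqlite3.connect(":memory:")
upgrade_to(new, len(STEPS))   # fresh install of the current version
print(greeting(old), greeting(new))  # prints "foo bar"
```

The third step shows why the DML/DDL distinction disappears: changing a default is just another numbered step, and the older database can later be brought forward by calling `upgrade_to(old, len(STEPS))`.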

This approach has saved my skin a number of times and I'm a true believer now. There is only one disadvantage: there is no obvious place to look in source to find 'what is the current form of procedure foo?'. Because foo might have last been changed 2 or 3 versions ago, I need to find the upgrade script for that version. I usually resort to just looking into the database and seeing what's in there, rather than searching through the upgrade scripts.
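If searching the scripts is preferred over inspecting the database, one possible helper is a newest-to-oldest scan for the last script that creates or alters the object. This is a hypothetical sketch, not something the answer describes: the `SCRIPTS` mapping, its contents, and `last_version_defining` are made up for illustration, and the regular expression only handles simple `CREATE`/`ALTER PROCEDURE` statements.

```python
import re
from typing import Optional

# Hypothetical upgrade scripts, keyed by the version each one produces.
SCRIPTS = {
    1: "CREATE PROCEDURE foo AS SELECT 1;",
    2: "CREATE TABLE products (id INT PRIMARY KEY);",
    3: "ALTER PROCEDURE foo AS SELECT 2;",
    4: "INSERT INTO products (id) VALUES (1);",
}


def last_version_defining(name: str) -> Optional[int]:
    """Return the newest version whose script creates or alters procedure `name`."""
    pattern = re.compile(
        rf"\b(?:CREATE|ALTER)\s+PROCEDURE\s+{re.escape(name)}\b",
        re.IGNORECASE,
    )
    # Walk from the newest script to the oldest and stop at the first match.
    for version in sorted(SCRIPTS, reverse=True):
        if pattern.search(SCRIPTS[version]):
            return version
    return None


print(last_version_defining("foo"))  # prints 3
```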

One final note: this is not actually my invention. It is modeled exactly on how SQL Server itself upgrades the database metadata (mssqlsystemresource).

answered Oct 19 '22 16:10

Remus Rusanu
