How to version control a record in a database [closed]

People also ask

Can you version control a database?

What is database versioning? Database versioning begins with database schema, the structure of the database. In order to effectively version a database, you need to track and understand the changes that are happening.

How do I manage database versions?

The most basic method for managing database changes is to save the alter command in a script or set of scripts, and manage them in the exiting file-based version control. This guarantees a single repository that stores all the application component assets.

How do you use version control in SQL?

Open SQL Server Management Studio and connect to a SQL Server instance. Right-click on your database in the Object Explorer pane and select "Connect to Version Control". This will open the "Connect Database to Version Control" dialog. Copy the https repository path from GitHub and paste it into VersionSQL.

Let's say you have a FOO table that admins and users can update. Most of the time you can write queries against the FOO table. Happy days.

Then, I would create a FOO_HISTORY table. This has all the columns of the FOO table. The primary key is the same as FOO plus a RevisionNumber column. There is a foreign key from FOO_HISTORY to FOO. You might also add columns related to the revision such as the UserId and RevisionDate. Populate the RevisionNumbers in an ever-increasing fashion across all the *_HISTORY tables (i.e. from an Oracle sequence or equivalent). Do not rely on there only being one change in a second (i.e. do not put RevisionDate into the primary key).

Now, every time you update FOO, just before you do the update you insert the old values into FOO_HISTORY. You do this at some fundamental level in your design so that programmers can't accidentally miss this step.

If you want to delete a row from FOO you have some choices. Either cascade and delete all the history, or perform a logical delete by flagging FOO as deleted.

This solution is good when you are largely interested in the current values and only occasionally in the history. If you always need the history then you can put effective start and end dates and keep all the records in FOO itself. Every query then needs to check those dates.

I think you are looking for versioning the content of database records (as StackOverflow does when someone edits a question/answer). A good starting point might be looking at some database model that uses revision tracking.

The best example that comes to mind is MediaWiki, the Wikipedia engine. Compare the database diagram here, particularly the revision table.

Depending on what technologies you're using, you'll have to find some good diff/merge algorithms.

Check this question if it's for .NET.

In the BI world, you could accomplish this by adding a startDate and endDate to the table you want to version. When you insert the first record into the table, the startDate is populated, but the endDate is null. When you insert the second record, you also update the endDate of the first record with the startDate of the second record.

When you want to view the current record, you select the one where endDate is null.

This is sometimes called a type 2 Slowly Changing Dimension. See also TupleVersioning

Upgrade to SQL 2008.

Try using SQL Change Tracking, in SQL 2008. Instead of timestamping and tombstone column hacks, you can use this new feature for tracking changes on data in your database.

MSDN SQL 2008 Change Tracking

Just wanted to add that one good solution to this problem is to use a Temporal database. Many database vendors offer this feature either out of the box or via an extension. I've successfully used the temporal table extension with PostgreSQL but others have it too. Whenever you update a record in the database, the database holds on to the previous version of that record too.

Two options:

Have a history table - insert the old data into this history table whenever the original is updated.
Audit table - store the before and after values - just for the modified columns in an audit table along with other information like who updated and when.

You can perform auditing on a SQL table via SQL triggers. From a trigger you can access 2 special tables (inserted and deleted). These tables contain the exact rows that were inserted or deleted each time the table is updated. In the trigger SQL you can take these modified rows and insert them into the audit table. This approach means that your auditing is transparent to the programmer; requiring no effort from them or any implementational knowledge.

The added bonus of this approach is that the auditing will occur regardless of whether the sql operation took place via your data access DLLs, or via a manual SQL query; (as the auditing is performed on the server itself).

Related questions
                            
                                Database Structure for Tree Data Structure [closed]
                            
                                How to design a product table for many kinds of product where each product has many parameters
                            
                                NoSql vs Relational database
                            
                                Storing SHA1 hash values in MySQL
                            
                                Subqueries vs joins
                            
                                What should I name a table that maps two tables together? [closed]
                            
                                When/Why to use Cascading in SQL Server?
                            
                                What's wrong with nullable columns in composite primary keys?
                            
                                Database design for audit logging [closed]
                            
                                Implementing Comments and Likes in database
                            
                                Is there ever a time where using a database 1:1 relationship makes sense?
                            
                                How to Store Historical Data [closed]
                            
                                PostgreSQL: Which Datatype should be used for Currency?
                            
                                How big is too big for a PostgreSQL table?
                            
                                Relational table naming convention [closed]
                            
                                Database Design for Tagging [closed]
                            
                                Is it better to use multiple databases with one schema each, or one database with multiple schemas?
                            
                                Storing money in a decimal column - what precision and scale? [closed]
                            
                                Surrogate vs. natural/business keys [closed]
                            
                                Is there a good reason I see VARCHAR(255) used so often (as opposed to another length)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to version control a record in a database [closed]

Tags:

architecture

versioning

database-design

auditing

People also ask

Recent Activity

Donate For Us