SQL Server Change Data Capture: Preserving history when adding columns?

Tags: sql-server, cdc

When a new column is added to a table that is configured for change data capture (CDC), the capture instance table will not have the new column until CDC is disabled and re-enabled for the source table. In the process, the existing capture instance is dropped.

I thought I could copy the existing data out to a temp table and then copy it back using the following SQL. However, other CDC metadata, such as cdc.change_tables.start_lsn, becomes invalid.

How can the capture instance history be preserved, using the same capture instance name, if at all?

Thanks, Rich

/*Change Data Capture Test - Alter table definition test */

/*Enter restricted mode so we don't lose data changes during this process*/
alter database ChangeDataCaptureTest set AUTO_UPDATE_STATISTICS_ASYNC OFF
alter database ChangeDataCaptureTest set RESTRICTED_USER with ROLLBACK IMMEDIATE
go

/*Add a column to the table*/
alter table dbo.Table1 add value3 varchar(20) DEFAULT '' not null

/*Copy the existing change tracking into a temp table*/
select * into cdc.dbo_Table1_temp from cdc.dbo_Table1_CT

/*Add the new column to the temp table so that we don't have to map
all columns when we copy back, note that we use NULL as the default*/
alter table cdc.dbo_Table1_temp add value3 varchar(20) DEFAULT NULL

/*Disable CDC on the source table, this will drop the associated cdc table*/
exec sys.sp_cdc_disable_table 
@source_schema='dbo',
@source_name='Table1', 
@capture_instance='dbo_Table1'

/*Enable CDC for the table which recreates the CDC table*/
EXEC sys.sp_cdc_enable_table
@source_schema = N'dbo',
@source_name   = N'Table1',
@role_name     = NULL,
@supports_net_changes = 1,
@filegroup_name = N'ChangeDataCapture'
GO

/*Insert values from the temp table back into the new CDC Table*/
Insert into cdc.dbo_Table1_CT 
SELECT * 
From cdc.dbo_Table1_temp
go

/*Drop the temp table*/
drop table cdc.dbo_Table1_temp

/*Go back into multi-user mode*/
alter database ChangeDataCaptureTest set AUTO_UPDATE_STATISTICS_ASYNC ON
alter database ChangeDataCaptureTest set MULTI_USER
go

/*Add a new row to the table*/
insert into table1
values(12,'zz','g')
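
One way to see the metadata mismatch described above is to compare the start LSN recorded for the recreated capture instance with the oldest row copied back into the change table. This is only a minimal sketch, assuming the default capture instance name dbo_Table1 used in the script; the column aliases are illustrative.

```sql
/* Compare the capture instance's validity start with the oldest restored row.
   Assumes the capture instance is named dbo_Table1, as in the script above. */
SELECT ct.capture_instance,
       ct.start_lsn,
       sys.fn_cdc_get_min_lsn(ct.capture_instance) AS min_valid_lsn,
       (SELECT MIN(__$start_lsn) FROM cdc.dbo_Table1_CT) AS oldest_row_lsn
FROM cdc.change_tables AS ct
WHERE ct.capture_instance = N'dbo_Table1';
```

If oldest_row_lsn is lower than min_valid_lsn, the restored rows sit outside the validity interval of the new capture instance, so the generated query functions such as cdc.fn_cdc_get_all_changes_dbo_Table1 cannot be asked for that range even though the rows are physically present.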


asked Apr 16 '10 17:04

Richard Collette

1 Answer

Rich,

The best method to preserve such data is to create a persisted staging table and periodically copy the _CT table data into it. CDC data generally has a short shelf life before it is consumed by the endpoint (warehouse, data mart, etc.), so you can make schema changes within a maintenance window, copying the _CT table data into staging before the changes are implemented.
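
The periodic copy could look something like the sketch below. The staging table name dbo.Table1_CT_Staging is hypothetical, and the row-identity check assumes that (__$start_lsn, __$seqval, __$operation) is enough to tell change rows apart, which may need adjusting on versions whose change tables carry additional system columns.

```sql
/* Hypothetical staging table name; not from the original post. */
IF OBJECT_ID(N'dbo.Table1_CT_Staging', N'U') IS NULL
BEGIN
    /* Create an empty table with the same columns as the change table. */
    SELECT *
    INTO dbo.Table1_CT_Staging
    FROM cdc.dbo_Table1_CT
    WHERE 1 = 0;
END;

/* Copy only the change rows that have not been staged yet. */
INSERT INTO dbo.Table1_CT_Staging
SELECT ct.*
FROM cdc.dbo_Table1_CT AS ct
WHERE NOT EXISTS (
    SELECT 1
    FROM dbo.Table1_CT_Staging AS s
    WHERE s.__$start_lsn = ct.__$start_lsn
      AND s.__$seqval    = ct.__$seqval
      AND s.__$operation = ct.__$operation
);
```

Run on a schedule (for example from a SQL Server Agent job) and once more at the start of the maintenance window, this leaves a copy of the history that survives sys.sp_cdc_disable_table dropping the change table.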

The one aspect to consider is that once the _CT schema has changed (by adding or removing one or more columns), the process that pulls that data into the endpoint must also be updated.
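
Before updating that process, it may help to confirm which columns the recreated capture instance actually tracks. A minimal sketch using the built-in CDC procedures, assuming the capture instance is named dbo_Table1 as in the question:

```sql
/* List the columns captured by the (re)created capture instance. */
EXEC sys.sp_cdc_get_captured_columns
    @capture_instance = N'dbo_Table1';

/* Show the CDC configuration for the source table, including the
   capture instance name and its captured column list. */
EXEC sys.sp_cdc_help_change_data_capture
    @source_schema = N'dbo',
    @source_name   = N'Table1';
```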

To overcome this, we implemented a script store that holds the intended schema of the staging table (used between _CT and the endpoint). Once the changes are implemented on the client DB, we move the data from staging into the endpoint and then update the staging schema.
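
As a rough illustration of that last step, the sketch below drains staging into the endpoint and then applies the schema change to staging. Both table names are hypothetical, and it assumes the endpoint table has the same column layout as the old staging table; in practice the ALTER statement would come from the script store rather than being hard-coded.

```sql
/* Hypothetical names: dbo.Table1_CT_Staging (staging) and
   dbo.Table1_History (endpoint) are illustrative only. */
SET XACT_ABORT ON;
BEGIN TRANSACTION;

    /* 1. Move the pre-change rows into the endpoint while the staging
          table still has the old column layout. */
    INSERT INTO dbo.Table1_History
    SELECT * FROM dbo.Table1_CT_Staging;

    /* 2. Empty staging so no old-shape rows remain. */
    DELETE FROM dbo.Table1_CT_Staging;

    /* 3. Bring staging in line with the new _CT schema. Nullable is an
          assumption; adjust to match the recreated change table. */
    ALTER TABLE dbo.Table1_CT_Staging ADD value3 varchar(20) NULL;

COMMIT TRANSACTION;
```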

Hopefully this will provide food for thought.

answered Sep 22 '22 23:09

LogicalMan

