 

Archiving Large Table (SQL Server 2008)

I have a very large table that is being filled with hundreds of millions of records each quarter.

I manually move data from the existing table to another database using a script, to minimize the backup size and to offload the production database when running queries.

Is there a better way, for example a scheduled script that moves data from the production database to another database and then efficiently deletes the records from the source table every day or week?

Note that my log file is growing rapidly due to the high number of INSERTs into this table; also, when I move data to the archive database, the DELETEs will be logged as well.

Thanks

asked Oct 23 '12 by PyQL

1 Answer

Let me recap the requirements:

  1. reduce the backup size
  2. reduce the number of records in the database by archiving
  3. archive the data without excessive logging

In order to reduce the backup size, you'll need to move the data into a different database.

As far as logging goes, you'll want to look over the rules of minimal logging and make sure that you are following them. In particular, make sure the database you are inserting into uses the simple or bulk-logged recovery model.
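For example, a minimal sketch (ArchiveDB is a placeholder name for the archive database):

ALTER DATABASE ArchiveDB SET RECOVERY BULK_LOGGED;   -- allows minimally logged bulk loads

-- ... perform the archive inserts here ...

ALTER DATABASE ArchiveDB SET RECOVERY FULL;          -- or back to whatever model you normally use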

For inserting the archived data, you want to disable the non-clustered indexes (and rebuild them after the insert has completed), use trace flag 610 if there is a clustered index, and take a table lock on the destination table. There are many more rules in the link that you'll want to check off, but these are the basics.
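A rough sketch of such an insert, assuming placeholder names (ArchiveDB, dbo.BigTable_Archive, IX_Archive_Date, Prod.dbo.BigTable) and a date-based archive predicate:

USE ArchiveDB;

ALTER INDEX IX_Archive_Date ON dbo.BigTable_Archive DISABLE;   -- disable non-clustered indexes before the load

DBCC TRACEON (610);                                            -- trace flag 610: minimal logging into a table with a clustered index

INSERT INTO dbo.BigTable_Archive WITH (TABLOCK)                -- table lock is required for minimal logging
SELECT *
FROM   Prod.dbo.BigTable
WHERE  CreatedDate < '20120701';                               -- placeholder condition for the rows being archived

ALTER INDEX IX_Archive_Date ON dbo.BigTable_Archive REBUILD;   -- rebuild the disabled indexes afterwards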

There is no minimally logged DELETE, but you can limit log file growth by deleting in chunks with the TOP clause (and by switching to the simple recovery model for the duration of the delete). The basic idea is:

SELECT NULL;   -- seeds @@ROWCOUNT so the loop runs at least once

WHILE @@ROWCOUNT > 0

     DELETE TOP (50000) FROM dbo.BigTable WHERE Condition = 1;   -- table name and predicate are placeholders: match only rows already copied to the archive

Adjust the TOP value to control how much is logged per delete, and make sure the predicate is correct so that you only delete what you intend to. The statement deletes 50,000 rows at a time; as long as @@ROWCOUNT comes back greater than zero the loop repeats, and it stops once the rowcount returned is 0.

If you really want minimal logging for everything, you can partition the source table by week, create a clone of the source table (on the same partition function and with an identical indexing structure), switch the partition from the source table to the cloned table, insert from the cloned table into the archive table, and then truncate the cloned table. The advantage is that the cleanup is a truncate rather than a delete. The disadvantage is that it is much more complicated to set up, maintain, and query (you get one heap or b-tree per partition, so if queries don't use partition elimination, a clustered index/table scan has to scan multiple b-trees/heaps instead of just one).
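A sketch of that partition-switch approach, with assumed names (dbo.BigTable, dbo.BigTable_Staging, ArchiveDB.dbo.BigTable_Archive) and partition number 42 standing in for the week being archived; the staging table must sit on the same partition scheme with an identical structure:

-- Metadata-only operation: the week's rows move out of the source table instantly.
ALTER TABLE dbo.BigTable SWITCH PARTITION 42 TO dbo.BigTable_Staging PARTITION 42;

-- Copy the switched-out rows into the archive database (the minimal logging rules above still apply).
INSERT INTO ArchiveDB.dbo.BigTable_Archive WITH (TABLOCK)
SELECT * FROM dbo.BigTable_Staging;

-- Truncate instead of delete: minimal logging on the cleanup as well.
TRUNCATE TABLE dbo.BigTable_Staging;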

answered Oct 03 '22 by brian