Change column types in a huge table

Tags:

I have a table in SQL Server 2008 R2 with close to a billion rows. I want to change the datatype of two columns from int to bigint. Two times ALTER TABLE zzz ALTER COLUMN yyy works, but it's very slow. How can I speed the process up? I was thinking to copy the data to another table, drop, create, copy back and switching to simple recovery mode or somehow doing it with a cursor a 1000 rows a time but I'm not sure if those will actually lead to any improvement.

439

asked May 25 '12 12:05

user1417408

Video Answer

1 Answers

Depending on what change you are making, sometimes it can be easier to take a maintenance window. During that window (where nobody should be able to change the data in the table) you can:

drop any indexes/constraints pointing to the old column, and disable triggers
add a new nullable column with the new data type (even if it is meant to be NOT NULL)
update the new column setting it equal to the old column's value (and you can do this in chunks of individual transactions (say, affecting 10000 rows at a time using UPDATE TOP (10000) ... SET newcol = oldcol WHERE newcol IS NULL) and with CHECKPOINT to avoid overrunning your log)
once the updates are all done, drop the old column
rename the new column (and add a NOT NULL constraint if appropriate)
rebuild indexes and update statistics

The key here is that it allows you to perform the update incrementally in step 3, which you can't do in a single ALTER TABLE command.

This assumes the column is not playing a major role in data integrity - if it is involved in a bunch of foreign key relationships, there are more steps.

EDIT

Also, and just wondering out loud, I haven't done any testing for this (but adding it to the list). I wonder if page + row compression would help here? If you change an INT to a BIGINT, with compression in place SQL Server should still treat all values as if they still fit in an INT. Again, I haven't tested if this would make an alter faster or slower, or how much longer it would take to add compression in the first place. Just throwing it out there.

171

answered Sep 30 '22 04:09

Aaron Bertrand

Related questions
                            
                                How do I use order by in HQL?
                            
                                Are IEnumerable Linq methods thread-safe?
                            
                                Maven deploy: forcing the deploy even if artifact already exists
                            
                                WiFi Direct (Android 4.0) with multiple (3+) devices
                            
                                How to make formule bigger in org-mode of Emacs?
                            
                                How to combine static_assert with sizeof and stringify?
                            
                                Matplotlib requirements with pip install in virtualenv
                            
                                Can I use Google drive for chrome extensions (not App)
                            
                                Modify data as part of an alembic upgrade
                            
                                add data files to python projects setup.py
                            
                                How to use gems not in a Gemfile when working with bundler?
                            
                                Identity insert on linked server fails

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With