<p>I have this largish table with three columns as such:</p> <pre class="prettyprint"><code>+-----+-----+----------+ | id1 | id2 | associd | +-----+-----+----------+ | 1 | 38 | 73157604 | | 1 | 112 | 73157605 | | 1 | 113 | 73157606 | | 1 | 198 | 31936810 | | 1 | 391 | 73157607 | +-----+-----+----------+ </code></pre> <p>This continues for 38m rows. The problem is I want to remove the 'associd' column but running <code>ALTER TABLE table_name DROP COLUMN associd;</code> simply takes too long. I wanted to do something like: <code>ALTER TABLE table_name SET UNUSED associd;</code> and <code>ALTER TABLE table_name DROP UNUSED COLUMNS CHECKPOINT 250;</code> then which apparently speeds up the process but it isn't possible in MySQL?</p> <p>Is there an alternative to remove this column-- maybe creating a new table with only the two columns or getting a drop with checkpoints?</p>

<p>Anything that you do is going to require reading and writing 38m rows, so nothing is going to be real fast. Probably the fastest method is probably to put the data into a new table:</p> <pre class="prettyprint"><code>create table newTable as select id1, id2 from oldTable; </code></pre> <p>Or, if you want to be sure that you preserve types and indexes:</p> <pre class="prettyprint"><code>create table newTable like oldTable; alter table newTable drop column assocId; insert into newTable(id1, id2) select id1, id2 from oldTable; </code></pre> <p>However, it is usually faster to drop all index on a table before loading a bunch of data and then recreate the indexes afterwards.</p>

<p>Disclaimer: this answer is MySQL oriented and might not work for other databases.</p> <p>I think in the accepted answer there are some things missing, I have tried to expose here a generic sequence I use to do this kind of operations in a production environment, not only for adding/removing columns but also to add indexes for example.</p> <p>We call it the Indiana Jones' movement.</p> <h3>Create a new table</h3> <p>A new table using the old one as template:</p> <pre class="prettyprint"><code>create table my_table_new like my_table; </code></pre> <h3>Remove the column in the new table</h3> <p>In the new table:</p> <pre class="prettyprint"><code>alter table my_table_new drop column column_to_delete; </code></pre> <h3>Add the foreign keys to the new table</h3> <p>The are not generate automatically in the <code>create table like</code> command.</p> <p>You can check the actual foreign keys:</p> <pre class="prettyprint"><code>mysql> show create table my_table; </code></pre> <p>Then apply them to the new table:</p> <pre class="prettyprint"><code>alter table my_table_new add constraint my_table_fk_1 foreign key (field_1) references other_table_1 (id), add constraint my_table_fk_2 foreign key (field_2) references other_table_2 (id) </code></pre> <h3>Clone the table</h3> <p>Copy all fields but the one you want to delete.</p> <p>I use a <code>where</code> sentence to be able to run this command many times if necessary.</p> <p>As I suppose this is a production environment the <code>my_table</code> will have new records continuously so we have to keep synchronizing until we are capable to do the name changing.</p> <p>Also I have added a <code>limit</code> because if the table is too big and the indexes are too heavy making a one-shot clone can shut down the performance of your database. Plus, if in the middle of the process you want to cancel the operation it will must to rollback all the already done insertions which means your database won't be recovered instantly (https://dba.stackexchange.com/questions/5654/internal-reason-for-killing-process-taking-up-long-time-in-mysql)</p> <pre class="prettyprint"><code>insert my_table_new select field_1, field_2, field_3 from my_table where id > ifnull((select max(id) from my_table_new), 0) limit 100000; </code></pre> <p>As I was doing this several times I created a procedure: https://gist.github.com/fguillen/5abe87f922912709cd8b8a8a44553fe7</p> <h3>Do the name changing</h3> <p>Be sure you run this commands inmediately after you have replicate the last records from your table. Idealy run all commands at once.</p> <pre class="prettyprint"><code>rename table my_table to my_table_3; rename table my_table_new to my_table; </code></pre> <h3>Delete the old table</h3> <p>Be sure you have a back up before you do this ;)</p> <pre class="prettyprint"><code>drop table my_table_3 </code></pre> <p>Disclaimer: I am not sure what will happen with foreign keys that were pointing to the old table.</p>

Drop Column from Large Table

Tags:

sql

mysql

I have this largish table with three columns as such:

Click to copy

+-----+-----+----------+ | id1 | id2 | associd  | +-----+-----+----------+ |   1 |  38 | 73157604 | |   1 | 112 | 73157605 | |   1 | 113 | 73157606 | |   1 | 198 | 31936810 | |   1 | 391 | 73157607 | +-----+-----+----------+

This continues for 38m rows. The problem is I want to remove the 'associd' column but running ALTER TABLE table_name DROP COLUMN associd; simply takes too long. I wanted to do something like: ALTER TABLE table_name SET UNUSED associd; and ALTER TABLE table_name DROP UNUSED COLUMNS CHECKPOINT 250; then which apparently speeds up the process but it isn't possible in MySQL?

Is there an alternative to remove this column-- maybe creating a new table with only the two columns or getting a drop with checkpoints?

542

asked Apr 19 '14 18:04

nico

2 Answers

Anything that you do is going to require reading and writing 38m rows, so nothing is going to be real fast. Probably the fastest method is probably to put the data into a new table:

Click to copy

create table newTable as     select id1, id2     from oldTable;

Or, if you want to be sure that you preserve types and indexes:

Click to copy

create table newTable like oldTable;  alter table newTable drop column assocId;  insert into newTable(id1, id2)     select id1, id2     from oldTable;

However, it is usually faster to drop all index on a table before loading a bunch of data and then recreate the indexes afterwards.

answered Sep 24 '22 14:09

Gordon Linoff

Disclaimer: this answer is MySQL oriented and might not work for other databases.

I think in the accepted answer there are some things missing, I have tried to expose here a generic sequence I use to do this kind of operations in a production environment, not only for adding/removing columns but also to add indexes for example.

We call it the Indiana Jones' movement.

Create a new table

A new table using the old one as template:

Click to copy

create table my_table_new like my_table;

Remove the column in the new table

In the new table:

Click to copy

alter table my_table_new drop column column_to_delete;

Add the foreign keys to the new table

The are not generate automatically in the create table like command.

You can check the actual foreign keys:

Click to copy

mysql> show create table my_table;

Then apply them to the new table:

Click to copy

alter table my_table_new   add constraint my_table_fk_1 foreign key (field_1) references other_table_1 (id),   add constraint my_table_fk_2 foreign key (field_2) references other_table_2 (id)

Clone the table

Copy all fields but the one you want to delete.

I use a where sentence to be able to run this command many times if necessary.

As I suppose this is a production environment the my_table will have new records continuously so we have to keep synchronizing until we are capable to do the name changing.

Also I have added a limit because if the table is too big and the indexes are too heavy making a one-shot clone can shut down the performance of your database. Plus, if in the middle of the process you want to cancel the operation it will must to rollback all the already done insertions which means your database won't be recovered instantly (https://dba.stackexchange.com/questions/5654/internal-reason-for-killing-process-taking-up-long-time-in-mysql)

Click to copy

insert my_table_new select field_1, field_2, field_3 from my_table  where id > ifnull((select max(id) from my_table_new), 0) limit 100000;

As I was doing this several times I created a procedure: https://gist.github.com/fguillen/5abe87f922912709cd8b8a8a44553fe7

Do the name changing

Be sure you run this commands inmediately after you have replicate the last records from your table. Idealy run all commands at once.

Click to copy

rename table my_table to my_table_3; rename table my_table_new to my_table;

Delete the old table

Be sure you have a back up before you do this ;)

Click to copy

drop table my_table_3

Disclaimer: I am not sure what will happen with foreign keys that were pointing to the old table.

answered Sep 23 '22 14:09

fguillen

Related questions
                            
                                Composite key as foreign key (sql)
                            
                                MySQL "IN" queries terribly slow with subquery but fast with explicit values
                            
                                Getting the count of rows in a Java resultset
                            
                                Fetching UTF-8 text from MySQL in R returns "????"
                            
                                MySQL, Error 126: Incorrect key file for table
                            
                                Where can I find the MySQL log file in XAMPP
                            
                                Can you index tables differently on Master and Slave (MySQL)
                            
                                mySQL DataSource on Visual Studio 2012
                            
                                mysql fake select
                            
                                MySQL save results of EXECUTE in a variable?
                            
                                What is the difference between mysqli_connect and mysql_connect?
                            
                                Separate different version of a website
                            
                                Get ids array from related laravel model which is having belongsToMany relationship
                            
                                Best practices for efficiently storing md5 hashes in mysql
                            
                                In MySQL: How to pass a table name as stored procedure and/or function argument?
                            
                                JDBC vs Web Service for Android
                            
                                How can I use executemany to insert into MySQL a list of dictionaries in Python
                            
                                NULL vs DEFAULT NULL vs NULL DEFAULT NULL in MYSQL column creation?
                            
                                Executing SQL scripts on docker container
                            
                                Why would an IN condition be slower than "=" in sql?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Drop Column from Large Table

Tags:

sql

mysql

nico

People also ask

2 Answers

Gordon Linoff

Create a new table

Remove the column in the new table

Add the foreign keys to the new table

Clone the table

Do the name changing

Delete the old table

fguillen

Recent Activity

Donate For Us