If I stop a long running query, does it rollback?

Tags:

A query that is used to loop through 17 millions records to remove duplicates has been running now for about 16 hours and I wanted to know if the query is stopped right now if it will finalize the delete statements or if it has been deleting while running this query? Indeed, if I do stop it, does it finalize the deletes or rolls back?

I have found that when I do a

Click to copy

 select count(*) from myTable

That the rows that it returns (while doing this query) is about 5 less than what the starting row count was. Obviously the server resources are extremely poor, so does that mean that this process has taken 16 hours to find 5 duplicates (when there are actually thousands), and this could be running for days?

This query took 6 seconds on 2000 rows of test data, and it works great on that set of data, so I figured it would take 15 hours for the complete set.

Any ideas?

Below is the query:

Click to copy

--Declare the looping variable DECLARE @LoopVar char(10)       DECLARE      --Set private variables that will be used throughout       @long DECIMAL,       @lat DECIMAL,       @phoneNumber char(10),       @businessname varchar(64),       @winner char(10)      SET @LoopVar = (SELECT MIN(RecordID) FROM MyTable)      WHILE @LoopVar is not null     BEGIN        --initialize the private variables (essentially this is a .ctor)       SELECT          @long = null,         @lat = null,         @businessname = null,         @phoneNumber = null,         @winner = null        -- load data from the row declared when setting @LoopVar         SELECT         @long = longitude,         @lat = latitude,         @businessname = BusinessName,         @phoneNumber = Phone       FROM MyTable       WHERE RecordID = @LoopVar        --find the winning row with that data. The winning row means        SELECT top 1 @Winner = RecordID       FROM MyTable       WHERE @long = longitude         AND @lat = latitude         AND @businessname = BusinessName         AND @phoneNumber = Phone       ORDER BY         CASE WHEN webAddress is not null THEN 1 ELSE 2 END,         CASE WHEN caption1 is not null THEN 1 ELSE 2 END,         CASE WHEN caption2 is not null THEN 1 ELSE 2 END,         RecordID        --delete any losers.       DELETE FROM MyTable       WHERE @long = longitude         AND @lat = latitude         AND @businessname = BusinessName         AND @phoneNumber = Phone         AND @winner != RecordID        -- prep the next loop value to go ahead and perform the next duplicate query.       SET @LoopVar = (SELECT MIN(RecordID)      FROM MyTable     WHERE @LoopVar < RecordID)     END

340

asked Oct 02 '08 12:10

RyanKeeter

2 Answers

no, sql server will not roll back the deletes it has already performed if you stop query execution. oracle requires an explicit committal of action queries or the data gets rolled back, but not mssql.

with sql server it will not roll back unless you are specifically running in the context of a transaction and you rollback that transaction, or the connection closes without the transaction having been committed. but i don't see a transaction context in your above query.

you could also try re-structuring your query to make the deletes a little more efficient, but essentially if the specs of your box are not up to snuff then you might be stuck waiting it out.

going forward, you should create a unique index on the table to keep yourself from having to go through this again.

198

answered Oct 07 '22 18:10

user10635

Your query is not wrapped in a transaction, so it won't rollback the changes already made by the individual delete statements.

I specifically tested this myself on my own SQL Server using the following query, and the ApplicationLog table was empty even though I cancelled the query:

Click to copy

declare @count int select @count = 5 WHILE @count > 0 BEGIN   print @count   delete from applicationlog;   waitfor time '20:00';   select @count = @count -1 END

However your query is likely to take many days or weeks, much longer then 15 hours. Your estimate that you can process 2000 records every 6 seconds is wrong because each iteration in your while loop will take significantly longer with 17 million rows then it does with 2000 rows. So unless your query takes significantly less then a second for 2000 rows, it will take days for all 17 million.

You should ask a new question on how you can delete duplicate rows efficiently.

answered Oct 07 '22 17:10

jwanagel

Related questions
                            
                                How do I escape a single quote in Presto?
                            
                                SQLite: Should LIKE 'searchstr%' use an index?
                            
                                Windowed functions can only appear in the SELECT or ORDER BY clauses
                            
                                Pipes and filters at DBMS-level: Splitting the MERGE output stream
                            
                                Impact of defining VARCHAR2 column with greater length
                            
                                How to pivot rows into columns (custom pivoting)
                            
                                Parse comma-separated string to make IN List of strings in the Where clause
                            
                                Combine varchar column with int column
                            
                                Is it possible to use a Case statement in a sql From clause
                            
                                Is there an alternative to TOP in MySQL?
                            
                                Sorting string column containing numbers in SQL?
                            
                                Running SQL user defined function that returns boolean, in where clause
                            
                                how to find "String or binary data would be truncated" error on sql in a big query
                            
                                SQL: how to limit a join on the first found row?
                            
                                In an Oracle database, what is the difference between ROWNUM and ROW_NUMBER?
                            
                                What's the best way to store the bible in SQL?
                            
                                How to return second newest record in SQL?
                            
                                Android - sqlite in clause using values from array
                            
                                Rails `where` for time less than queries
                            
                                How to drop a SQL Server user with db owner privilege

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

If I stop a long running query, does it rollback?

Tags:

sql

sql-server

duplicate-data

RyanKeeter

People also ask

2 Answers

user10635

jwanagel

Recent Activity

Donate For Us