I have the following SQL: <pre class="prettyprint"><code> IF EXISTS ( SELECT 1 FROM SomeTable T1 WHERE SomeField = 1 AND SomeOtherField = 1 AND NOT EXISTS(SELECT 1 FROM SomeOtherTable T2 WHERE T2.KeyField = T1.KeyField) ) RAISERROR ('Blech.', 16, 1) </code></pre> The <code>SomeTable</code> table has around 200,000 rows, and the <code>SomeOtherTable</code> table has about the same. If I execute the inner SQL (the <code>SELECT</code>), it executes in sub-second time, returning no rows. But, if I execute the entire script (<code>IF...RAISERROR</code>) then it takes well over an hour. Why? Now, obviously, the execution plan is different - I can see that in Enterprise Manager - but again, why? I could probably do something like <code>SELECT @num = COUNT(*) WHERE</code> ... and then <code>IF @num > 0 RAISERROR</code> but... I think that's missing the point somewhat. You can only code around a bug (and it sure looks like a bug to me) if you know that it exists. <hr> EDIT: I should mention that I already tried re-jigging the query into an OUTER JOIN as per @Bohemian's answer, but this made no difference to the execution time. <hr> EDIT 2: I've attached the query plan for the inner <code>SELECT</code> statement: <img src="https://i.stack.imgur.com/2wb2L.png" alt="Query Plan - inner SELECT statement"> ... and the query plan for the whole <code>IF...RAISERROR</code> block: <img src="https://i.stack.imgur.com/hB1ka.png" alt="Query Plan - whole IF statement"> Obviously these show the real table/field names, but apart from that the query is exactly as shown above.

It's probably because the optimizer can figure out how to turn your query into a more efficient query, but somehow the IF prevents that. Only an EXPLAIN will tell you why the query is taking so long, but I can tell you how to make this whole thing more efficient... Indtead of using a correlated subquery, which is incredibly inefficient - you get "n" subqueries run for "n" rows in the main table - use a JOIN. Try this: <pre class="prettyprint"><code>IF EXISTS ( SELECT 1 FROM SomeTable T1 LEFT JOIN SomeOtherTable T2 ON T2.KeyField = T1.KeyField WHERE SomeField = 1 AND SomeOtherField = 1 AND T2.KeyField IS NULL ) RAISERROR ('Blech.', 16, 1) </code></pre> The "trick" here is to use s LEFT JOIN and filter out all joined rows by testing for a null in the WHERE clause, which is executed after the join is made.

Why does IF (query) take over an hour, when query takes less than a second?

Tags:

sql

sql-server

I have the following SQL:

 IF EXISTS
 (
    SELECT
        1
    FROM
        SomeTable T1
    WHERE
        SomeField = 1
    AND SomeOtherField = 1
    AND NOT EXISTS(SELECT 1 FROM SomeOtherTable T2 WHERE T2.KeyField = T1.KeyField)
)
    RAISERROR ('Blech.', 16, 1)

The SomeTable table has around 200,000 rows, and the SomeOtherTable table has about the same.

If I execute the inner SQL (the SELECT), it executes in sub-second time, returning no rows. But, if I execute the entire script (IF...RAISERROR) then it takes well over an hour. Why?

Now, obviously, the execution plan is different - I can see that in Enterprise Manager - but again, why?

I could probably do something like SELECT @num = COUNT(*) WHERE ... and then IF @num > 0 RAISERROR but... I think that's missing the point somewhat. You can only code around a bug (and it sure looks like a bug to me) if you know that it exists.

EDIT:

I should mention that I already tried re-jigging the query into an OUTER JOIN as per @Bohemian's answer, but this made no difference to the execution time.

EDIT 2:

I've attached the query plan for the inner SELECT statement:

Query Plan - inner SELECT statement

... and the query plan for the whole IF...RAISERROR block:

Query Plan - whole IF statement

Obviously these show the real table/field names, but apart from that the query is exactly as shown above.

383

asked Mar 26 '13 10:03

Gary McGill

2 Answers

The IF does not magically turn off optimizations or damage the plan. The optimizer just noticed that EXISTS only needs one row at most (like a TOP 1). This is called a "row goal" and it normally happens when you do paging. But also with EXISTS, IN, NOT IN and such things.

My guess: if you write TOP 1 to the original query you get the same (bad) plan.

The optimizer tries to be smart here and only produce the first row using much cheaper operations. Unfortunately, it misestimates cardinality. It guesses that the query will produce lots of rows although in reality it produces none. If it estimated correctly you'd just get a more efficient plan, or it would not do the transformation at all.

I suggest the following steps:

fix the plan by reviewing indexes and statistics
if this didn't help, change the query to IF (SELECT COUNT(*) FROM ...) > 0 which will give the original plan because the optimizer does not have a row goal.

172

answered Oct 16 '22 18:10

usr

It's probably because the optimizer can figure out how to turn your query into a more efficient query, but somehow the IF prevents that. Only an EXPLAIN will tell you why the query is taking so long, but I can tell you how to make this whole thing more efficient... Indtead of using a correlated subquery, which is incredibly inefficient - you get "n" subqueries run for "n" rows in the main table - use a JOIN.

Try this:

IF EXISTS (
  SELECT 1
  FROM SomeTable T1
  LEFT JOIN SomeOtherTable T2 ON T2.KeyField = T1.KeyField
  WHERE SomeField = 1
  AND SomeOtherField = 1
  AND T2.KeyField IS NULL
) RAISERROR ('Blech.', 16, 1)

The "trick" here is to use s LEFT JOIN and filter out all joined rows by testing for a null in the WHERE clause, which is executed after the join is made.

answered Oct 16 '22 17:10

Bohemian

Related questions
                            
                                Get error "mismatched input 'as' expecting FROM near ')' in from clause" when run sql query Hadoop Java
                            
                                Difference between driver and provider
                            
                                Is it possible to ignore NULL values when using LAG() and LEAD() functions in SQL Server?
                            
                                How to pass a set of rows from one function into another?
                            
                                Improving on GROUP BY in SQL
                            
                                Modelling algebraic data types using relational database
                            
                                SQL - How to find a value in a tree level data structure
                            
                                Unable to use Common Table Expressions in Postgres Crosstab Query
                            
                                Get encrypted column name with their encryption key and certificate in sql server
                            
                                How to make Oracle compare JSON as JSON, not as Strings
                            
                                More than 24 hours in a day in postgreSQL
                            
                                Use an Oracle clob in a predicate created from a String > 4k
                            
                                How to select maximum 3 items per users in MySQL?
                            
                                Java: Prepare a statement without a connection
                            
                                Error 18456. State 6 "Attempting to use an NT account name with SQL Server Authentication." [closed]
                            
                                Why does a query slow down drastically if in the WHERE clause a constant is replaced by a parameter (having the same value)?
                            
                                Match a hash created in C# with sql
                            
                                Is there a good way to extend the Code-First Migrations
                            
                                Create SQL table with correct column types from CSV
                            
                                SQL Parsing library for Python [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With