I have a problem where I need to find records that either have a measurement that matches a value, or do not have that measurement at all. I solved that problem with three or four different approaches, using JOIN
s, using NOT IN
and using NOT EXISTS
. However, the query ended up being extremely slow every time. I then tried splitting the query in two, and they both run very fast (three seconds). But combining the queries using OR
takes more than five minutes.
Reading on SO I tried UNION
, which is very fast, but very inconvenient for the script I am using.
So two questions:
UNION
so much faster? (Or why is OR
so slow)?MSSQL
to use a different approach for the OR
statement that is fast?The reason is that using OR
in a query will often cause the Query Optimizer to abandon use of index seeks and revert to scans. If you look at the execution plans for your two queries, you'll most likely see scans where you are using the OR
and seeks where you are using the UNION
. Without seeing your query it's not really possible to give you any ideas on how you might be able to restructure the OR
condition. But you may find that inserting the rows into a temporary table and joining on to it may yield a positive result.
Also, it is generally best to use UNION ALL
rather than UNION
if you want all results, as you remove the cost of row-matching.
There is currently no way in SQL Server to force a UNION
execution plan if no UNION
statement was used. If the only difference between the two parts is the WHERE
clause, create a view with the complex query. The UNION
query then becomes very simple:
SELECT * FROM dbo.MyView WHERE <cond1>
UNION ALL
SELECT * FROM dbo.MyView WHERE <cond2>
It is important to use UNION ALL
in this context when ever possible. If you just use UNION
SQL Server has to filter out duplicate rows, which requires an expensive sort operation in most cases.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With