SQL Server - why is scanning done twice for the same table?

Tags:

Does anyone know why sql server chooses to query the table 'building' twice? Is there any explanation? Can it be done with only one table seek?

This is the code sample:

DECLARE @id1stBuild INT = 1
    ,@number1stBuild INT = 2
    ,@idLastBuild INT = 5
    ,@numberLastBuild INT = 1;
DECLARE @nr TABLE (nr INT);

INSERT @nr
VALUES (1),(2),(3),(4),(5),(6),(7),(8),(9),(10);

CREATE TABLE building (
    id INT PRIMARY KEY identity(1, 1)
    ,number INT NOT NULL
    ,idStreet INT NOT NULL
    ,surface INT NOT NULL
    )

INSERT INTO building (number,idStreet,surface)
SELECT bl.b
    ,n.nr
    ,abs(convert(BIGINT, convert(VARBINARY, NEWID()))) % 500
FROM (
    SELECT ROW_NUMBER() OVER (ORDER BY n1.nr) b
    FROM @nr n1
    CROSS JOIN @nr n2
    CROSS JOIN @nr n3
    ) bl
CROSS JOIN @nr n

--***** execution plan for the select below
SELECT *
FROM building b
WHERE b.id = @id1stBuild
    AND b.number = @number1stBuild
    OR b.id = @idLastBuild
    AND b.number = @numberLastBuild

DROP TABLE building

The execution plan for this is always the same: Two Clustered Index Seek unified through Merge Join (Concatenation). The rest is less important. Here is the execution plan:

enter image description here

708

asked Jan 19 '15 11:01

Emarian

Video Answer

3 Answers

It's not scanning twice. It is seeking twice.

Your query is semantically the same as the below.

SELECT *
FROM   building b
WHERE  b.id = @id1stBuild
       AND b.number = @number1stBuild
UNION
SELECT *
FROM   building b
WHERE  b.id = @idLastBuild
       AND b.number = @numberLastBuild

And the execution plan performs two seeks and unions the result.

184

answered Oct 13 '22 01:10

Martin Smith

why is scanning done twice for the same table?

Is not a scan, is a seek, and that makes all the difference.

Implementing OR as a UNION, and then implementing the UNION via a MERGE JOIN. Is called a 'merge union':

Merge union

Now let’s change the query slightly:
select a from T where b = 1 or c = 3

  |--Stream Aggregate(GROUP BY:([T].[a]))
   |--Merge Join(Concatenation)
        |--Index Seek(OBJECT:([T].[Tb]), SEEK:([T].[b]=(1)) ORDERED FORWARD)
        |--Index Seek(OBJECT:([T].[Tc]), SEEK:([T].[c]=(3)) ORDERED FORWARD)
Instead of the concatenation and sort distinct operators, we now have a merge join (concatenation) and a stream aggregate. What happened? The merge join (concatenation) or “merge union” is not really a join at all. It is implemented by the same iterator as the merge join, but it really performs a union all while preserving the order of the input rows. Finally, we use the stream aggregate to eliminate duplicates. (See this post for more about using stream aggregate to eliminate duplicates.) This plan is generally a better choice since the sort distinct uses memory and could spill data to disk if it runs out of memory while the stream aggregate does not use memory.

answered Oct 13 '22 02:10

Remus Rusanu

You can try the following, which gives only one seek and a slight performance improvement. As @Martin_Smith says what you have coded is the equivalent of a Union

SELECT *
FROM building b
WHERE b.id IN (@id1stBuild , @idLastBuild) 
    AND 
        (
            (b.id = @id1stBuild AND b.number = @number1stBuild) OR 
            (b.id = @idLastBuild AND b.number = @numberLastBuild)
        )

answered Oct 13 '22 02:10

Steve Ford

Related questions
                            
                                One select for multiple records by composite key
                            
                                Out of memory exception in SQL Server 2012
                            
                                How to use the result of a select statement as a column of another select statement?
                            
                                Django - Filter a queryset by Max(date) year
                            
                                Entity Framework: Get all rows from the table for the ids in list [duplicate]
                            
                                MySQL - Exclude all rows from one table if match on another table
                            
                                Bigquery SQL for sliding window aggregate
                            
                                Does the order of natural joins matter
                            
                                Get count query results with ignoring the LIMIT statement
                            
                                No mapping exists from ObjectParameter to a known managed provider native type
                            
                                How to convert empty string to null in SQLite
                            
                                Error: Make sure the Cursor is initialized correctly before accessing data from it?
                            
                                H2 and MySQL compatibility issues
                            
                                Count number of NULL values in each column in SQL
                            
                                SQL full join without any conditions
                            
                                PHP PDO - Using MySQL Variables
                            
                                Combining multiple rows in postgreSQL into one row?
                            
                                Web API Returning Nested JSON Values
                            
                                Identify primary key candidates through SQL code
                            
                                Retrieving specific key-values from a query and fetch count of their pair in query

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SQL Server - why is scanning done twice for the same table?

Tags:

sql

sql-server

sql-execution-plan