When using "IF NOT EXISTS(SELECT..." in SQL Server, does it matter which columns you choose?

Tags: sql, sql-server

Quite a lot of database scripts are of the form:

IF NOT EXISTS(SELECT * FROM Countries WHERE Name = 'France')
    INSERT INTO Countries (Name) VALUES ('France');

However, I've also seen people do:

IF NOT EXISTS(SELECT CountryID FROM Countries WHERE Name = 'France')
    INSERT INTO Countries (Name) VALUES ('France');

And even:

IF NOT EXISTS(SELECT 1 FROM Countries WHERE Name = 'France')
    INSERT INTO Countries (Name) VALUES ('France');

The supposed advantage of the last one is that it's more efficient: the query doesn't actually use any of the columns in the subquery, so it might be quicker not to bring any of them back. But it looks odd, so it strikes me that it might confuse some people. And does it actually make any difference to the execution time?

asked Oct 23 '13 13:10

Paul Richards

3 Answers

I think it was back in the 6.5 - 7 period of SQL Server that they made the query optimizer smart enough to know that:

IF NOT EXISTS(SELECT * FROM Countries WHERE Name = 'France')

does not actually need to return any row data. The advice to use SELECT 1 predates that, yet lives on as a myth.
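
If you'd rather check this on your own server than take it on trust, a rough sketch like the following may help. It assumes the Countries table from the question already exists with Name and CountryID columns; the PRINT messages are only there to give each IF something to do. Compare the logical reads reported in the Messages output, and the actual execution plans, for the three variants:

```sql
-- Assumes the Countries table from the question exists (Name, CountryID columns).
SET STATISTICS IO ON;

-- Variant 1: asterisk in the select list
IF NOT EXISTS (SELECT * FROM Countries WHERE Name = 'France')
    PRINT 'France missing (SELECT *)';

-- Variant 2: a named column in the select list
IF NOT EXISTS (SELECT CountryID FROM Countries WHERE Name = 'France')
    PRINT 'France missing (SELECT CountryID)';

-- Variant 3: a constant in the select list
IF NOT EXISTS (SELECT 1 FROM Countries WHERE Name = 'France')
    PRINT 'France missing (SELECT 1)';

SET STATISTICS IO OFF;
```

If the optimizer really does treat the three forms alike, the reads and plans should match; if they differ on your version, that would be worth knowing.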

Arguably, it's a fault with the SQL standard - they ought to allow EXISTS to start with the FROM clause and not have a SELECT portion at all.

And from the SQL Server documentation topic "Subqueries with EXISTS":

The select list of a subquery introduced by EXISTS almost always consists of an asterisk (*). There is no reason to list column names because you are just testing whether rows that meet the conditions specified in the subquery exist.
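
A common way to see that the select list is not evaluated is to put an expression in it that would fail if it were. The sketch below uses a hypothetical temp table, #Countries, modelled on the table in the question; as far as I know it completes without a divide-by-zero error on current SQL Server versions, but do try it on yours:

```sql
-- Hypothetical scratch table modelled on the Countries table in the question.
CREATE TABLE #Countries (
    CountryID INT IDENTITY(1,1) PRIMARY KEY,
    Name      NVARCHAR(100) NOT NULL
);

INSERT INTO #Countries (Name) VALUES (N'France');

-- 1/0 would raise a divide-by-zero error if the select list were evaluated.
IF EXISTS (SELECT 1/0 FROM #Countries WHERE Name = N'France')
    PRINT 'Row found; the select list was not evaluated.';

DROP TABLE #Countries;
```

If the expression is never evaluated, the choice between *, a column name, and a constant is a matter of readability rather than performance.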

answered Oct 11 '22 15:10
