How can I force a subquery to perform as well as a #temp table?

I am reiterating the question asked by Mongus Pong, "Why would using a temp table be faster than a nested query?", which doesn't have an answer that works for me.

Most of us at some point find that when a nested query reaches a certain complexity it needs to be broken into temp tables to keep it performant. It is absurd that this could ever be the most practical way forward, and it means these processes can no longer be made into a view. Often 3rd party BI apps will only play nicely with views, so this is crucial.

I am convinced there must be a simple query plan setting to make the engine just spool each subquery in turn, working from the inside out. No second-guessing how it can make the subquery more selective (which it sometimes does very successfully) and no possibility of correlated subqueries. Just the stack of data the programmer intended to be returned by the self-contained code between the brackets.

It is common for me to find that simply changing from a subquery to a #table takes the time from 120 seconds to 5. Essentially the optimiser is making a major mistake somewhere. Sure, there may be very time-consuming ways I could coax the optimiser to look at tables in the right order, but even this offers no guarantees. I'm not asking for the ideal 2 second execute time here, just the speed that temp tabling offers me within the flexibility of a view.

I've never posted on here before, but I have been writing SQL for years and have read the comments of other experienced people who've also just come to accept this problem. Now I would just like the appropriate genius to step forward and say the special hint is X...

asked Sep 12 '13 13:09

Adamantish

2 Answers

There are a few possible explanations as to why you see this behavior. Some common ones are:

1. The subquery or CTE may be being repeatedly re-evaluated.
2. Materialising partial results into a #temp table may force a more optimal join order for that part of the plan by removing some possible options from the equation.
3. Materialising partial results into a #temp table may improve the rest of the plan by correcting poor cardinality estimates.

The most reliable method is simply to use a #temp table and materialize it yourself.
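As a rough sketch of what materializing it yourself looks like, assuming hypothetical tables dbo.Orders(CustomerID, OrderDate, Amount) and dbo.Customers(CustomerID, Region) that are not from the question:

```sql
-- Hypothetical schema (an assumption for illustration only):
--   dbo.Orders(CustomerID, OrderDate, Amount)
--   dbo.Customers(CustomerID, Region)

-- Step 1: evaluate the expensive intermediate result exactly once.
SELECT o.CustomerID,
       SUM(o.Amount) AS TotalAmount
INTO   #CustomerTotals
FROM   dbo.Orders AS o
WHERE  o.OrderDate >= '20130101'
GROUP BY o.CustomerID;

-- Optional: an index on the join column gives the next step a useful access path.
CREATE CLUSTERED INDEX IX_CustomerTotals ON #CustomerTotals (CustomerID);

-- Step 2: the outer query is compiled against a real table
-- with a known row count and statistics.
SELECT c.Region,
       SUM(t.TotalAmount) AS RegionTotal
FROM   #CustomerTotals AS t
JOIN   dbo.Customers   AS c ON c.CustomerID = t.CustomerID
GROUP BY c.Region;

DROP TABLE #CustomerTotals;
```

The second statement is optimized separately from the first, which is why the join order and cardinality problems from points 2 and 3 tend to disappear.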

Failing that, regarding point 1, see "Provide a hint to force intermediate materialization of CTEs or derived tables". The use of TOP(large_number) ... ORDER BY can often encourage the result to be spooled rather than repeatedly re-evaluated.

Even if that works, however, there are no statistics on the spool.
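For illustration, the TOP(large_number) ... ORDER BY pattern might look like the following. The tables dbo.Orders and dbo.Customers and their columns are hypothetical, and whether the optimizer actually spools the derived table is not guaranteed, so check the plan:

```sql
-- Hypothetical schema (an assumption for illustration only):
--   dbo.Orders(CustomerID, Amount)
--   dbo.Customers(CustomerID, Region)
SELECT c.Region,
       SUM(t.TotalAmount) AS RegionTotal
FROM   dbo.Customers AS c
JOIN  (
        -- TOP with a very large row count plus ORDER BY returns every row,
        -- but can encourage the engine to evaluate this block once.
        SELECT TOP (2147483647)
               o.CustomerID,
               SUM(o.Amount) AS TotalAmount
        FROM   dbo.Orders AS o
        GROUP BY o.CustomerID
        ORDER BY o.CustomerID
      ) AS t ON t.CustomerID = c.CustomerID
GROUP BY c.Region;
```

Unlike the #temp table, this stays a single statement, so it can still live inside a view, but it is a nudge rather than an instruction.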

For points 2 and 3 you would need to analyse why you weren't getting the desired plan. Possibly rewriting the query to use sargable predicates, or updating statistics might get a better plan. Failing that you could try using query hints to get the desired plan.
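A small sketch of the first two suggestions, again using a hypothetical dbo.Orders(OrderID, OrderDate) table that is an assumption and not part of the question:

```sql
-- Non-sargable: wrapping the column in a function generally
-- prevents an index seek on OrderDate and can hurt row estimates.
SELECT o.OrderID
FROM   dbo.Orders AS o
WHERE  YEAR(o.OrderDate) = 2013;

-- Sargable rewrite: the bare column is compared against a range,
-- so an index on OrderDate can be used for a seek.
SELECT o.OrderID
FROM   dbo.Orders AS o
WHERE  o.OrderDate >= '20130101'
  AND  o.OrderDate <  '20140101';

-- Refresh statistics so cardinality estimates reflect the current data.
UPDATE STATISTICS dbo.Orders WITH FULLSCAN;
```

Comparing estimated and actual row counts in the execution plan before and after is the usual way to tell whether either change helped.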

answered Sep 25 '22 00:09

Martin Smith

I do not believe there is a query hint that instructs the engine to spool each subquery in turn.

There is the OPTION (FORCE ORDER) query hint, which forces the engine to perform the JOINs in the order specified and could potentially coax it into achieving that result in some instances. This hint will sometimes result in a more efficient plan for a complex query when the engine keeps insisting on a sub-optimal plan. Of course, the optimizer should usually be trusted to determine the best plan.
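As a minimal sketch of the hint's placement, assuming hypothetical tables dbo.Customers(CustomerID, Region) and dbo.Orders(CustomerID, OrderDate, Amount):

```sql
-- Hypothetical schema (an assumption for illustration only).
-- With FORCE ORDER the joins are performed in the order written:
-- Customers first, then Orders.
SELECT c.Region,
       SUM(o.Amount) AS RegionTotal
FROM   dbo.Customers AS c
JOIN   dbo.Orders    AS o ON o.CustomerID = c.CustomerID
WHERE  o.OrderDate >= '20130101'
GROUP BY c.Region
OPTION (FORCE ORDER);
```

The hint applies to the whole statement, so the order in which the tables are written in the FROM clause becomes part of the query's performance characteristics.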

Ideally there would be a query hint that would allow you to designate a CTE or subquery as "materialized" or "anonymous temp table", but there is not.

answered Sep 24 '22 00:09

Dan Bellandi

Tags: performance, optimization, sql-server, sql-server-2008, query-optimization
