Will this query load the whole table in memory?

Tags: sql, sql-server, postgresql

If I have a really big table, will this query load the whole table into memory before it filters the results?

with parent as
(
    select * from a101
)
select * from parent 
where value1 = 159

As you can see, the parent query references the whole table. Will it all be loaded into memory? This is a very simplified version of the query; the real one has a few joins to other tables. I am evaluating SQL Server 2012 and PostgreSQL.

asked Mar 25 '14 05:03

Luke101

2 Answers

In PostgreSQL (true as of 9.4, at least) CTEs act as optimisation fences.

The query optimiser will not flatten CTE terms into the outer query, push down qualifiers, or pull up qualifiers, even in trivial cases. So an unqualified SELECT inside a CTE term will always do a full table scan (or an index-only scan if there's a suitable index).

Thus, in PostgreSQL, these two things are very different indeed, as a simple EXPLAIN would show:

with parent as
(
    select * from a101
)
select * from parent 
where value1 = 159

and

SELECT *
FROM 
(
   SELECT * FROM a101
) AS parent
WHERE value1 = 159;
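
To compare the two forms yourself, prefix each with EXPLAIN. This is only a sketch, assuming the a101 table from the question with an index on value1. The plan you get depends on your PostgreSQL version and table statistics, so no output is shown here.

```sql
-- CTE form: on versions where the CTE is an optimisation fence,
-- expect a "CTE Scan" on parent sitting on top of a scan of all of a101.
EXPLAIN
WITH parent AS (
    SELECT * FROM a101
)
SELECT * FROM parent
WHERE value1 = 159;

-- Subquery form: the planner can flatten the subquery and push the
-- filter down, so an index on value1 becomes usable.
EXPLAIN
SELECT *
FROM (
    SELECT * FROM a101
) AS parent
WHERE value1 = 159;
```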

However, that "will scan the whole table" doesn't necessarily mean "will load the whole table in memory". PostgreSQL will use a TupleStore, which will transparently spill to a tempfile on disk as it gets larger.
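
One way to check whether the CTE result really spilled to disk is EXPLAIN (ANALYZE, BUFFERS), which executes the query and reports temp block reads and writes when there are any. The sketch below assumes the a101 table from the question. As far as I know, the in-memory limit for the tuplestore follows the work_mem setting, so treat that part as an assumption to verify on your own server.

```sql
-- Show the current per-operation memory budget.
SHOW work_mem;

-- Run the query for real and report buffer usage. If the CTE result did
-- not fit in memory, the Buffers lines should include temp read/written counts.
-- Note: on PostgreSQL 12 or later a simple CTE like this may be inlined,
-- in which case no tuplestore is built at all.
EXPLAIN (ANALYZE, BUFFERS)
WITH parent AS (
    SELECT * FROM a101
)
SELECT * FROM parent
WHERE value1 = 159;
```

Because ANALYZE actually runs the statement, try it on a test copy before pointing it at a very large production table.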

The original justification was that DML in CTE terms was planned (and later implemented). If there's DML in a CTE term it's vital that its execution be predictable and complete. This may also be true if the CTE calls data-modifying functions.

Unfortunately, nobody seems to have thought "... but what if it's just a SELECT and we want to inline it?".

Many in the community appear to see this as a feature and regularly promulgate it as a workaround for optimiser issues. I find this attitude utterly perplexing. As a result, it's going to be really hard to fix this later, because people are intentionally using CTEs when they want to prevent the optimiser from altering a query.

In other words, PostgreSQL abuses CTEs as pseudo-query-hints (along with the OFFSET 0 hack), because project policy says real query hints aren't desired or supported.
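
For illustration, this is roughly what the OFFSET 0 hack looks like, followed by the explicit keywords that PostgreSQL 12 and later provide. From version 12 on, a non-recursive, side-effect-free CTE that is referenced only once is inlined by default, and MATERIALIZED / NOT MATERIALIZED let you choose the behaviour per CTE. A sketch using the a101 table from the question; the last two statements are a syntax error on versions before 12.

```sql
-- OFFSET 0 hack: a subquery with OFFSET is not flattened, so the
-- outer filter is not pushed down into it (acts as a fence).
SELECT *
FROM (
    SELECT * FROM a101 OFFSET 0
) AS parent
WHERE value1 = 159;

-- PostgreSQL 12+: ask for the old fence behaviour explicitly.
WITH parent AS MATERIALIZED (
    SELECT * FROM a101
)
SELECT * FROM parent
WHERE value1 = 159;

-- PostgreSQL 12+: explicitly allow inlining, so the filter can reach
-- the index on value1.
WITH parent AS NOT MATERIALIZED (
    SELECT * FROM a101
)
SELECT * FROM parent
WHERE value1 = 159;
```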

AFAIK MS SQL Server may optimise across CTE boundaries, but may also choose to materialise the result set.

answered Sep 16 '22 23:09

Craig Ringer

I just ran EXPLAIN on this query in PostgreSQL. Surprisingly, it does a sequential scan instead of an index lookup:

 CTE Scan on parent  (cost=123.30..132.97 rows=2 width=1711)
   Filter: (value1 = 159)
   CTE parent
     ->  Seq Scan on a101  (cost=0.00..123.30 rows=430 width=2060)

I have a primary key index on value1, and it is used for the simple query select * from a101 where value1 = 159.

So the answer is that it will scan the whole table. I was surprised: I thought it would work like a view or subquery, but it does not. To get the index used, write it as a subquery instead:

select * from (select * from a101) parent 
where value1 = 159

answered Sep 19 '22 23:09

Suor
