Limiting the number of rows in subqueries with Teradata

Tags:

I'm new to Teradata and I'm facing a problem I didn't have with the previous database I used. Basically, I'm trying to reduce the number of rows returned in subqueries inside a where clause. I had no problem doing this previously with the ROWNUM function.

My previous query was something like:

SELECT * FROM myTable
WHERE field1 = 'foo' AND field2 in(
    SELECT field2 FROM anotherTable
    WHERE field3 = 'bar' AND ROWNUM<100);

Since I can't use ROWNUM in TD, I've looked for equivalent functions or at least functions that would get me where I wanted even if they were'nt exactly equivalent. I found and tried : ROW_NUMBER, TOP and SAMPLE.

I tried ROW_NUMBER() but Teradata doesn't allow analytic functions in WHERE clauses. I tried TOP N but this option is not supported in a subquery. I tried SAMPLE N but it is not supported in subqueries either.

So... I have to admit I'm a bit stuck right now and was wondering if there was any solution that would allow me to limit the number of rows returned in a subquery using Teradata and that would be pretty similar to what I did up to now? Also, if there aren't any, how would it be possible to build the query differently to use it appropriately with Teradata?

Thanks!

809

asked Aug 06 '14 16:08

Charles

1 Answers

The limited usage of SAMPLE or TOP in a subquery is probably because this might be a Correlated Subquery.

But there are two workarounds.

Put SAMPLE or TOP in a Derived Table within the subquery (so this can't be correlated anymore):

SELECT * FROM myTable
WHERE field1 = 'foo'
AND field2 IN (
     SELECT * FROM
       ( SELECT field2 FROM anotherTable -- or TOP 100
         WHERE field3 = 'bar'  SAMPLE 100
       ) AS dt
    );

Or rewrite it as a join to a Derived Table:

SELECT * FROM myTable
JOIN ( SELECT DISTNCT field2 FROM anotherTable -- can't use TOP if you need DISTINCT 
         WHERE field3 = 'bar' SAMPLE 100
       ) AS dt
WHERE field1 = 'foo'
AND myTable.field2 = dt.field1;

TOP without ORDER BY is quite similar to ROWNUM. It's not random at all, but running it a 2nd time might still return a different result set.

SAMPLE is truly random, every time returning a different result.

ROW_NUMBER is also possible using QUALIFY instead of WHERE, but OLAP functions always need some ORDER BY, so is much more overhead.

162

answered Oct 29 '22 16:10

dnoeth

Related questions
                            
                                Date_trunc by month? Postgresql
                            
                                Disable DELETE for a table in SQL Server
                            
                                Right join with a where clause
                            
                                Postgres function returning one record while I have many records?
                            
                                Show gaps between dates in MySQL
                            
                                How to UPDATE a column of all duplicate records in MySQL?
                            
                                Query to get the data in related table
                            
                                Casting variables to integers in SQL queries in PHP
                            
                                PYODBC does not like %, "The SQL contains 2 parameter markers, but 1 parameters were supplied."
                            
                                sql server rewrites my query incorrectly?
                            
                                Accessing another password protected database in an SQL query in Access 97
                            
                                How to check if a table exists in jOOQ?
                            
                                How to set a point as default value for a geography column?
                            
                                how to execute LIKE query in sqlalchemy?
                            
                                Interpreting HASH JOIN in Oracle query plan
                            
                                Performance difference between NOT Exists and LEFT JOIN IN SQL Server
                            
                                SQL IF SELECT query is null then do another query
                            
                                Procedurally transform subquery into join
                            
                                WHERE_IN query with a composite key?
                            
                                distinct vs group by which is better

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Limiting the number of rows in subqueries with Teradata

Tags:

sql

subquery

limit

teradata

Charles

People also ask

1 Answers

dnoeth

Recent Activity

Donate For Us