I'm working with Teradata conversion to Hive (version 0.10.0). Teradata Query : <pre class="prettyprint"><code>QUALIFY ROW_NUMBER() OVER (PARTITION BY ADJSTMNT,SRC_CMN , TYPE_CMD,IOD_TYPE_CD,ROE_PST ,ORDR_SYC,SOR_CD,PROS_ED ORDER BY ADJSTMNT )=1 </code></pre> I did my search and found UDF for Row_Sequence in hive. I also replaced Over Partition with Distribute All and sort By. But I am stuck with QUALIFY. Any ideas to convert the above to hive are really appreciated and will help us a lot.

a QUALIFY with analytics function (ROW_NUMBER(), SUM(), COUNT(), ... over (partition by ...)) is just a WHERE on a subquery containing the analytics value. eg: <pre class="prettyprint"><code>select A,B,C from X QUALIFY ROW_NUMBER() over (...) = 1 </code></pre> is equivalent to : <pre class="prettyprint"><code>select A,B,C from ( select A,B,C, ROW_NUMBER() over (...) as RNUM from X ) t where RNUM = 1 </code></pre> NB: analytics function are available in Hive 0.12

Using QUALIFY Row_Number in hive

Tags:

sql

window-functions

hive

I'm working with Teradata conversion to Hive (version 0.10.0).

Teradata Query :

QUALIFY ROW_NUMBER() OVER (PARTITION BY ADJSTMNT,SRC_CMN , TYPE_CMD,IOD_TYPE_CD,ROE_PST ,ORDR_SYC,SOR_CD,PROS_ED ORDER BY ADJSTMNT )=1

I did my search and found UDF for Row_Sequence in hive. I also replaced Over Partition with Distribute All and sort By. But I am stuck with QUALIFY.

Any ideas to convert the above to hive are really appreciated and will help us a lot.

773

asked Jul 09 '13 04:07

dyuti

1 Answers

a QUALIFY with analytics function (ROW_NUMBER(), SUM(), COUNT(), ... over (partition by ...)) is just a WHERE on a subquery containing the analytics value.

eg:

select A,B,C
from X 
QUALIFY  ROW_NUMBER() over (...) = 1

is equivalent to :

select A,B,C
from (
   select A,B,C, ROW_NUMBER() over (...) as RNUM
   from X
) t
where RNUM = 1

NB: analytics function are available in Hive 0.12

answered Sep 29 '22 15:09

R. Chevallier

Related questions
                            
                                Use result of query in a function (postgres 8.3)
                            
                                Collapse multiple rows having contiguous timestamps
                            
                                Is a REPLACE INTO query good practice?
                            
                                Do the results of a SQL query explain depend on the size of the database?
                            
                                SQL Server Query to group sequential dates
                            
                                Update column based on matching values in other table in mysql
                            
                                Select Query for the fixed length
                            
                                Determine if an sql query modifies the database
                            
                                JPA - updating an embedded entity generates invalid SQL
                            
                                Not so simple SQL queries
                            
                                MySQL - Referencing aggregate column in where clause
                            
                                Improve PostgreSQL query performance
                            
                                Updating rows in Liquibase with a complex WHERE statement
                            
                                Ruby NoMethodError - undefined method `blah_url' for BlahController
                            
                                convert int to varchar working for SQL Server and MS-Access
                            
                                Nested if else statement
                            
                                How to structure a database with multiple join tables
                            
                                Bulk insert into partitioned table and table level lock
                            
                                How will this affect the data using Loops with rollback transaction
                            
                                Entity Framework with optional parameters?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With