LAG functions and NULLS

Tags:

How can I tell the LAG function to get the last "not null" value?

For example, see my table bellow where I have a few NULL values on column B and C. I'd like to fill the nulls with the last non-null value. I tried to do that by using the LAG function, like so:

Click to copy

case when B is null then lag (B) over (order by idx) else B end as B,

but that doesn't quite work when I have two or more nulls in a row (see the NULL value on column C row 3 - I'd like it to be 0.50 as the original).

Any idea how can I achieve that? (it doesn't have to be using the LAG function, any other ideas are welcome)

A few assumptions:

The number of rows is dynamic;
The first value will always be non-null;
Once I have a NULL, is NULL all up to the end - so I want to fill it with the latest value.

Thanks

enter image description here

354

asked Apr 25 '16 10:04

2 Answers

You can do it with outer apply operator:

Click to copy

select t.id,
       t1.colA,
       t2.colB,
       t3.colC 
from table t
outer apply(select top 1 colA from table where id <= t.id and colA is not null order by id desc) t1
outer apply(select top 1 colB from table where id <= t.id and colB is not null order by id desc) t2
outer apply(select top 1 colC from table where id <= t.id and colC is not null order by id desc) t3;

This will work, regardless of the number of nulls or null "islands". You may have values, then nulls, then again values, again nulls. It will still work.

If, however the assumption (in your question) holds:

Once I have a NULL, is NULL all up to the end - so I want to fill it with the latest value.

there is a more efficient solution. We only need to find the latest (when ordered by idx) values. Modifying the above query, removing the where id <= t.id from the subqueries:

Click to copy

select t.id,
       colA = coalesce(t.colA, t1.colA),
       colB = coalesce(t.colB, t2.colB),
       colC = coalesce(t.colC, t3.colC) 
from table t
outer apply (select top 1 colA from table 
             where colA is not null order by id desc) t1
outer apply (select top 1 colB from table 
             where colB is not null order by id desc) t2
outer apply (select top 1 colC from table 
             where colC is not null order by id desc) t3;

answered Oct 01 '22 09:10

Giorgi Nakeuri

You could make a change to your ORDER BY, to force the NULLs to be first in your ordering, but that may be expensive...

Click to copy

lag(B) over (order by CASE WHEN B IS NULL THEN -1 ELSE idx END)

Or, use a sub-query to calculate the replacement value once. Possibly less expensive on larger sets, but very clunky.
- Relies on all the NULLs coming at the end
- The LAG doesn't rely on that

Click to copy

COALESCE(
    B,
    (
        SELECT
            sorted_not_null.B
        FROM
        (
            SELECT
                table.B,
                ROW_NUMBER() OVER (ORDER BY table.idx DESC)   AS row_id
            FROM
                table
            WHERE
                table.B IS NOT NULL
        )
           sorted_not_null
        WHERE
           sorted_not_null.row_id = 1
    )
)

(This should be faster on larger data-sets, than LAG or using OUTER APPLY with correlated sub-queries, simply because the value is calculated once. For tidiness, you could calculate and store the [last_known_value] for each column in variables, then just use COALESCE(A, @last_known_A), COALESCE(B, @last_known_B), etc)

answered Oct 01 '22 08:10

MatBailie

Related questions
                            
                                CASE expression with NULL value
                            
                                Netezza SQL convert VARCHAR to binary string
                            
                                Rails - Distinct ON after a join
                            
                                BigQuery: Computing aggregate over window of time for each person
                            
                                How to convert number to words - ORACLE
                            
                                Check alphabets in SQL Server
                            
                                MERGE - Multiple WHEN MATCHED cases with update
                            
                                Incorrect string value errors in AES_ENCRYPT/AES_DECRYPT
                            
                                SQL select where column begins with Letters
                            
                                Upsert in Postgres 9.5
                            
                                Configuring postgresql driver through Spring xml datasource
                            
                                MySQL very slow query
                            
                                If-else statement in DB2/400
                            
                                SQL iPython Magic Extension won't load
                            
                                SQL CTE vs Temp Table
                            
                                BigQuery COALESCE/IFNULL type mismatch with literals
                            
                                Inserting datetime with milliseconds into SQL Server table issue
                            
                                Select multiple rows from MySQL tables for 1 user
                            
                                Count of unique values in a rolling date range for R
                            
                                Postgres GROUP BY, then sort

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

LAG functions and NULLS

Tags:

sql

sql-server

tsql

sql-server-2016

Diego

People also ask

2 Answers

Giorgi Nakeuri

MatBailie

Recent Activity

Donate For Us