How do I efficiently select the previous non-null value?

3 Answers

I found this answer for SQL Server that also works in Postgres. I had never seen the technique before and thought it was quite clever. It builds a custom partition for the window function: a nested query wraps a CASE expression in a running sum that increments when the value is not null and stays unchanged otherwise. Every run of nulls therefore gets the same partition number as the non-null value that precedes it. Here's the query:

SELECT
  id,
  value,
  value_partition,
  first_value(value) over (partition by value_partition order by id)
FROM (
  SELECT
    id,
    value,
    -- running count of non-null values; null rows keep the count of the last non-null row
    sum(case when value is null then 0 else 1 end) over (order by id) as value_partition
  FROM p
) as q
ORDER BY id

And the results:

 id | value | value_partition | first_value
----+-------+-----------------+-------------
  1 |   100 |               1 |         100
  2 |       |               1 |         100
  3 |       |               1 |         100
  4 |       |               1 |         100
  5 |       |               1 |         100
  6 |       |               1 |         100
  7 |       |               1 |         100
  8 |   200 |               2 |         200
  9 |       |               2 |         200
(9 rows)
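
To try this yourself, a minimal setup could look like the sketch below. The column types and the sample rows are assumptions reconstructed from the results above; the table definition is not shown in the original post.

```sql
-- Hypothetical table definition; the types are assumed from the output above.
CREATE TABLE p (
    id    integer PRIMARY KEY,
    value integer
);

-- Sample rows matching the results above (NULL wherever value is blank).
INSERT INTO p (id, value) VALUES
    (1, 100), (2, NULL), (3, NULL), (4, NULL), (5, NULL),
    (6, NULL), (7, NULL), (8, 200), (9, NULL);
```

One thing to keep in mind: rows that come before the first non-null value all land in partition 0, so first_value returns NULL for them.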


answered Oct 11 '22 10:10

adamlamar

You can create a custom aggregate function in Postgres. Here's an example for the int type:

CREATE FUNCTION coalesce_agg_sfunc(state int, value int) RETURNS int AS
$$
    -- Take the new value if it is not null, otherwise carry the previous state forward.
    SELECT coalesce(value, state);
$$ LANGUAGE SQL;

CREATE AGGREGATE coalesce_agg(int) (
    SFUNC = coalesce_agg_sfunc,
    STYPE = int
);

Then query as usual.

SELECT *, coalesce_agg(b) over w, sum(b) over w FROM y
  WINDOW w AS (ORDER BY a);

 a | b | coalesce_agg | sum
---+---+--------------+-----
 a | 0 |            0 |   0
 b | ∅ |            0 |   0
 c | 2 |            2 |   2
 d | 3 |            3 |   5
 e | ∅ |            3 |   5
 f | 5 |            5 |  10
(6 rows)
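
Since the example above is tied to int, here is a sketch of a polymorphic variant that should work for any column type. The names coalesce_any_sfunc and coalesce_agg_any are hypothetical, chosen only to avoid clashing with the int version; treat this as an illustration rather than a tested drop-in.

```sql
-- Hypothetical polymorphic state function: same logic as the int version,
-- but declared with anyelement so it is not tied to one type.
CREATE FUNCTION coalesce_any_sfunc(state anyelement, value anyelement)
RETURNS anyelement AS
$$
    SELECT coalesce(value, state);
$$ LANGUAGE SQL IMMUTABLE;

-- Hypothetical aggregate name; the state type follows the input type.
CREATE AGGREGATE coalesce_agg_any(anyelement) (
    SFUNC = coalesce_any_sfunc,
    STYPE = anyelement
);

-- Used the same way as coalesce_agg in the query above.
SELECT *, coalesce_agg_any(b) OVER (ORDER BY a) FROM y;
```

The state function is deliberately not declared STRICT. A strict function would never be called with a null input, and here the null rows are exactly the ones that need to pass the previous state through.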

answered Oct 11 '22 11:10

Slobodan Pejic

Well, I can't guarantee this is the most efficient way, but it works:

SELECT id, value, (
    SELECT p2.value
    FROM p p2
    WHERE p2.value IS NOT NULL AND p2.id <= p1.id
    ORDER BY p2.id DESC
    LIMIT 1
) AS new_value
FROM p p1 ORDER BY id;

The following index can improve the sub-query for large datasets:

CREATE INDEX idx_p_idvalue_nonnull ON p (id, value) WHERE value IS NOT NULL;

Assuming the value column is sparse (i.e. there are a lot of nulls), the partial index stays small and the query will run fine.
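
To check whether the planner actually picks up the partial index, one option is to look at the execution plan. This is only a sketch: it assumes the index above has been created, and on a small table the planner may still prefer a sequential scan.

```sql
-- Refresh planner statistics after creating the index.
ANALYZE p;

-- EXPLAIN ANALYZE runs the query and reports the plan that was actually used.
EXPLAIN (ANALYZE, BUFFERS)
SELECT id, value, (
    SELECT p2.value
    FROM p p2
    WHERE p2.value IS NOT NULL AND p2.id <= p1.id
    ORDER BY p2.id DESC
    LIMIT 1
) AS new_value
FROM p p1 ORDER BY id;
```

If the index is being used, the sub-plan should mention idx_p_idvalue_nonnull, typically as a backward index scan.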

answered Oct 11 '22 11:10

MatheusOl

Tags:

postgresql
