How to write SQL query that selects distinct pair values for specific criteria?

Tags:

I'm having trouble formulating a query for the following problem:

For pair values that have a certain score, how do you group them in way that will only return distinct pair values with the best respective scores?

For example, lets say I have a table with the following row values:

(t1,p1,65)
(t1,p2,60)
(t1,p3,20)
(t2,p1,60)
(t2,p2,59)
(t2,p3,15)

The first two columns indicate the pair values and the third column represents the pair score.The best score is (t1,p1,65). Since t1 and p1 are now used, I want to exclude them from further analysis.

The next best score is (t2,p2,59). Even though (t1,p2) has a score of 60, I want to exclude it because "t1" has already been used. (t2,p1) also has a score of 60, but since p1 is also already used, this pair is excluded.

This results in the distinct pair score values of:

(t1,p1,65)
(t2,p2,59)

Is there any way to generate this result with just a query? I've tried to think of ways of grouping and partitioning the results, but since there has to be some accounting of values already used according to score rank, I'm finding this very difficult to approach.

EDIT:

To generate the data:

with t(t, p, score) as (
    (values ('t1','p1',65),
           ('t1','p2',60),
           ('t1','p3',20),
           ('t2','p1',60),
           ('t2','p2',59),
           ('t2','p3',15)
     ))
select t.* from t;

633

asked Nov 01 '16 17:11

Stephen Tableau

1 Answers

This problem has obviously been bothering me. The following appears to implement your logic, keeping arrays of visited values in rows:

with recursive t(t, p, score) as (
    (values ('t1','p1',65),
           ('t1','p2',60),
           ('t1','p3',20),
           ('t2','p1',60),
           ('t2','p2',59),
           ('t2','p3',15)
     )),
     cte(t, p, score, cnt, lastt, lastp, ts, ps) as (
        (select t.*, count(*) over ()::int, tt.t, tt.p, ARRAY[tt.t], ARRAY[tt.p]
         from t cross join
              (select t.* from t order by score desc limit 1) tt
        ) 
        union all
        select t, p, score, 
               sum(case when not (ts @> ARRAY[t] or ps @> ARRAY[p]) then 1 else 0 end) over ()::int,
               first_value(t) over (order by case when not (ts @> ARRAY[t] or ps @> ARRAY[p]) then score end desc nulls last),
               first_value(p) over (order by case when not (ts @> ARRAY[t] or ps @> ARRAY[p]) then score end desc nulls last),
               ts || first_value(t) over (order by case when not (ts @> ARRAY[t] or ps @> ARRAY[p]) then score end desc nulls last),
               ps || first_value(p) over (order by case when not (ts @> ARRAY[t] or ps @> ARRAY[p]) then score end desc nulls last)
        from cte 
        where cnt > 0
       )
 select *
 from cte
 where lastt = t and lastp = p and cnt > 0;

112

answered Sep 19 '22 15:09

Gordon Linoff

Related questions
                            
                                Mysql deadlock explanation needed
                            
                                Representing ecommerce products and variations cleanly in the database
                            
                                Inverse of COALESCE
                            
                                How to combine two rows and calculate the time difference between two timestamp values in MySQL?
                            
                                SQL query on multiple databases
                            
                                Is it better to maintain a separate count table vs running count query every time?
                            
                                Oracle: Full text search with condition
                            
                                Is there a way to turn off implicit type conversion in SQL Server?
                            
                                Optimizing MySQL query with multiple left joins
                            
                                How do I extend this query to find valid combinations of three items?
                            
                                Automatically generate a user defined table type that matches an existing table
                            
                                Simplified/shorthand SQL Data Definition Languages?
                            
                                SQLite Bracket "don't work"
                            
                                SQL query for join table and multiple values
                            
                                How to query a join table so that multiple criteria are met?
                            
                                How to handle Python multiprocessing database concurrency, specifically with django?
                            
                                Correct SQL index for Partition + Order to remove SORT
                            
                                SQL Azure Geo Replication for non-redunancy purposes
                            
                                Merge multiple rows with same ID into one row
                            
                                What is the difference between Joining two different DB Context using ToList() and .AsQueryable()?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to write SQL query that selects distinct pair values for specific criteria?

Tags:

sql

postgresql

group-by

data-partitioning

Stephen Tableau

People also ask

1 Answers

Gordon Linoff

Recent Activity

Donate For Us