Postgresql : How do I select top n percent(%) entries from each group/category

Tags:

We are new to postgres, we have following query by which we can select top N records from each category.

 create table temp (
     gp char,
     val int
 );

 insert into temp values ('A',10);
 insert into temp values ('A',8);
 insert into temp values ('A',6);
 insert into temp values ('A',4);
 insert into temp values ('B',3);
 insert into temp values ('B',2);
 insert into temp values ('B',1);

 select a.gp,a.val
 from   temp a
 where  a.val in (
              select b.val
              from   temp b
              where  a.gp=b.gp
              order by b.val desc
             limit 2);

Output of above query is something like this

 gp   val
 ----------
 A    10
 A    8
 B    3
 B    2

But our requirement is different, we want to select top n% records from each category where n is not fixed, n is based of some percent of elements in each group.

784

asked Jul 08 '14 07:07

2 Answers

Referencing the response from a_horse_with_no_name, you can achieve something similar using percent_rank()

SELECT
    gp,
    val,
    pct_rank
FROM (
    SELECT
        gp,
        val,
        percent_rank() over (order by val desc) as pct_rank
    FROM variables.temp
    ) t
WHERE pct_rank <= 0.75;

You can then set the final WHERE clause to return data at whatever percent_rank() threshold you require.

118

answered Sep 26 '22 12:09

xrpza

To retrieve the rows based on the percentage of the number of rows in each group you can use two window functions: one to count the rows and one to give them a unique number.

select gp,
       val
from (
  select gp, 
         val,
         count(*) over (partition by gp) as cnt,
         row_number() over (partition by gp order by val desc) as rn
  from temp
) t
where rn / cnt <= 0.75;

SQLFiddle example: http://sqlfiddle.com/#!15/94fdd/1

Btw: using char is almost always a bad idea because it is a fixed-length data type that is padded to the defined length. I hope you only did that for setting up the example and don't use it in your real table.

answered Sep 22 '22 12:09

a_horse_with_no_name

Related questions
                            
                                VB.NET: How to camel case words that are uppercased
                            
                                Execute a Create Table Query through the JPA EntityManager
                            
                                Generate id row for a view with grouping
                            
                                Comparing two tables in SQLite
                            
                                Only display certain columns in DataGrid from an Entity Object
                            
                                PostgreSQL query not using index in production
                            
                                SQL MIN Function with where clause
                            
                                Oracle + dbunit gets AmbiguousTableNameException
                            
                                If field is null, pull certain fields; otherwise, pull other fields
                            
                                Sequence error in sql. Sequence number not allowed here
                            
                                Bulk Insert doesn't insert any rows
                            
                                Delete Every Alternate Row in SQL
                            
                                T-SQL SELECT with GROUP BY id
                            
                                select distinct timestamp as DD/MM/YYYY mysql
                            
                                SQL Server 2012 Windowing function to calculate a running total
                            
                                How to convert Visual Foxpro database into SQL Server database
                            
                                Cast string+ntext to nvarchar error
                            
                                Check if a variable contains any non-numeric digits in SQL Server
                            
                                An exception of type 'System.Data.SqlClient.SqlException' occurred in System.Data.dll
                            
                                Joining All Rows of Two Tables in SQL Server

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Postgresql : How do I select top n percent(%) entries from each group/category

Tags:

sql

database

postgresql

postgresql-9.1

dpilwal

People also ask

2 Answers

xrpza

a_horse_with_no_name

Recent Activity

Donate For Us