Grouped LIMIT in PostgreSQL: show the first N rows for each group?

Tags: sql, postgresql

I need to take the first N rows for each group, ordered by custom column.

Given the following table:

db=# SELECT * FROM xxx;
 id | section_id | name
----+------------+------
  1 |          1 | A
  2 |          1 | B
  3 |          1 | C
  4 |          1 | D
  5 |          2 | E
  6 |          2 | F
  7 |          3 | G
  8 |          2 | H
(8 rows)

I need the first 2 rows (ordered by name) for each section_id, i.e. a result similar to:

 id | section_id | name
----+------------+------
  1 |          1 | A
  2 |          1 | B
  5 |          2 | E
  6 |          2 | F
  7 |          3 | G
(5 rows)

I am using PostgreSQL 8.3.5.


asked Jul 14 '09 10:07

Kouber Saparev

6 Answers

New solution (PostgreSQL 8.4 and later), using the ROW_NUMBER() window function:

SELECT
  * 
FROM (
  SELECT
    ROW_NUMBER() OVER (PARTITION BY section_id ORDER BY name) AS r,
    t.*
  FROM
    xxx t) x
WHERE
  x.r <= 2;

answered Oct 02 '22 16:10

Dave

Since PostgreSQL 9.3 you can use a lateral join (the table is called t here):

select distinct t_outer.section_id, t_top.id, t_top.name from t t_outer
join lateral (
    select * from t t_inner
    where t_inner.section_id = t_outer.section_id
    order by t_inner.name
    limit 2
) t_top on true
order by t_outer.section_id;

It might be faster but, of course, you should test performance specifically on your data and use case.
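One way to run that test, as a rough sketch: an index on (section_id, name) should let the inner query of the lateral join read the first two names of each section directly from the index, and EXPLAIN ANALYZE shows whether the planner actually uses it. The index name below is made up for illustration, and the table is called t as in the query above.

```sql
-- Hypothetical index name; the column order matches the lateral subquery
-- (filter on section_id, then ORDER BY name).
CREATE INDEX t_section_id_name_idx ON t (section_id, name);

-- Refresh planner statistics after creating the index.
ANALYZE t;

-- EXPLAIN ANALYZE executes the query and reports the actual plan and timings.
EXPLAIN ANALYZE
SELECT DISTINCT t_outer.section_id, t_top.id, t_top.name
FROM t t_outer
JOIN LATERAL (
    SELECT *
    FROM t t_inner
    WHERE t_inner.section_id = t_outer.section_id
    ORDER BY t_inner.name
    LIMIT 2
) t_top ON true
ORDER BY t_outer.section_id;
```

Comparing this plan with the plan of the window-function query on the same data gives a fairer picture than timing either query once.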

answered Oct 02 '22 16:10

poshest

Here's another solution (PostgreSQL <= 8.3).

SELECT
  *
FROM
  xxx a
WHERE (
  SELECT
    COUNT(*)
  FROM
    xxx
  WHERE
    section_id = a.section_id
  AND
    name <= a.name
) <= 2
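The query above ranks rows by name only, so if several rows in a section share the same name, the count jumps past 2 and that section can return fewer than two rows. A possible variant, assuming id is unique (the question does not say so explicitly), breaks ties with a row comparison on (name, id):

```sql
-- Tie-safe variant: rows with equal names are ordered by id,
-- so every row in a section gets a distinct rank.
-- Assumes id is unique within the table.
SELECT *
FROM xxx a
WHERE (
  SELECT COUNT(*)
  FROM xxx b
  WHERE b.section_id = a.section_id
    AND (b.name, b.id) <= (a.name, a.id)
) <= 2;
```

On the sample data from the question, where all names are distinct, both versions return the same rows.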

answered Oct 02 '22 14:10

Kouber Saparev

SELECT  x.*
FROM    (
        SELECT  section_id,
                COALESCE
                (
                (
                SELECT  xi
                FROM    xxx xi
                WHERE   xi.section_id = xo.section_id
                ORDER BY
                        name, id
                OFFSET 1 LIMIT 1
                ),
                (
                SELECT  xi
                FROM    xxx xi
                WHERE   xi.section_id = xo.section_id
                ORDER BY 
                        name DESC, id DESC
                LIMIT 1
                )
                ) AS mlast
        FROM    (
                SELECT  DISTINCT section_id
                FROM    xxx
                ) xo
        ) xoo
JOIN    xxx x
ON      x.section_id = xoo.section_id
        AND (x.name, x.id) <= ((mlast).name, (mlast).id)

answered Oct 02 '22 15:10

Quassnoi

-- ranking without WINDOW functions
-- EXPLAIN ANALYZE
WITH rnk AS (
        SELECT x1.id
        , COUNT(x2.id) AS rnk
        FROM xxx x1
        LEFT JOIN xxx x2 ON x1.section_id = x2.section_id AND x2.name <= x1.name
        GROUP BY x1.id
        )
SELECT this.*
FROM xxx this
JOIN rnk ON rnk.id = this.id
WHERE rnk.rnk <= 2
ORDER BY this.section_id, rnk.rnk
;

-- The same without using a CTE
-- EXPLAIN ANALYZE
SELECT this.*
FROM xxx this
JOIN ( SELECT x1.id
        , COUNT(x2.id) AS rnk
        FROM xxx x1
        LEFT JOIN xxx x2 ON x1.section_id = x2.section_id AND x2.name <= x1.name
        GROUP BY x1.id
        ) rnk
ON rnk.id = this.id
WHERE rnk.rnk <= 2
ORDER BY this.section_id, rnk.rnk
;

answered Oct 02 '22 16:10

wildplasser

A lateral join is the way to go, but on large tables you should first select the distinct groups in a nested query and join against that to improve performance.

SELECT t_limited.*
FROM (
        SELECT DISTINCT section_id
        FROM t
    ) t_groups
    JOIN LATERAL (
        SELECT *
        FROM t t_all
        WHERE t_all.section_id = t_groups.section_id
        ORDER BY t_all.name
        LIMIT 2
    ) t_limited ON true

Without the nested SELECT DISTINCT, the lateral subquery runs for every row in the table, even though most section_id values are duplicated. With the nested SELECT DISTINCT, the lateral subquery runs exactly once for each distinct section_id.
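To try this on the data from the question, a small self-contained script might look like the following. The column types and constraints are assumptions, since the question only shows the table contents, and an ORDER BY is added so the output order is deterministic.

```sql
-- Sample data from the question, loaded into a table named t
-- (the name used by the query above). Column types are assumed.
CREATE TABLE t (
    id         integer PRIMARY KEY,
    section_id integer NOT NULL,
    name       text    NOT NULL
);

INSERT INTO t (id, section_id, name) VALUES
    (1, 1, 'A'), (2, 1, 'B'), (3, 1, 'C'), (4, 1, 'D'),
    (5, 2, 'E'), (6, 2, 'F'), (7, 3, 'G'), (8, 2, 'H');

-- One lateral subquery execution per distinct section_id.
SELECT t_limited.*
FROM (
        SELECT DISTINCT section_id
        FROM t
    ) t_groups
    JOIN LATERAL (
        SELECT *
        FROM t t_all
        WHERE t_all.section_id = t_groups.section_id
        ORDER BY t_all.name
        LIMIT 2
    ) t_limited ON true
ORDER BY t_limited.section_id, t_limited.name;
```

This returns the rows with id 1, 2, 5, 6 and 7, which is the result asked for in the question.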

answered Oct 02 '22 14:10

David Skinner
