How can one filter a grouped resultset for only those groups that meet some criterion compared against the other groups? For example, only those groups that have the maximum number of constituent records? I had thought that a subquery as follows should do the trick: <pre class="prettyprint"><code>SELECT * FROM ( SELECT *, COUNT(*) AS Records FROM T GROUP BY X ) t HAVING Records = MAX(Records); </code></pre> However the addition of the final <code>HAVING</code> clause results in an empty recordset... what's going on?

In MySQL (Which I assume you are using since you have posted <code>SELECT *, COUNT(*) FROM T GROUP BY X</code> Which would fail in all RDBMS that I know of). You can use: <pre class="prettyprint"><code>SELECT T.* FROM T INNER JOIN ( SELECT X, COUNT(*) AS Records FROM T GROUP BY X ORDER BY Records DESC LIMIT 1 ) T2 ON T2.X = T.X </code></pre> This has been tested in MySQL and removes the implicit grouping/aggregation. If you can use windowed functions and one of TOP/LIMIT with Ties or Common Table expressions it becomes even shorter: Windowed function + CTE: (MS SQL-Server & PostgreSQL Tested) <pre class="prettyprint"><code>WITH CTE AS ( SELECT *, COUNT(*) OVER(PARTITION BY X) AS Records FROM T ) SELECT * FROM CTE WHERE Records = (SELECT MAX(Records) FROM CTE) </code></pre> Windowed Function with TOP (MS SQL-Server Tested) <pre class="prettyprint"><code>SELECT TOP 1 WITH TIES * FROM ( SELECT *, COUNT(*) OVER(PARTITION BY X) [Records] FROM T ) ORDER BY Records DESC </code></pre> Lastly, I have never used oracle so apolgies for not adding a solution that works on oracle... <hr> EDIT My Solution for MySQL did not take into account ties, and my suggestion for a solution to this kind of steps on the toes of what you have said you want to avoid (duplicate subqueries) so I am not sure I can help after all, however just in case it is preferable here is a version that will work as required on your fiddle: <pre class="prettyprint"><code>SELECT T.* FROM T INNER JOIN ( SELECT X FROM T GROUP BY X HAVING COUNT(*) = ( SELECT COUNT(*) AS Records FROM T GROUP BY X ORDER BY Records DESC LIMIT 1 ) ) T2 ON T2.X = T.X </code></pre>

Compare SQL groups against eachother

Tags:

sql

group-by

How can one filter a grouped resultset for only those groups that meet some criterion compared against the other groups? For example, only those groups that have the maximum number of constituent records?

I had thought that a subquery as follows should do the trick:

SELECT * FROM (
    SELECT   *, COUNT(*) AS Records
    FROM     T
    GROUP BY X
) t HAVING Records = MAX(Records);

However the addition of the final HAVING clause results in an empty recordset... what's going on?

654

asked Mar 27 '12 13:03

eggyal

1 Answers

In MySQL (Which I assume you are using since you have posted SELECT *, COUNT(*) FROM T GROUP BY X Which would fail in all RDBMS that I know of). You can use:

SELECT  T.*
FROM    T
        INNER JOIN
        (   SELECT  X, COUNT(*) AS Records
            FROM    T
            GROUP BY X
            ORDER BY Records DESC
            LIMIT 1
        ) T2
            ON T2.X = T.X

This has been tested in MySQL and removes the implicit grouping/aggregation.

If you can use windowed functions and one of TOP/LIMIT with Ties or Common Table expressions it becomes even shorter:

Windowed function + CTE: (MS SQL-Server & PostgreSQL Tested)

WITH CTE AS
(   SELECT  *, COUNT(*) OVER(PARTITION BY X) AS Records
    FROM    T
)
SELECT  *
FROM    CTE
WHERE   Records = (SELECT MAX(Records) FROM CTE)

Windowed Function with TOP (MS SQL-Server Tested)

SELECT  TOP 1 WITH TIES *
FROM    (   SELECT  *, COUNT(*) OVER(PARTITION BY X) [Records]
            FROM    T
        )
ORDER BY Records DESC

Lastly, I have never used oracle so apolgies for not adding a solution that works on oracle...

EDIT

My Solution for MySQL did not take into account ties, and my suggestion for a solution to this kind of steps on the toes of what you have said you want to avoid (duplicate subqueries) so I am not sure I can help after all, however just in case it is preferable here is a version that will work as required on your fiddle:

SELECT  T.*
FROM    T
        INNER JOIN
        (   SELECT  X
            FROM    T
            GROUP BY X
            HAVING  COUNT(*) = 
                    (   SELECT  COUNT(*) AS Records
                        FROM    T
                        GROUP BY X
                        ORDER BY Records DESC
                        LIMIT 1
                    )
        ) T2
            ON T2.X = T.X

110

answered Sep 30 '22 08:09

GarethD

Related questions
                            
                                Access 2003 VBA SQL "Too few parameters" error
                            
                                Sql query - getting rid of hard-coded values
                            
                                Please help me convert SQL to LINQ
                            
                                SSIS Writing 0x00 Hex Value to Flat File
                            
                                How to update a single column of a table of data from a backup
                            
                                Sql Join for partly overlapping data
                            
                                Managing a subset of the database in a SQL Server 2008 DB Project
                            
                                MySQL, multiple rows to separate fields
                            
                                SQL Server 2008 R2 Encryption - with Entity Framework
                            
                                slow inserts mysql
                            
                                Find columns that contain only zeros
                            
                                Moving in Closure Table with Multiple Parents
                            
                                ORA-01446 - cannot select ROWID from view with DISTINCT, GROUP BY, etc
                            
                                Using sysschedules like table for events SQL Server
                            
                                SQL Modeling Follower/Followed Relationships For Social Networking
                            
                                how to store data to database in HTML5
                            
                                counting null values in sql with where and group by clause
                            
                                SQL - "Save results as CSV" - use comma instead of Semi-Colon
                            
                                Multiple INNER JOIN from the same table
                            
                                MySQL How to INSERT INTO [temp table] FROM [Stored Procedure]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With