SQL select elements where sum of field is less than N

Tags:

Given that I've got a table with the following, very simple content:

# select * from messages;
  id | verbosity 
 ----+-----------
   1 |        20
   2 |        20
   3 |        20
   4 |        30
   5 |       100
 (5 rows)

I would like to select N messages, which sum of verbosity is lower than Y (for testing purposes let's say it should be 70, then correct results will be messages with id 1,2,3). It's really important to me, that solution should be database independent (it should work at least on Postgres and SQLite).

I was trying with something like:

SELECT * FROM messages GROUP BY id HAVING SUM(verbosity) < 70;

However it doesn't seem to work as expected, because it doesn't actually sum all values from verbosity column.

I would be very grateful for any hints/help.

498

asked Jul 27 '12 13:07

user1105595

2 Answers

SELECT m.id, sum(m1.verbosity) AS total
FROM   messages m
JOIN   messages m1 ON m1.id <= m.id
WHERE  m.verbosity < 70    -- optional, to avoid pointless evaluation
GROUP  BY m.id
HAVING SUM(m1.verbosity) < 70
ORDER  BY total DESC
LIMIT  1;

This assumes a unique, ascending id like you have in your example.

In modern Postgres - or generally with modern standard SQL (but not in SQLite):

Simple CTE

WITH cte AS (
   SELECT *, sum(verbosity) OVER (ORDER BY id) AS total
   FROM   messages
   )
SELECT *
FROM   cte
WHERE  total < 70
ORDER  BY id;

Recursive CTE

Should be faster for big tables where you only retrieve a small set.

WITH RECURSIVE cte AS (
   (  -- parentheses required
   SELECT id, verbosity, verbosity AS total
   FROM   messages
   ORDER  BY id
   LIMIT  1
   )

   UNION ALL 
   SELECT c1.id, c1.verbosity, c.total + c1.verbosity 
   FROM   cte c
   JOIN   LATERAL (
      SELECT *
      FROM   messages
      WHERE  id > c.id
      ORDER  BY id
      LIMIT  1
      ) c1 ON  c1.verbosity < 70 - c.total
   WHERE c.total < 70
   )
SELECT *
FROM   cte
ORDER  BY id;

All standard SQL, except for LIMIT.

Strictly speaking, there is no such thing as "database-independent". There are various SQL-standards, but no RDBMS complies completely. LIMIT works for PostgreSQL and SQLite (and some others). Use TOP 1 for SQL Server, rownum for Oracle. Here's a comprehensive list on Wikipedia.

The SQL:2008 standard would be:

...
FETCH  FIRST 1 ROWS ONLY

... which PostgreSQL supports - but hardly any other RDBMS.

The pure alternative that works with more systems would be to wrap it in a subquery and

SELECT max(total) FROM <subquery>

But that is slow and unwieldy.

db<>fiddle here
_{Old sqlfiddle}

172

answered Oct 13 '22 05:10

Erwin Brandstetter

This will work...

select * 
from messages
where id<=
(
    select MAX(id) from
    (
        select m2.id, SUM(m1.verbosity) sv 
        from messages m1
        inner join messages m2 on m1.id <=m2.id
        group by m2.id
    ) v
    where sv<70
)

However, you should understand that SQL is designed as a set based language, rather than an iterative one, so it designed to treat data as a set, rather than on a row by row basis.

answered Oct 13 '22 05:10

podiluska

Related questions
                            
                                Using "like" in a cursor/query with a parameter in python (django)
                            
                                Why does LINQ send sp_executesql instead of directly executing the SQL?
                            
                                ORDER BY on different columns in different directions in SQLite
                            
                                Find a Database table's unique constraint
                            
                                List of all tables with a relationship to a given table or view
                            
                                Is adding a bit mask to all tables in a database useful?
                            
                                Simple SQL Lite table/import question
                            
                                Reordering an ordered list
                            
                                how to best organize the Inner Joins in (select) statement
                            
                                SELECT min and max value from a part of a table in MySQL
                            
                                Oracle: Indexing a subset of rows of a table
                            
                                how to select last 12 months name and year without using tables using sql query?
                            
                                Initialising a pl/sql record type
                            
                                Concat two table columns and update one with result
                            
                                Is this a 1NF failure?
                            
                                how find "holes" in auto_increment column?
                            
                                LIMIT offset or OFFSET in an UPDATE SQL query
                            
                                How to properly add brackets to SQL queries with 'or' and 'and' clauses by using Arel?
                            
                                Postgresql select between month range
                            
                                Store a PHP array in a single SQL cell

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SQL select elements where sum of field is less than N

Tags:

sql

sqlite

postgresql

aggregate-functions

sql-limit