Is there a [straightforward] way to order results first, then group by another column, with SQL?

Tags:

I see that in an SQL query, the GROUP BY has to precede the ORDER BY expression. Does this imply that ordering is done after grouping would have discarded identical rows?

Because I seem to need to order rows by a timestamp first, then discard the rows with identical timestamp. And I don't know how to accomplish this.

I am using MySQL 5.1.41.

Here is the definition of the table expressed with create table:

Click to copy

create table
(
    A int,
    B timestamp
)

The data could be:

Click to copy

+-----+-----------------------+
|  A  |  B                    |
+-----+-----------------------+
|  1  |  today                |
|  1  |  yesterday            |
|  2  |  yesterday            |
|  2  |  tomorrow             |
+-----+-----------------------+

The results of the query on the above table, which I am after, would be:

Click to copy

+-----+-----------------------+
|  A  |  B                    |
+-----+-----------------------+
|  1  |  today                |
|  2  |  tomorrow             |
+-----+-----------------------+

Basically, I want the rows with the latest timestamp in column "B" (hence the mention of ORDER BY), and only one row for each value in column "A" (think DISTINCT or GROUP BY).

The actual problem behind the simplified example above:

In reality, I have two tables - users and payment_receipts:

Click to copy

create table users
(
    phone_nr int(10) unsigned not null,
    primary key (phone_nr)
)

create table payment_receipts
(
    phone_nr int(10) unsigned not null,
    payed_ts timestamp default current_timestamp not null,
    payed_until_ts timestamp not null,
    primary key (phone_nr, payed_ts, payed_until_ts)
)

The tables may include other columns but I omit these as irrelevant. Implementing a payment scheme, I have to send SMS to users across the cellular network, in periodic intervals depending on whether the payment is due or not. The payment is actualized when the SMS is sent as the recipient is taxed for it. I use the payment_receipts table to keep records of all payments done, i.e. for book-keeping. This is intended to model a real shop where both the buyer and the seller get a copy of the receipt of purchase, for reference. This table stores my (seller's) copy [of each receipt]. The customer's receipt is the received SMS itself. Each time an SMS is sent (and thus a payment is accomplished), the table is inserted a receipt record, stating who paid, when and "until when". To explain the latter, imagine a subscription service, but one which spans indefinitely until the user opt-out explicitly, at which point the corresponding user record is removed. A payment is made a month in advance, so as a rule, the difference between the payed_ts and payed_until_ts is 30 days worth of time.

I have a batch job that executes every day and needs to select a list of users that are due monthly payment as part of the automatic subscription renewal described above. To link this to the dummy example earlier, the phone number column phone_nr would be the column "A" and payed_until_ts would be column "B", but in reality there are two tables, which has to do with the following behaviour: when a user record is removed, the receipt must remain, for book-keeping. So not only do I need to group payments by date and discard all but the latest payment receipt date, I also need to watch out not to select receipts for which there no longer is a matching user record.

To solve the problem of selecting required records -- those that are due payment -- I need to find receipts with the latest payed_until_ts timestamp for each phone_nr (there may be several, obviously) and out of those records I further need to select only those phone numbers where payed_until_ts is earlier than the time the batch job executes. I then would send an SMS to each of these numbers, inserting a receipt record for each sent SMS, where payed_ts is now() and payed_until_ts is now() + interval 30 days.

But I can't seem to come up with the query required.

315

asked Jul 31 '10 12:07

amn

2 Answers

Click to copy

Select a,b from (select a,b from table order by b) as c group by a;

answered Nov 02 '22 04:11

Mike Sherov

Yes, grouping is done first, and it affects a single select whereas ordering affects all the results from all select statements in a union, such as:

Click to copy

select a, 'max', max(b) from tbl group by a
union all select a, 'min', min(b) from tbl group by a
order by 1, 2

(using field numbers in order by since I couldn't be bothered to name my columns). Each group by affects only its select, the order by affects the combined result set.

It seems that what you're after can be achieved with:

Click to copy

select A, max(B) from tbl group by A

This uses the max aggregation function to basically do your pre-group ordering (it doesn't actually sort it in any decent DBMS, rather it will simply choose the maximum from an suitable index if available).

answered Nov 02 '22 05:11

paxdiablo

Related questions
                            
                                JOIN on set returning function results
                            
                                Comma separate values with same number of rows
                            
                                Insert random data in Oracle table
                            
                                Update SQL with Aliased tables still returns "table is ambiguous" error
                            
                                How do I fill a temp table iteratively and stuff() the result into columns?
                            
                                Create a comma separated list from a column
                            
                                Should I use Effective Date or Start Date and End Date for historical recording?
                            
                                left join and group of inner join
                            
                                Database error - RIGHT and FULL OUTER JOINs are not currently supported
                            
                                How to use CASE statement inside a WHERE with an IN clause?
                            
                                Modeling Geographic Locations in an Relational Database
                            
                                SQL Select: Update if exists, Insert if not - With date part comparison?
                            
                                How can I make SQL Developer/SQL+ prompt only once for a substitution variable that occurs multiple times in a single statement?
                            
                                How to efficiently delete rows from a Postgresql 8.1 table?
                            
                                Looking for T-SQL scripts to delete a SQL Job
                            
                                What is "Linq to SQL"?
                            
                                Recommended approach on handling SqlExceptions in db applications
                            
                                Select only integers from char column using SQL Server
                            
                                Sorting SQL by first two characters of fields
                            
                                SQL: how to get the left 3 numbers from an int

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there a [straightforward] way to order results first, then group by another column, with SQL?

Tags:

sql

database

mysql

sql-order-by

group-by

The actual problem behind the simplified example above:

amn

People also ask

2 Answers

Mike Sherov

paxdiablo

Recent Activity

Donate For Us

Is there a [straightforward] way to order results *first*, *then* group by another column, with SQL?

Tags:

sql

database

mysql

sql-order-by

group-by

The actual problem behind the simplified example above:

amn

People also ask

2 Answers

Mike Sherov

paxdiablo

Related questions

Recent Activity

Donate For Us

Is there a [straightforward] way to order results first, then group by another column, with SQL?