I have a MySQL table with the fields <code>id</code> and <code>string</code>. <code>id</code>s are unique. <code>string</code>s are varchars and are non-unique. I perform the following query: <pre class="prettyprint"><code>SELECT id, string, COUNT( * ) AS frequency FROM table GROUP BY string ORDER BY frequency DESC, id ASC </code></pre> Questions Assume the table contains three rows with identical <code>string</code> values, and <code>id</code>s 1, 2, and 3. <ol> <li>Which <code>id</code> is going to be returned ( 1, 2, or 3 )?</li> <li>Which <code>id</code> is this query going to <code>ORDER BY</code> ( Same as is returned? ... see question 1 )?</li> <li>Can you control which <code>id</code> is returned / used for ordering? eg. Return the largest <code>id</code>, or the first <code>id</code> from a GROUP.</li> </ol> What I'm ultimately trying to do is get a frequency occurrence for identical strings, order by that frequency, highest to lowest, and on a frequency tie, order by <code>id</code> with the smallest <code>id</code> from the group returned / ordered by. I made the situation more generic to figure out how MySQL handles this situation.

The documentation says that when not grouping by all non-aggregate columns, one row for each unique combination if the grouped by columns is returned. The row selected is up to the server - ie "random" However, in practice it is the first row encountered during processing. You can control which is encountered first by selecting from an inner query that is ordered in the order of preference of return. For example to get the lowest id for each name (yes, undocumented, blah blah, but it works!): <pre class="prettyprint"><code>SELECT id, name, COUNT( * ) AS frequency FROM (select * from table order by id) x GROUP BY name ORDER BY frequency DESC, id ASC </code></pre> I personally am comfortable relying on this behaviour and have never seen or heard of it behaving differently in real life. Many shun this as undocumented and "risky", but if it works, it works.

Which row's fields are returned when Grouping with MySQL?

Tags:

mysql

sql-order-by

group-by

I have a MySQL table with the fields id and string. ids are unique. strings are varchars and are non-unique.

I perform the following query:

SELECT id, string, COUNT( * ) AS frequency
FROM table
GROUP BY string
ORDER BY frequency DESC, id ASC

Questions

Assume the table contains three rows with identical string values, and ids 1, 2, and 3.

Which id is going to be returned ( 1, 2, or 3 )?
Which id is this query going to ORDER BY ( Same as is returned? ... see question 1 )?
Can you control which id is returned / used for ordering? eg. Return the largest id, or the first id from a GROUP.

What I'm ultimately trying to do is get a frequency occurrence for identical strings, order by that frequency, highest to lowest, and on a frequency tie, order by id with the smallest id from the group returned / ordered by. I made the situation more generic to figure out how MySQL handles this situation.

302

asked Sep 10 '13 02:09

T. Brian Jones

2 Answers

Which id is going to be returned ( 1, 2, or 3 )?

A: The server will choose for all the records that have the same name the id it wants (most likely the fastest to fetch, which is unpredictable). To cite the official documentation:

The server is free to choose any value from each group, so unless they are the same, the values chosen are indeterminate.

Much more information in this link.

Which id is this query going to ORDER BY ( Same as is returned? ... see question 1 )?

It makes no sense to find out in what order the data retrieved will be returned as you can't predict the result you are going to get. However, it is very likely that you get the result sorted by the unpredictable ID column.

Can you control which id is returned / used for ordering? eg. Return the largest id, or the first id from a GROUP.

You should be assuming at this point that you can't. Read again the documentation.

Making things even more clear: You can't predict the result of an improperly used GROUP BY clause. The main issue with MySQL is that it allows you to use it in a non-standard way but you need to know how to make use of that feature. The main point behind it is to group by fields that you know will always be the same. EG:

SELECT id, name, COUNT( * ) AS frequency
FROM table
GROUP BY id

Here, you know name will be unique as id functionally determines name. So the result you know is valid. If you grouped also by name this query would be more standard but will perform slightly worse in MySQL.

As a final note, take into account that, in my experience the results in those non-standard queries for the selected and non-grouped fields are usually the ones that you would get applying a GROUP BY and then an ORDER BY on that field. That is why so many times it seems to work. However, if you keep testing you will eventually find out that this happens 95% of the time. And you can not rely on that number.

answered Oct 05 '22 08:10

Mosty Mostacho

The documentation says that when not grouping by all non-aggregate columns, one row for each unique combination if the grouped by columns is returned. The row selected is up to the server - ie "random"

However, in practice it is the first row encountered during processing. You can control which is encountered first by selecting from an inner query that is ordered in the order of preference of return.

For example to get the lowest id for each name (yes, undocumented, blah blah, but it works!):

SELECT id, name, COUNT( * ) AS frequency
FROM (select * from table order by id) x
GROUP BY name
ORDER BY frequency DESC, id ASC

I personally am comfortable relying on this behaviour and have never seen or heard of it behaving differently in real life. Many shun this as undocumented and "risky", but if it works, it works.

answered Oct 05 '22 07:10

Bohemian

Related questions
                            
                                Fetch range from days
                            
                                Warning raised by inserting 4-byte unicode to mysql
                            
                                MySQL INSERT INTO syntax [duplicate]
                            
                                How to give priority to certain queries?
                            
                                Finding exact value from a comma separated string in PHP MySQL
                            
                                How to set default value of a mysql column as unix_timestamp?
                            
                                MySQL: ceasing to use a Scheme (Database)?
                            
                                Execute mysql "create function" statement with PHP
                            
                                When adding a foreign key constraint, which direction is best practice?
                            
                                SQL standard UPSERT call
                            
                                How to get values of a column without knowing the column type
                            
                                How to lock a row for select in MySQL [duplicate]
                            
                                How to find the hierarchy path for a tree representation
                            
                                To get MYSQL query execution time in Query
                            
                                Combine two queries to check for duplicates in MySQL?
                            
                                Unable to select Database Foo using Ec2 and RDS
                            
                                What happened to to phpMyAdmin's "display direction" dropdown list in version 4?
                            
                                MySQL ERROR 1005: Can't create table (errno: 150)
                            
                                mysqli_insert_id: What if someone inserts another row just before I call this?
                            
                                MySQLdb initial connection timeout

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With