Is there a way to group by a unique (primary) key, essentially giving an implicit guarantee that the other columns from that table will be well-defined? <pre class="prettyprint"><code>SELECT myPrimaryKey, otherThing FROM myTable GROUP BY myPrimaryKey </code></pre> I know that I can add the other columns to the statement (<code>GROUP BY myPrimaryKey,otherThing</code>), but I'm trying to avoid that. If you're curious why, read on: <hr> I have a statement which is essentially doing this: <pre class="prettyprint"><code>SELECT nodes.node_id, nodes.node_label, COUNT(1) FROM {a couple of joined tables} INNER JOIN nodes USING (node_id) GROUP BY nodes.node_id, nodes.node_label </code></pre> which works fine, but is a bit slow in MySQL. If I remove <code>nodes.node_label</code> from the <code>GROUP BY</code>, it runs about 10x faster (according to <code>EXPLAIN</code>, this is because one of the earlier joins starts using an index when previously it didn't). We're in the process of migrating to Postgres, so all new statements are supposed to be compatible with both MySQL and Postgres when possible. Now in Postgres, the original statement runs fast, but the new statement (with the reduced group by) won't run (because Postgres is stricter). In this case, it's a false error because the statement is actually well-defined. Is there a syntax I can use which will let the same statement run in both platforms, while letting MySQL use just one column in the group by for speed?

In more recent versions of MySql you might have <code>sql_mode=only_full_group_by</code> enabled which doesn't allow to select non-aggregated columns when using <code>group by</code> i.e. it forces you to use a function like <code>max()</code> or <code>avg()</code> or <code>group_concat()</code>, sometimes you just want any value. This flag is enabled by default in MySql 5.7. The function <code>any_value()</code> is available when that flag is enabled. <blockquote> You can achieve the same effect without disabling ONLY_FULL_GROUP_BY by using ANY_VALUE() to refer to the nonaggregated column. </blockquote> <pre class="prettyprint"><code>select t.index, any_value(t.insert_date) from my_table t group by t.index; </code></pre> More information here: https://dev.mysql.com/doc/refman/5.7/en/sql-mode.html#sqlmode_only_full_group_by and here: https://dev.mysql.com/doc/refman/5.7/en/group-by-handling.html

In Postgres (not in MySQL, though), you could use <code>DISTINCT ON</code> to pick a single, consistent row per value (or group of values) without aggregating them: <pre class="prettyprint"><code>SELECT DISTINCT ON (n.node_id) * -- select any or all columns of all joined tables FROM {a couple of joined tables} JOIN nodes n USING (node_id) </code></pre> That gives you a single, arbitrary row for each <code>node_id</code>. to pick a specific row, add: <pre class="prettyprint"><code>ORDER BY n.node_id, ... -- what to sort first? </code></pre> .. add more <code>ORDER BY</code> items to pick a specific row. Details: Select first row in each GROUP BY group?

GROUP BY only primary key, but select other values

Tags:

sql

mysql

postgresql

group-by

Is there a way to group by a unique (primary) key, essentially giving an implicit guarantee that the other columns from that table will be well-defined?

SELECT myPrimaryKey, otherThing
FROM myTable
GROUP BY myPrimaryKey

I know that I can add the other columns to the statement (GROUP BY myPrimaryKey,otherThing), but I'm trying to avoid that. If you're curious why, read on:

I have a statement which is essentially doing this:

SELECT nodes.node_id, nodes.node_label, COUNT(1)
FROM {a couple of joined tables}
INNER JOIN nodes USING (node_id)
GROUP BY nodes.node_id, nodes.node_label

which works fine, but is a bit slow in MySQL. If I remove nodes.node_label from the GROUP BY, it runs about 10x faster (according to EXPLAIN, this is because one of the earlier joins starts using an index when previously it didn't).

We're in the process of migrating to Postgres, so all new statements are supposed to be compatible with both MySQL and Postgres when possible. Now in Postgres, the original statement runs fast, but the new statement (with the reduced group by) won't run (because Postgres is stricter). In this case, it's a false error because the statement is actually well-defined.

Is there a syntax I can use which will let the same statement run in both platforms, while letting MySQL use just one column in the group by for speed?

442

asked Jun 05 '14 14:06

Dave

2 Answers

In more recent versions of MySql you might have sql_mode=only_full_group_by enabled which doesn't allow to select non-aggregated columns when using group by i.e. it forces you to use a function like max() or avg() or group_concat(), sometimes you just want any value.

This flag is enabled by default in MySql 5.7.

The function any_value() is available when that flag is enabled.

You can achieve the same effect without disabling ONLY_FULL_GROUP_BY by using ANY_VALUE() to refer to the nonaggregated column.

select t.index, any_value(t.insert_date)
from my_table t
group by t.index;

More information here: https://dev.mysql.com/doc/refman/5.7/en/sql-mode.html#sqlmode_only_full_group_by and here: https://dev.mysql.com/doc/refman/5.7/en/group-by-handling.html

answered Oct 08 '22 17:10

santiago arizti

In Postgres (not in MySQL, though), you could use DISTINCT ON to pick a single, consistent row per value (or group of values) without aggregating them:

SELECT DISTINCT ON (n.node_id)
       *                 -- select any or all columns of all joined tables
FROM   {a couple of joined tables}
JOIN   nodes n USING (node_id)

That gives you a single, arbitrary row for each node_id. to pick a specific row, add:

ORDER  BY n.node_id, ... -- what to sort first?

.. add more ORDER BY items to pick a specific row. Details:
Select first row in each GROUP BY group?

answered Oct 08 '22 17:10

Erwin Brandstetter

Related questions
                            
                                Laravel Schema onDelete set default
                            
                                How to instruct SQLAlchemy ORM to execute multiple queries in parallel when loading relationships?
                            
                                Mysqldump Error: ONLY_FULL_GROUP_BY
                            
                                Using `rand()` with `having`
                            
                                Determining which objects are or are not linked to the main root object
                            
                                Remotely Accessing MySQL on Mac Mini/Time Capsule
                            
                                Configure Wildfly to use SSL connection for MariaDB
                            
                                How to calculate MySQL Query Memory/CPU Cost
                            
                                Mysql transaction waiting for lock which is already granted .. This is causing deadlock
                            
                                Save R plot to web server
                            
                                How to use result of an subquery multiple times into an query
                            
                                Recursive group structure in MySQL
                            
                                Is MySQL logic evaluation lazy/short-circuiting in JOIN clause?
                            
                                Efficient external rostering with MySQL and ejabberd
                            
                                Strange MySQL warning 1264 for valid DateTime value
                            
                                MySQL Database Connection Management In PDO
                            
                                Sql query containing 2 databases
                            
                                How can I fetch correct datatypes from MySQL with PDO? [duplicate]
                            
                                Rails: differences in db/schema.rb - null: false at created_at/updated_at columns
                            
                                Why not use the built-in MySQL users and permissions for a website?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With