Is there any purpose for using both DISTINCT and GROUP BY in SQL? Below is a sample code <pre class="prettyprint"><code>SELECT DISTINCT Actors FROM MovieDetails GROUP BY Actors </code></pre> Does anyone know of any situations where both DISTINCT and GROUP BY need to be used, to get any specific desired results? (The general usage of DISTINCT and GROUP BY separately is understood)

<h3>Use <code>DISTINCT</code> to remove duplicate <code>GROUPING SETS</code> from the <code>GROUP BY</code> clause</h3> In a completely silly example using <code>GROUPING SETS()</code> in general (or the special grouping sets <code>ROLLUP()</code> or <code>CUBE()</code> in particular), you could use <code>DISTINCT</code> in order to remove the duplicate values produced by the grouping sets again: <pre class="prettyprint lang-sql prettyprint-override"><code>SELECT DISTINCT actors FROM (VALUES('a'), ('a'), ('b'), ('b')) t(actors) GROUP BY CUBE(actors, actors) </code></pre> With <code>DISTINCT</code>: <pre class="prettyprint lang-none prettyprint-override"><code>actors ------ NULL a b </code></pre> Without <code>DISTINCT</code>: <pre class="prettyprint lang-none prettyprint-override"><code>actors ------ a b NULL a b a b </code></pre> But why, apart from making an academic point, would you do that? <h3>Use <code>DISTINCT</code> to find unique aggregate function values</h3> In a less far-fetched example, you might be interested in the <code>DISTINCT</code> aggregated values, such as, how many different duplicate numbers of actors are there? <pre class="prettyprint lang-sql prettyprint-override"><code>SELECT DISTINCT COUNT(*) FROM (VALUES('a'), ('a'), ('b'), ('b')) t(actors) GROUP BY actors </code></pre> Answer: <pre class="prettyprint lang-none prettyprint-override"><code>count ----- 2 </code></pre> <h3>Use <code>DISTINCT</code> to remove duplicates with more than one <code>GROUP BY</code> column</h3> Another case, of course, is this one: <pre class="prettyprint lang-sql prettyprint-override"><code>SELECT DISTINCT actors, COUNT(*) FROM (VALUES('a', 1), ('a', 1), ('b', 1), ('b', 2)) t(actors, id) GROUP BY actors, id </code></pre> With <code>DISTINCT</code>: <pre class="prettyprint lang-none prettyprint-override"><code>actors count ------------- a 2 b 1 </code></pre> Without <code>DISTINCT</code>: <pre class="prettyprint lang-none prettyprint-override"><code>actors count ------------- a 2 b 1 b 1 </code></pre> For more details, I've written some blog posts, e.g. about <code>GROUPING SETS</code> and how they influence the <code>GROUP BY</code> operation, or about the logical order of SQL operations (as opposed to the lexical order of operations).

Using DISTINCT along with GROUP BY in SQL Server

Tags:

Is there any purpose for using both DISTINCT and GROUP BY in SQL?

Below is a sample code

SELECT DISTINCT Actors FROM MovieDetails GROUP BY Actors

Does anyone know of any situations where both DISTINCT and GROUP BY need to be used, to get any specific desired results?

(The general usage of DISTINCT and GROUP BY separately is understood)

684

asked Sep 21 '15 18:09

Vamsi

1 Answers

Use `DISTINCT` to remove duplicate `GROUPING SETS` from the `GROUP BY` clause

In a completely silly example using GROUPING SETS() in general (or the special grouping sets ROLLUP() or CUBE() in particular), you could use DISTINCT in order to remove the duplicate values produced by the grouping sets again:

SELECT DISTINCT actors FROM (VALUES('a'), ('a'), ('b'), ('b')) t(actors) GROUP BY CUBE(actors, actors)

With DISTINCT:

actors ------ NULL a b

Without DISTINCT:

actors ------ a b NULL a b a b

But why, apart from making an academic point, would you do that?

Use `DISTINCT` to find unique aggregate function values

In a less far-fetched example, you might be interested in the DISTINCT aggregated values, such as, how many different duplicate numbers of actors are there?

SELECT DISTINCT COUNT(*) FROM (VALUES('a'), ('a'), ('b'), ('b')) t(actors) GROUP BY actors

Answer:

count ----- 2

Use `DISTINCT` to remove duplicates with more than one `GROUP BY` column

Another case, of course, is this one:

SELECT DISTINCT actors, COUNT(*) FROM (VALUES('a', 1), ('a', 1), ('b', 1), ('b', 2)) t(actors, id) GROUP BY actors, id

With DISTINCT:

actors  count ------------- a       2 b       1

Without DISTINCT:

actors  count ------------- a       2 b       1 b       1

For more details, I've written some blog posts, e.g. about GROUPING SETS and how they influence the GROUP BY operation, or about the logical order of SQL operations (as opposed to the lexical order of operations).

answered Sep 21 '22 17:09

Lukas Eder

Related questions
                            
                                Generic controller in swift 2.0 using storyboards
                            
                                Airflow - Python file NOT in the same DAG folder
                            
                                Will const and let make the IIFE pattern unnecessary?
                            
                                std::is_base_of for template classes
                            
                                Linking R package vignettes
                            
                                How to read window content (using accessibilityService) and evoking UI using draw over other app permission in Android?
                            
                                Why use JUnit test suites?
                            
                                Flexbox: move middle element to the next line
                            
                                How to hydrate a Dictionary with the results of async calls?
                            
                                How to make Electron tray click events working reliably?
                            
                                Why does JSON.stringify return empty object notation "{}" for an object that seems to have properties?
                            
                                Why can't PowerShell find the gcloud cmdlets?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using DISTINCT along with GROUP BY in SQL Server

Tags:

Vamsi

People also ask

1 Answers

Use `DISTINCT` to remove duplicate `GROUPING SETS` from the `GROUP BY` clause

Use `DISTINCT` to find unique aggregate function values

Use `DISTINCT` to remove duplicates with more than one `GROUP BY` column

Lukas Eder

Recent Activity

Donate For Us

Using DISTINCT along with GROUP BY in SQL Server

Tags:

Vamsi

People also ask

1 Answers

Use DISTINCT to remove duplicate GROUPING SETS from the GROUP BY clause

Use DISTINCT to find unique aggregate function values

Use DISTINCT to remove duplicates with more than one GROUP BY column

Lukas Eder

Related questions

Recent Activity

Donate For Us

Use `DISTINCT` to remove duplicate `GROUPING SETS` from the `GROUP BY` clause

Use `DISTINCT` to find unique aggregate function values

Use `DISTINCT` to remove duplicates with more than one `GROUP BY` column