Partition Function COUNT() OVER possible using DISTINCT

Tags:

I'm trying to write the following in order to get a running total of distinct NumUsers, like so:

NumUsers = COUNT(DISTINCT [UserAccountKey]) OVER (PARTITION BY [Mth])

Management studio doesn't seem too happy about this. The error disappears when I remove the DISTINCT keyword, but then it won't be a distinct count.

DISTINCT does not appear to be possible within the partition functions. How do I go about finding the distinct count? Do I use a more traditional method such as a correlated subquery?

Looking into this a bit further, maybe these OVER functions work differently to Oracle in the way that they cannot be used in SQL-Server to calculate running totals.

I've added a live example here on SQLfiddle where I attempt to use a partition function to calculate a running total.

250

asked Jun 26 '12 07:06

whytheq

2 Answers

There is a very simple solution using dense_rank()

dense_rank() over (partition by [Mth] order by [UserAccountKey])  + dense_rank() over (partition by [Mth] order by [UserAccountKey] desc)  - 1

This will give you exactly what you were asking for: The number of distinct UserAccountKeys within each month.

110

answered Sep 23 '22 21:09

David

Necromancing:

It's relativiely simple to emulate a COUNT DISTINCT over PARTITION BY with MAX via DENSE_RANK:

;WITH baseTable AS (     SELECT 'RM1' AS RM, 'ADR1' AS ADR     UNION ALL SELECT 'RM1' AS RM, 'ADR1' AS ADR     UNION ALL SELECT 'RM2' AS RM, 'ADR1' AS ADR     UNION ALL SELECT 'RM2' AS RM, 'ADR2' AS ADR     UNION ALL SELECT 'RM2' AS RM, 'ADR2' AS ADR     UNION ALL SELECT 'RM2' AS RM, 'ADR3' AS ADR     UNION ALL SELECT 'RM3' AS RM, 'ADR1' AS ADR     UNION ALL SELECT 'RM2' AS RM, 'ADR1' AS ADR     UNION ALL SELECT 'RM3' AS RM, 'ADR1' AS ADR     UNION ALL SELECT 'RM3' AS RM, 'ADR2' AS ADR ) ,CTE AS (     SELECT RM, ADR, DENSE_RANK() OVER(PARTITION BY RM ORDER BY ADR) AS dr      FROM baseTable ) SELECT      RM     ,ADR      ,COUNT(CTE.ADR) OVER (PARTITION BY CTE.RM ORDER BY ADR) AS cnt1      ,COUNT(CTE.ADR) OVER (PARTITION BY CTE.RM) AS cnt2      -- Not supported     --,COUNT(DISTINCT CTE.ADR) OVER (PARTITION BY CTE.RM ORDER BY CTE.ADR) AS cntDist     ,MAX(CTE.dr) OVER (PARTITION BY CTE.RM ORDER BY CTE.RM) AS cntDistEmu  FROM CTE

Note:
This assumes the fields in question are NON-nullable fields.
If there is one or more NULL-entries in the fields, you need to subtract 1.

answered Sep 21 '22 21:09

Stefan Steiger

Related questions
                            
                                How do I copy data from one table to another in postgres using copy command
                            
                                How to delete multiple rows in SQL where id = (x to y)
                            
                                WHERE Clause to find all records in a specific month
                            
                                How to check if field is null or empty in MySQL?
                            
                                Delete duplicate records in SQL Server?
                            
                                How to exclude rows that don't join with another table?
                            
                                Checking for an empty field with MySQL
                            
                                How to determine the number of days in a month in SQL Server?
                            
                                Deleting duplicate rows from sqlite database
                            
                                What are good alternatives to SQL (the language)? [closed]
                            
                                Replace Default Null Values Returned From Left Outer Join
                            
                                Relationship of Primary Key and Clustered Index
                            
                                Xcode 4 and Core Data: How to enable SQL Debugging
                            
                                Storing DateTime (UTC) vs. storing DateTimeOffset
                            
                                Removing leading zeroes from a field in a SQL statement
                            
                                Store select query's output in one array in postgres
                            
                                How to create a unique index on a NULL column?
                            
                                IF-THEN-ELSE statements in postgresql
                            
                                IN Clause with NULL or IS NULL
                            
                                Why are joins bad when considering scalability?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Partition Function COUNT() OVER possible using DISTINCT

Tags:

sql

sql-server

tsql

sql-server-2008-r2

sql-server-2014

whytheq

People also ask

2 Answers

David

Stefan Steiger

Recent Activity

Donate For Us