I am trying to calculate running count for each 2 rows like below, <pre class="prettyprint"><code>CREATE TABLE sales ( EmpId INT, Yr INT, Sales DECIMAL(8,2) ) INSERT INTO sales (EmpId, Yr, Sales) VALUES (1, 2005, 12000), (1, 2006, 18000), (1, 2007, 25000), (1, 2008, 25000), (1, 2009, 25000), (2, 2005, 15000), (2, 2006, 6000), (2, 2007, 6000) SELECT EmpId, Yr, sales, SUM(Sales) OVER (PARTITION BY empid ORDER BY empid ROWS BETWEEN 2 PRECEDING AND CURRENT ROW ) AS TotalSales2 FROM sales </code></pre> Output: <pre class="prettyprint"><code>EmpId Yr sales TotalSales2 ----------------------------------- 1 2005 12000 12000 1 2006 18000 30000 1 2007 25000 55000 1 2008 25000 68000 1 2009 25000 75000 2 2005 15000 15000 2 2006 6000 21000 2 2007 6000 27000 </code></pre> But expected output: <pre class="prettyprint"><code>EmpId Yr Sales TotalSales2 ----------------------------------- 1 2005 12000 12000 1 2006 18000 30000 1 2007 25000 25000 1 2008 25000 50000 1 2009 25000 25000 2 2005 15000 15000 2 2006 6000 21000 2 2007 6000 6000 </code></pre> What am I doing wrong in this query? Note: SQL Servre version is 2012.

The expression: <pre class="prettyprint"><code>SUM(Sales) OVER (PARTITION BY empid ORDER BY empid ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) </code></pre> calculates the sum considering the current row and the 2 rows immediately preceding it. So it actually calculates a rolling sum, which is what you really don't want. I think you are actually looking for something like the following: <pre class="prettyprint"><code>;WITH CTE_Group AS ( SELECT EmpId, Yr, sales, (ROW_NUMBER() OVER (PARTITION BY empid ORDER BY yr) + 1 ) / 2 AS grp FROM sales ) SELECT EmpId, Yr, sales, SUM(sales) OVER (PARTITION BY empid, grp ORDER BY yr) AS TotalSales2 FROM CTE_Group </code></pre> The above query uses a <code>CTE</code> in order to calculate field <code>grp</code>: the value of this field is <code>1</code> for the first two records of an <code>empid</code> partition, <code>2</code> for the next two records, and so on. Using <code>grp</code> we can calculate the running total of <code>sales</code> for groups of 2 as is the requirement of the OP. Demo here Edit: To offset a larger group of records try using (credit goes to @Max Szczurek for pointing this out): <pre class="prettyprint"><code>(ROW_NUMBER() OVER (PARTITION BY empid ORDER BY yr) - 1 ) / n AS grp </code></pre> where <code>n</code> is the number of records each group contains.

Running count for each 2 rows

Tags:

sql

sql-server

sql-server-2012

I am trying to calculate running count for each 2 rows like below,

CREATE TABLE sales
(
     EmpId INT, 
     Yr INT, 
     Sales DECIMAL(8,2)
)

INSERT INTO sales (EmpId, Yr, Sales)
VALUES (1, 2005, 12000), (1, 2006, 18000), (1, 2007, 25000),
       (1, 2008, 25000), (1, 2009, 25000),
       (2, 2005, 15000), (2, 2006, 6000), (2, 2007, 6000)

SELECT 
    EmpId, Yr, sales, 
    SUM(Sales) OVER (PARTITION BY empid ORDER BY empid ROWS BETWEEN 2 PRECEDING AND CURRENT ROW ) AS TotalSales2
FROM 
    sales

Output:

EmpId   Yr      sales   TotalSales2
-----------------------------------
  1     2005    12000      12000
  1     2006    18000      30000
  1     2007    25000      55000
  1     2008    25000      68000
  1     2009    25000      75000
  2     2005    15000      15000
  2     2006     6000      21000
  2     2007     6000      27000

But expected output:

EmpId   Yr     Sales    TotalSales2
-----------------------------------
  1     2005    12000   12000
  1     2006    18000   30000
  1     2007    25000   25000   
  1     2008    25000   50000
  1     2009    25000   25000   
  2     2005    15000   15000
  2     2006     6000   21000
  2     2007     6000    6000

What am I doing wrong in this query?

Note: SQL Servre version is 2012.

526

asked Mar 29 '17 06:03

MMMMS

2 Answers

SELECT EmpId, Yr, Sales, 
    CASE WHEN ROW_NUMBER() OVER (PARTITION BY EmpId ORDER BY yr) % 2 = 0 
    THEN sales + lag(sales, 1, 0) OVER (PARTITION BY empid ORDER BY yr) 
    ELSE sales 
    END AS TotalSales2
FROM sales

Lag returns the previous row's value - when row_number() is even, add the current row's value to the previous row - otherwise, just show the sales for the current row. Partition each by EmpId, order each by yr - output matches the expected.

Also, thanks so much for adding the DDL/sample data.

answered Sep 18 '22 09:09

Max Szczurek

The expression:

SUM(Sales) OVER (PARTITION BY empid 
                 ORDER BY empid 
                 ROWS BETWEEN 2 PRECEDING AND CURRENT ROW)

calculates the sum considering the current row and the 2 rows immediately preceding it. So it actually calculates a rolling sum, which is what you really don't want.

I think you are actually looking for something like the following:

;WITH CTE_Group AS (
    SELECT EmpId, Yr, sales,        
          (ROW_NUMBER() OVER (PARTITION BY empid ORDER BY yr) + 1 ) / 2 AS grp
    FROM sales      
)
SELECT EmpId, Yr, sales,
       SUM(sales) OVER (PARTITION BY empid, grp 
                        ORDER BY yr) AS TotalSales2
FROM CTE_Group

The above query uses a CTE in order to calculate field grp: the value of this field is 1 for the first two records of an empid partition, 2 for the next two records, and so on.

Using grp we can calculate the running total of sales for groups of 2 as is the requirement of the OP.

Demo here

Edit:

To offset a larger group of records try using (credit goes to @Max Szczurek for pointing this out):

(ROW_NUMBER() OVER (PARTITION BY empid ORDER BY yr) - 1 ) / n AS grp

where n is the number of records each group contains.

answered Sep 19 '22 09:09

Giorgos Betsos

Related questions
                            
                                How to delete files on the directory via MS SQL Server
                            
                                What SQL-server function can I use to get the character or byte length of a nvarchar(max) column?
                            
                                How to set a default value for one column in SQL based on another column
                            
                                How can i speed up this Indexed View?
                            
                                Migrating Oracle DATE columns to TIMESTAMP with timezone
                            
                                RBAR vs. Set based programming for SQL
                            
                                Wrong week number using DATEPART in SQL Server
                            
                                How to properly name record creation(insertion) datetime field?
                            
                                One INSERT with UNIONS or multiple INSERTS?
                            
                                Why is executemany slow in Python MySQLdb?
                            
                                SQL Query for Grouping the results based on sequence
                            
                                Delete rows from SQL Server with WHERE statement from different tables
                            
                                Search in SQL where string starts with X
                            
                                Can Indices actually decrease SELECT performance?
                            
                                Select sum and inner join
                            
                                Oracle SQL Developer 3.1.07 extra spaces between characters using listagg
                            
                                AND field NOT IN(NULL) returns an empty set [duplicate]
                            
                                Change TEXT column default from null to '' (empty string)
                            
                                SQL replace dot with comma
                            
                                SQLite IntegrityError: UNIQUE constraint failed:

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With