SQL query for run-length, or consecutive identical value encoding

Question

My goal is to take a set of data that is ordered by id and return a resultset that indicates the number of consecutive rows where the val column is identical. E.g. given this data:

| id | val |
|  1 |  33 |
|  2 |  33 |
|  3 |  44 |
|  4 |  28 |
|  5 |  44 |
|  6 |  44 |

I would like to see this result:

| id | val | run_length |
| 1  | 33  | 2          |
| 3  | 44  | 1          |
| 4  | 28  | 1          |
| 5  | 44  | 2          |

The id column in the resultset is optional. In fact, if it makes it significantly harder, then just leave that column out of the result. I sort of like having it because it "pins" the resultset to a particular location in the table.

I am primarily interested in the result in free database engines. My order of preference for a solution is:

SQLite
Postgres
MySQL
Oracle
SQL Server
Sybase

Gordon Linoff · Accepted Answer

I'll choose #2 on your list, because this is incredibly painful to do in SQLite with a single query. The following is standard SQL:

select min(id), val, count(*) as runlength
from (select t.*,
             (row_number() over (order by id) -
              row_number() over (partition by val order by id)
             ) as grp
      from data t
     ) t
group by grp, val;

This uses the difference of two row number calculations to identify the seuqnces of identical values. It should work in the recent versions of databases 2, 4, 5, and 6.

SQL query for run-length, or consecutive identical value encoding

Tags:

sql

sqlite

mysql

postgresql

gaps-and-islands

Terrence Brannon

1 Answers

Gordon Linoff

Recent Activity

Donate For Us

SQL query for run-length, or consecutive identical value encoding

Tags:

sql

sqlite

mysql

postgresql

gaps-and-islands

Terrence Brannon

1 Answers

Gordon Linoff

Related questions

Recent Activity

Donate For Us