I have the following data in a SQL Table: <img src="https://i.stack.imgur.com/eps9X.png" alt="enter image description here"> I need to query the data so I can get a list of missing "familyid" per employee. For example, I should get for Employee 1021 that is missing in the sequence the IDs: 2 and 5 and for Employee 1027 should get the missing numbers 1 and 6. Any clue on how to query that? Appreciate any help.

Find the first missing value I would use the <code>ROW_NUMBER</code> window function to assign the "correct" sequence ID number. Assuming that the sequence ID restarts every time the employee ID changes: <pre class="prettyprint"><code>SELECT e.id, e.name, e.employee_number, e.relation, e.familyid, ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid FROM employee_members e </code></pre> Then, I would filter the result set to only include the rows with mismatching sequence IDs: <pre class="prettyprint"><code>SELECT * FROM ( SELECT e.id, e.name, e.employee_number, e.relation, e.familyid, ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid FROM employee_members e ) a WHERE a.familyid <> a.sequenceid </code></pre> Then again, you should easily group by <code>employee_number</code> and find the first missing sequence ID for each employee: <pre class="prettyprint"><code>SELECT a.employee_number, MIN(a.sequence_id) AS first_missing FROM ( SELECT e.id, e.name, e.employee_number, e.relation, e.familyid, ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid FROM employee_members e ) a WHERE a.familyid <> a.sequenceid GROUP BY a.employee_number </code></pre> Finding all the missing values Extending the previous query, we can detect a missing value every time the difference between <code>familyid</code> and <code>sequenceid</code> changes: <pre class="prettyprint"><code>-- Warning: this is totally untested :-/ SELECT b.employee_number, MIN(b.sequence_id) AS missing FROM ( SELECT a.*, a.familyid - a.sequenceid AS displacement SELECT e.*, ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid FROM employee_members e ) a ) b WHERE b.displacement <> 0 GROUP BY b.employee_number, b.displacement </code></pre>

SQL query find missing consecutive numbers

2 Answers

Find the first missing value

I would use the ROW_NUMBER window function to assign the "correct" sequence ID number. Assuming that the sequence ID restarts every time the employee ID changes:

SELECT
  e.id,
  e.name,
  e.employee_number,
  e.relation,
  e.familyid,
  ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid
FROM employee_members e

Then, I would filter the result set to only include the rows with mismatching sequence IDs:

SELECT *
FROM (
  SELECT
    e.id,
    e.name,
    e.employee_number,
    e.relation,
    e.familyid,
    ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid
  FROM employee_members e
) a
WHERE a.familyid <> a.sequenceid

Then again, you should easily group by employee_number and find the first missing sequence ID for each employee:

SELECT
  a.employee_number,
  MIN(a.sequence_id) AS first_missing
FROM (
  SELECT
    e.id,
    e.name,
    e.employee_number,
    e.relation,
    e.familyid,
    ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid
  FROM employee_members e
) a
WHERE a.familyid <> a.sequenceid
GROUP BY a.employee_number

Finding all the missing values

Extending the previous query, we can detect a missing value every time the difference between familyid and sequenceid changes:

-- Warning: this is totally untested :-/
SELECT
  b.employee_number,
  MIN(b.sequence_id) AS missing
FROM (
  SELECT
    a.*,
    a.familyid - a.sequenceid AS displacement
    SELECT
      e.*,
      ROW_NUMBER() OVER(PARTITION BY e.employeeid ORDER BY familyid) - 1 AS sequenceid
    FROM employee_members e
  ) a
) b
WHERE b.displacement <> 0
GROUP BY
  b.employee_number,
  b.displacement

117

answered Sep 21 '22 12:09

Danilo Piazzalunga

Here is one approach. Calculate the maximum family id for each employee. Then join this to a list of numbers up to the maximum family id. The result has one row for each employee and expected family id.

Do a left outer join from this back to the original data, on the familyid and the number. Where nothing matches, those are the missing values:

with nums as (
      select 1 as n
      union all
      select n+1
      from nums
      where n < 20
     )
select en.employee, n.n as MissingFamilyId
from (select employee, min(familyid) as minfi, max(familyid) as maxfi
      from t
      group by employee
     ) en join
     nums n
     on n.n <= maxfi left outer join
     t
     on t.employee = en.employee and
        t.familyid = n.n
where t.employee_number is null;

Note that this will not work when the missing familyid is that last number in the sequence. But it might be the best that you can do with your data structure.

Also the above query assumes that there are at most 20 family members.

answered Sep 21 '22 12:09

Gordon Linoff

Related questions
                            
                                MS Access 2010 query pulls same records multiple times, sql challenge
                            
                                SQL convert 'DDMMYY' to datetime
                            
                                Conditional CASE statement syntax
                            
                                Which is more suitable for prices calculations in Firebird: decimal or numeric?
                            
                                How to get the call log from specific date in android
                            
                                SQL Query: Calculating the deltas in a time series
                            
                                Extract rows based on multiple previous rows' values in SQL Server
                            
                                Updating 4 million records in SQL server using list of record-ids as input
                            
                                PostgreSQL: strange collision of ORDER BY and LIMIT/OFFSET
                            
                                SQL Email Verification Function using Regex
                            
                                MySQL Case/If/Then
                            
                                Algorithm for counting common group memberships with big data
                            
                                SQL count how many times a value appears in multiple columns?
                            
                                how to call a stored procedure in where clause of SQL
                            
                                How to select distinct year from a datetime column and add the result to a comboBox in C#?
                            
                                SQL - How to do a group by without having to pass all the columns from the select?
                            
                                Powershell Script using ExecuteNonQuery() throws exception "Incorrect syntax near 's'."
                            
                                How to group by week (7 days) in SQL Server
                            
                                Insert character into SQL string
                            
                                Value of lastrowid after "insert or ignore"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SQL query find missing consecutive numbers

Tags:

sql

sql-server

VAAA

People also ask

2 Answers

Danilo Piazzalunga

Gordon Linoff

Recent Activity

Donate For Us