SELECT Only Records With Duplicate (Column A || Column B) But Different (Column C) Values

Tags:

I apologize for the confusing title, I can't figure out the proper wording for this question. Instead, I'll just give you the background info and the goal:

This is in a table where a person may or may not have multiple rows of data, and those rows may contain the same value for the activity_id, or may not. Each row has an auto-incremented ID. The people do not have a unique identifier attached to their names, so we can only use first_name/last_name to identify a person.

I need to be able to find the people that have multiple rows in this table, but only the ones who have multiple rows that contain more than one different activity_id.

Here's a sample of the data we're looking through:

unique_id | first_name    |   last_name    |    activity_id
---------------------------------------------------------------
 1        | ted           | stevens        | 544
 2        | ted           | stevens        | 544
 3        | ted           | stevens        | 545
 4        | ted           | stevens        | 546
 5        | rachel        | jameson        | 633
 6        | jennifer      | tyler          | 644
 7        | jennifer      | tyler          | 655
 8        | jennifer      | tyler          | 655
 9        | jack          | fillion        | 544
 10       | mallory       | taylor         | 633
 11       | mallory       | taylor         | 633

From that small sample, here are the records I would want returned:

unique_id | first_name    |   last_name    |    activity_id
---------------------------------------------------------------
 dontcare | ted           | stevens        | 544
 dontcare | jennifer      | tyler          | 655

Note that which value of unique_id gets returned is irrelvant, as long as it's one of the unique_ids belonging to that person, and as long as only one record is returned for that person.

Can anyone figure out how to write a query like this? I don't care what version of SQL you use, I can probably translate it into Oracle if it's somehow different.

830

asked Sep 11 '13 21:09

BumbleShrimp

2 Answers

I would do:

SELECT first_name, last_name, COUNT(DISTINCT activity_id)
FROM <table_name>
GROUP BY first_name, last_name
HAVING COUNT(DISTINCT activity_id) > 0;

195

answered Sep 22 '22 04:09

adona9

I'll build through the logic with you. First, lets find all people that have more than one entry:

Unique list of name + activity ID:

select first_name, last_name,activity_id, count(1)
from yourtable
group by first_name, last_name,activity_id

Now we'll turn that into a subquery and look for users with more than 1 activity_ID

Select first_name, last_name
from 
    (select first_name, last_name,activity_id, count(1)
    from yourtable
    group by first_name, last_name,activity_id) a
group by  first_name, last_name
having count(1) > 1

Should work as that...I didn't return an activity_id, adding max(activity_id) to the select statement will grab the highest one.

answered Sep 18 '22 04:09

Twelfth

Related questions
                            
                                How to use inner join in select with mysql?
                            
                                PHP Script to convert .DBF files to .MYSQL
                            
                                How to do "while fetch assoc" in CodeIgniter?
                            
                                Socket.io private message notification
                            
                                substr doesn't work fine with utf8
                            
                                SQL limit number of groups
                            
                                php+mysql Fatal error: Call to undefined function mysql_connect() in
                            
                                How to to fix XAMPP after deleting too many MySQL databases?
                            
                                Is there an equivalent of SQL_CALC_FOUND_ROWS in SQL Server?
                            
                                mysql get month from timestamp not working
                            
                                Why is MySQL showing index_merge on this query?
                            
                                WordPress Plugin Development - Can't drop database table after plugin deactivation?
                            
                                Can't Restart MySQL After Editing my.cnf on Mac OS X 10.8.3
                            
                                phpActiveRecord Incorrect DateTimeFormat
                            
                                Restart Mysql automatically when ubuntu on EC2 micro instance kills it when running out of memory
                            
                                add column before existing column?
                            
                                Sum columns depending on another column value
                            
                                Selecting range of records with MySql
                            
                                How can I implement model revisions in Laravel?
                            
                                Using MYSQL to select rows between dates?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SELECT Only Records With Duplicate (Column A || Column B) But Different (Column C) Values

Tags:

sql

mysql

oracle

BumbleShrimp

People also ask

2 Answers

adona9

Twelfth

Recent Activity

Donate For Us