Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Select in a many-to-many relationship in MySQL

I have two tables in a MySQL database, Locations and Tags, and a third table LocationsTagsAssoc which associates the two tables and treats them as a many-to-many relationship.

Table structure is as follows:

Locations
---------
ID int (Primary Key)
Name varchar(128)

LocationsTagsAssoc
------------------
ID int (Primary Key)
LocationID int (Foreign Key)
TagID int (Foreign Key)

Tags
----
ID int (Primary Key)
Name varchar(128)

So each location can be tagged with multiple tagwords, and each tagword can be tagged to multiple locations.

What I want to do is select only Locations which are tagged with all of the tag names supplied. For example:

I want all locations which are tagged with both "trees" and "swings". Location "Park" should be selected, but location "Forest" should not.

Any insight would be appreciated. Thanks!

like image 732
Joff Williams Avatar asked Jun 08 '10 22:06

Joff Williams


1 Answers

There are two ways to do this. I prefer the first way, which is to self-join for each tag:

SELECT l.*
FROM Locations l
JOIN LocationsTagsAssoc a1 ON a1.LocationID = l.ID
JOIN Tags t1 ON a1.TagID = t1.ID AND t1.Name = ?
JOIN LocationsTagsAssoc a2 ON a2.LocationID = l.ID
JOIN Tags t2 ON a2.TagID = t2.ID AND t2.Name = ?
JOIN LocationsTagsAssoc a3 ON a3.LocationID = l.ID
JOIN Tags t3 ON a3.TagID = t3.ID AND t3.Name = ?;

The other way also works, but using GROUP BY in MySQL tends to incur a temporary table and performance is slow:

SELECT l.*
FROM Locations l
JOIN LocationsTagsAssoc a ON a.LocationID = l.ID
JOIN Tags t ON a.TagID = t.ID
WHERE t.Name IN (?, ?, ?)
GROUP BY l.ID
HAVING COUNT(*) = 3;

Re comment from @Erikoenig:

If you want to make sure there are no extra tags, you can do it this way:

SELECT l.*
FROM Locations l
JOIN LocationsTagsAssoc a ON a.LocationID = l.ID
JOIN Tags t ON a.TagID = t.ID
GROUP BY l.ID
HAVING COUNT(*) = 3 AND SUM(t.Name IN (?, ?, ?)) = 3;

Taking out the WHERE clause allows other tags to be counted, if there are any. So the COUNT() may be greater than 3.

Or if the count is exactly three tags, but some of these three are not the correct tags, then the SUM() condition in the HAVING clause makes sure that all three tags you want are present in the group.

like image 190
Bill Karwin Avatar answered Sep 29 '22 16:09

Bill Karwin