How to best design address locations in any SQL Database?

Tags:

database-design

Overview

I'm working on some Emergency Services reporting and mapping application for California (kind of weird, considering the fires there, right now...). We need to map demographic and emergency data for an internal govt unit.

What we have are all the streets, cities and neighborhoods in California. Each neighborhood also has it's relevant shapefile (lat long that defines it's boundaries). This was given to us by the US Census board (all public domain stuff) website.

Problem

I'm not sure how to best design the DB tables. We haven't been told what type of DB we need to use .. so we're open to suggestions if that helps. We have experience with MS SQL 2005 and 2008 (and the spatial stuff in '08).

We can have the following legit data.

Street, City, State
City, State
Neighborhood, State
State

The reason why State is a legit location is because we're told this might be sold to other states, so we need to plan for that now.

So, originally, i thought of this...

LocationId INTEGER PK Identity
Street NVARCHAR(100)
Neighbourhood NVARCHAR(100)
City NVARCHAR(100)
State NVARCHAR(100)
Latitude VARCHAR(15)
Longitude VARCHAR(15)
Shapefile

None of those are nullable, btw. But after a short while, i thought that it was a waste to have so many 'California' text or 'San Diego' text in the fields. So i changed the table to be more normalised by making the Neighborhood, City and State fields a foreign key to their own new table (eg. lookups) .. and those two fields are now NULLABLE.

So .. that all works fine. except when i try and do some Sql statements on them. Because of the NULLABLE FK's, it's a nightmare to make all these outer join queries :(

What about having the main table, the sub-lookup tables (eg. Neighbourhoods, Cities and States) linked via ID's and then place all this in a view? Remember, NeighborhoodID and CitiyID would be NULLABLE.. ???

I just want to see people's thoughts on this and the reasons they made their suggestions, please. I'm really worried and confused but are eager to learn.

Please help!

edit 1: I need to stick to an RDBMS Database.

edit 2: I'm thinking about going a single table (de-normalized) with constraints to keep the sum of the fields unqiue OR multi-tables with nullable FK's on the main table (eg. Locations (main table), Neighborhoods, Cities, States ... normalized db schema).

edit 3: Added City to the sample, second list.

edit 4: Added view question.

749

asked Nov 18 '08 04:11

Pure.Krome

1 Answers

Taking the example:

Street, City, State
City, State
Neighborhood, State
State

Firstly go back to basic principles, all of the above are distinct geospatial entities, so your address is composed of a name, and one or many geospatial specifiers. This tells us that we really should be storing them in a single table. The key here is to think of the data more abstractly,

So your address table needs a 1-many relationship to another table, called address_entities which is as follows:

int ID
varchar() name
varchar() type
int parentID
geography position.
int parentID

This means that you will obviously need a table to link the address to the address entity table above. Now, each geospatial entity is inherently hierarchical, and whilst it makes the SQL harder, and personally I try to avoid self referencing tables there are times when it is a good solution and this is one of them.

The benefits are huge, even though it makes the code harder, it is worth it in the long run.

Also, even when it isn't an immediate requirement, think globally, not all addresses in the world have a street, or state, for example,in france a valid address could be

- la Maison des Fou
- 24500 Eymet

So, bear that in mind when designing schemas.

answered Oct 23 '22 06:10

Richard Harrison

Related questions
                            
                                Table Normalization (Parse comma separated fields into individual records)
                            
                                How do you create and update your SQL scripts?
                            
                                Database design structure for storing courses chapters topics subtopics
                            
                                Avoiding Joins to Reference/Lookup Tables in Doctrine 2
                            
                                Which is more efficient: One long Single Table or Distributed Table? and Why?
                            
                                Database build process management
                            
                                Database modeling for a weak entity
                            
                                MYSQL - One Column Referenced to Multiple Table
                            
                                Can a multi-field UNIQUE constraint include fields from more than one table?
                            
                                single table inheritance (with Ebean + Play! framework)
                            
                                Making a simple email/messaging database schema
                            
                                How to design a database for translation dictionary?
                            
                                How do I forward engineer in MySQL Workbench without losing objects?
                            
                                Represent Is"Something" Data in SQL-Server [closed]
                            
                                Foreign Keys are evil? [closed]
                            
                                Any resources for common data model design patterns?
                            
                                Foreign key constraints involving multiple tables
                            
                                Set value of column based on another column in Postgres?
                            
                                DynamoDB performance/cost difference between map or separate attributes
                            
                                Ignore cascade on foreign key update?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With