Dimension row with multiple attributes

Tags:

This isn't exactly what I am doing but I feel this is a good example:

Let's say I have a Product dimension table that connects to my ProductSales Fact table. Each row in dimProduct holds all the relevant data for a single product (code, name, description etc) and there are around a million products.

I now have a requirement to store the product categories into the warehouse. Each Product has multiple categories, averaging at 5.

Am I supposed to duplicate entire rows in the Product Dimension for each category the product fits into or am I supposed to snowflake my current star schema with a dimCategory dimension and dimProductCategory link table between the two?

I'm afraid that if I do the former then my Dimension table will become over 5 times bigger and if I do the latter then the model will become far more complex.

626

asked Jan 20 '14 21:01

Timothy Jeffreys

1 Answers

Well, for a new-comer your question is rather insightful!

If each of your product can be categorized into multiple catagories (and each product category contains n number of products), then the cardinality between Product and Product Category is many-to-many. When you have many-to-many cardinality, direct Snow Flaking is not the solution.

But I think what you mean by snowflaking here is the use of a link table between Category and Product. And in my opinion, that is the currect approach. But I would rather call this table as a Factless fact table.

Snowflaking is the solution for a one-to-many cardinality problem (e.g. 1 category contains multiple products). To resolve the many-to-many cardinality, you will need Factless Fact table that stores the keys from both category Product tables.

Remember, in case your transactional data which you load to your ProductSales fact table, already contains both Category and Product details, you might as well want to include both the Category ID and Product ID in your ProductSales fact table. You do this when you need not maintain any fixed relation between products and categories but rather the relationship is driven from the incidents that occur in actual business.

answered Sep 30 '22 15:09

hashbrown

Related questions
                            
                                Why Index is not used with subquery
                            
                                why "extra characters after command" error shown for the sed command line shown?
                            
                                how to have a double while loop in sql server 2008
                            
                                Is it possible to select more rows than a table contains?
                            
                                Postgres Copy - Importing an integer with a comma
                            
                                Pivot Table with many to many table
                            
                                #1072 - Key column 'role_id' doesn't exist in table
                            
                                Generate DDL SQL create table statement after scanning CSV file
                            
                                How to know that if a Insert query was succesfull in anorm?
                            
                                Comparing equality of date and datetime in SQL Server
                            
                                Create user defined operator with left/right sides
                            
                                How to efficiently design database of multi list application
                            
                                How can I create a complex query that sum conditions with 2 tables?
                            
                                How to do sum of col results from a SQL Stored Procedure [duplicate]
                            
                                Why is MySQL is giving an incorrect count for a simple query?
                            
                                SQL measure stored procedure execution time
                            
                                Extracting alpha and numeric parts from a column
                            
                                INSERT INTO Access DB with Python/pyodbc
                            
                                How to get position of regexp match in string in PostgreSQL?
                            
                                Best practice for fixed number of strings in MySQL?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Dimension row with multiple attributes

Tags:

sql

tsql

database-design

data-warehouse

dimensional-modeling

Timothy Jeffreys

People also ask

1 Answers

hashbrown

Recent Activity

Donate For Us