Data Warehouse modelling: Data Vault vs Persistent Staging Area

2 Answers

Data Vault vs. Persistent Staging Area sounds to me like apples and pears - hard to compare. You should not try to define a Data Vault to capture source data without knowing the business ontology - otherwise you're building a source system vault, which offers no or little benefit to the business. Building a Data Vault on a PSA or a data lake makes much more sense to me. Landing the data as an image of the source systems and then step by step building a sustainable data collection out of it.

163

answered Oct 05 '22 07:10

Andreas

The complexity that is added corresponds to the relational model that is introduced earlier in the Data Vault case. I guess it depends on what level you want to model your data and make it reusable across different use-cases resulting in different data marts. What I mean is that the data marts are designed for a specific business cases and the data vault model is more designed to be overarching (enterprise model). Hence, the data marts based on DV model have no need to physically materialise any data at all. A layer of views can be set up which look like star schema tables, but which in fact have:

•   Zero maintenance cost.
•   Zero storage costs.
•   High flexibility.

Additionally, it is definitely nice to know how the data is related in a more general sense (organization wide) - if that information and the mentioned advantages are justifying the extra effort to build a DV model is difficult to judge.

answered Oct 05 '22 07:10

y4nnick

Related questions
                            
                                How to insert nested json data in mysql table?
                            
                                How to represent GROUP BY with HAVING COUNT(*)>1 in relational algebra?
                            
                                How can I use Windows authentication in MVC but use the newer identity database tables for role storage?
                            
                                Postgres: why LEFT JOIN affects to query plan?
                            
                                "Coalesce" multiple columns
                            
                                Opening .scmp file in Visual Studio 2015 opens as XML and not as comparison UI
                            
                                Cannot use Tuple class with native queries
                            
                                Remove nulls from an array in SQL
                            
                                Multithreaded pyodbc connection
                            
                                Stored Procedure for dynamic data mapping
                            
                                Window functions: PARTITION BY one column after ORDER BY another
                            
                                Complex Joining multiple tables in SQL Server for a fact table
                            
                                While pivoting 13 million records tempDB gets full in SQL Server and pivoting takes more than 28 hours
                            
                                How do I join 2 tables to allocate items?
                            
                                MySQL yearweek() vs week() returning different result
                            
                                Delete statement fails when called from SSIS
                            
                                Count status Id for each day
                            
                                COUNT with WHERE clause giving more rows than without WHERE clause
                            
                                Is there a more efficient query for this task?
                            
                                Connect to Proxy (SOCKS) Database in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Data Warehouse modelling: Data Vault vs Persistent Staging Area

Tags:

sql

database

etl

data-warehouse

data-vault

user3596100

People also ask

2 Answers

Andreas

y4nnick

Recent Activity

Donate For Us