Map database schema in Power BI

I've come across a video on YouTube, "How to Easily Map Your Database Schema in Power BI", that uses Microsoft's AdventureWorks database. Now I'm trying to replicate that example with another database. The problem is that many of my columns hold the same content but have different names, with prefixes such as pk_ or fk_ depending on which table they are in. Because the query below only matches identical column names, it fails on my database:

SELECT
    c.TABLE_NAME
    ,c.COLUMN_NAME
FROM INFORMATION_SCHEMA.COLUMNS c
INNER JOIN
        (SELECT
                COLUMN_NAME
        FROM INFORMATION_SCHEMA.COLUMNS
        GROUP BY COLUMN_NAME
        HAVING COUNT(*) > 1
        ) dupes
ON dupes.COLUMN_NAME = c.COLUMN_NAME

Does anyone know whether it's possible to fuzzy-match column names, or to take the different prefixes into account, to make this work? The same question has been put directly to the YouTube OP and can also be found on reddit.com, but it remains unanswered.

I'm trying to wrap my head around some more advanced Power BI features and at the same time learn some much-needed SQL, and I thought this would be a cool place to start, so any help is much appreciated!

asked Nov 05 '18 13:11

vestland

1 Answer

If you want to show relationships between tables, relying on column names shared between two tables is not the best idea: two completely unrelated tables can easily contain a column with the same name.

For example:

CREATE TABLE tab(id INT PRIMARY KEY, name INT);
CREATE TABLE tab2(id2 INT PRIMARY KEY, name INT);
-- completely unrelated tables

SELECT
    c.TABLE_NAME
    ,c.COLUMN_NAME
FROM INFORMATION_SCHEMA.COLUMNS c
INNER JOIN
        (SELECT
                COLUMN_NAME
        FROM INFORMATION_SCHEMA.COLUMNS
        GROUP BY COLUMN_NAME
        HAVING COUNT(*) > 1
        ) dupes
ON dupes.COLUMN_NAME = c.COLUMN_NAME


+-------------+-------------+
| TABLE_NAME  | COLUMN_NAME |
+-------------+-------------+
| tab         | name        |
| tab2        | name        |
+-------------+-------------+

db<>fiddle demo

I would propose using the proper metadata views instead, i.e. sys.foreign_key_columns:

-- One row per foreign-key column pair: referencing column -> referenced column
SELECT [table] = tab1.name,
       [column] = col1.name,
       [referenced_table] = tab2.name,
       [referenced_column] = col2.name
FROM sys.foreign_key_columns fkc
JOIN sys.objects obj ON obj.object_id = fkc.constraint_object_id
JOIN sys.tables tab1 ON tab1.object_id = fkc.parent_object_id
JOIN sys.schemas sch ON tab1.schema_id = sch.schema_id
JOIN sys.columns col1 ON col1.column_id = fkc.parent_column_id
 AND col1.object_id = tab1.object_id
JOIN sys.tables tab2 ON tab2.object_id = fkc.referenced_object_id
JOIN sys.columns col2 ON col2.column_id = fkc.referenced_column_id
 AND col2.object_id = tab2.object_id;

db<>fiddle demo2
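
As a quick sanity check, here is a minimal sketch with two hypothetical tables, demo_parent and demo_child (names invented for illustration, using the pk_/fk_ prefixes from the question), linked by a declared foreign key. The lookup is a compact variant of the query above that uses the built-in OBJECT_NAME and COL_NAME functions instead of joins; it assumes SQL Server.

```sql
-- Hypothetical demo tables; the names are illustrative only.
CREATE TABLE demo_parent(pk_customer INT PRIMARY KEY);
CREATE TABLE demo_child(
    pk_order INT PRIMARY KEY,
    -- Declared foreign key: the column names differ, yet the link is recorded
    fk_customer INT REFERENCES demo_parent(pk_customer)
);

-- Resolve object and column ids to names without joining the catalog views
SELECT
    OBJECT_NAME(fkc.parent_object_id) AS [table]
    ,COL_NAME(fkc.parent_object_id, fkc.parent_column_id) AS [column]
    ,OBJECT_NAME(fkc.referenced_object_id) AS [referenced_table]
    ,COL_NAME(fkc.referenced_object_id, fkc.referenced_column_id) AS [referenced_column]
FROM sys.foreign_key_columns fkc;
```

In a database with no other foreign keys, this should return a single row: demo_child, fk_customer, demo_parent, pk_customer. The relationship is found even though the two column names differ, which is exactly the case the name-based query misses.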

Then you need to choose an appropriate visualisation method in Power BI.
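
If the database declares no foreign keys at all, the metadata views have nothing to report, and matching on names is the only fallback. The following is a rough sketch of that, assuming the only prefixes are pk_ and fk_ (as in the question) and that two columns are related whenever the rest of the name matches, which the tab/tab2 example shows is not guaranteed. BASE_NAME is a made-up alias for the name with its prefix removed.

```sql
-- Sketch: treat pk_xxx and fk_xxx as the same logical column name.
-- Assumption: 'pk_' and 'fk_' are the only prefixes in use.
WITH cols AS (
    SELECT
        c.TABLE_NAME
        ,c.COLUMN_NAME
        -- '_' is a single-character wildcard in LIKE, so it is bracketed
        -- as [_] to match a literal underscore.
        ,CASE
            WHEN c.COLUMN_NAME LIKE 'pk[_]%' OR c.COLUMN_NAME LIKE 'fk[_]%'
                THEN SUBSTRING(c.COLUMN_NAME, 4, LEN(c.COLUMN_NAME))
            ELSE c.COLUMN_NAME
         END AS BASE_NAME
    FROM INFORMATION_SCHEMA.COLUMNS c
)
SELECT
    c.TABLE_NAME
    ,c.COLUMN_NAME
    ,c.BASE_NAME
FROM cols c
INNER JOIN
        (SELECT
                BASE_NAME
        FROM cols
        GROUP BY BASE_NAME
        HAVING COUNT(*) > 1
        ) dupes
ON dupes.BASE_NAME = c.BASE_NAME
ORDER BY c.BASE_NAME, c.TABLE_NAME;
```

This keeps the shape of the original query and only swaps the join key for the stripped name. Whether the prefix comparison is case-sensitive depends on the database collation, so treat the result as a list of candidate relationships to review rather than a definitive schema map.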

answered Oct 28 '22 11:10

Lukasz Szozda

Tags: sql, sql-server, tsql, powerbi

vestland

People also ask

1 Answers

Lukasz Szozda

Recent Activity

Donate For Us