Database Logging Table Structure

Tags:

database-design

I'm creating some database tables for storing log entries. Under normal circumstances I would always normalize and never stuff values together, but I'm not 100% convinced that's a good idea here.

I could normalize and have:

LogEntry
LogEntryCategory
LogCategory
LogEntryProperty

LogEntry has a many-to-many relationship with LogCategory (through LogEntryCategory), and a one-to-many relationship with LogEntryProperty, whose rows are name/value pairs.

The alternative is a denormalized version with just LogEntry, where categories are stored as a comma-delimited list of strings and properties as a comma-delimited list of Name: Value pairs. As ugly as this sounds, from a reporting, performance, and searchability perspective, I'm not sure it isn't the better option.

Which is a better idea?

Thanks.


asked Jul 19 '11 18:07

Jeff

1 Answer

Because there are only a few distinct properties, I would stay away from name-value pairs and give each property a separate table with a proper name and data type. I have used the generic names Property_1 through Property_5 just for the demo.

[Image: data model diagram]

The key point is not to insert a row into a property table when the value is missing; in other words, all property value columns are NOT NULL.
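As a minimal sketch of how such tables could be declared: the table names and the columns used in the view come from this answer, while the data types, lengths, and constraint details below are assumptions for illustration only.

```sql
-- Sketch only: data types and lengths are assumptions, not part of the original design.
create table dbo.LogEntry (
      LogEntryId integer  not null primary key
    , LogTime    datetime not null
);

create table dbo.LogCategory (
      LogCategoryID   integer     not null primary key
    , LogCategoryName varchar(50) not null unique
);

-- Many-to-many link between entries and categories.
create table dbo.LogEntryCategory (
      LogEntryId    integer not null references dbo.LogEntry (LogEntryId)
    , LogCategoryID integer not null references dbo.LogCategory (LogCategoryID)
    , primary key (LogEntryId, LogCategoryID)
);

-- One table per property; a row exists only when the entry has that property,
-- so the value column never needs to be NULL.
create table dbo.Property_1 (
      LogEntryId integer      not null primary key
                              references dbo.LogEntry (LogEntryId)
    , p1_Value   varchar(100) not null
);
```

Property_2 through Property_5 would follow the same pattern, each with its own value column and data type. Making LogEntryId the primary key of each property table keeps it to at most one row per log entry.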

To make life easier, define a view

create view dbo.vLogs AS
select
      LogCategoryName
    , LogTime
    , p1_Value
    , p2_Value
    , p3_Value
    , p4_Value
    , p5_Value  
from LogEntry              as e
left join Property_1       as p1 on p1.LogEntryId   = e.LogEntryId
left join Property_2       as p2 on p2.LogEntryId   = e.LogEntryId
left join Property_3       as p3 on p3.LogEntryId   = e.LogEntryId
left join Property_4       as p4 on p4.LogEntryId   = e.LogEntryId
left join Property_5       as p5 on p5.LogEntryId   = e.LogEntryId
left join LogEntryCategory as x  on x.LogEntryId    = e.LogEntryId
left join LogCategory      as c  on c.LogCategoryID = x.LogCategoryID

This view (query) looks complicated and long; however, if you try a query like the one below and look at the execution plan, you may notice that property tables which are not mentioned in the select list are not included in the plan (not touched).

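-- Example time window; the values are placeholders.
declare @from_time datetime = '2011-07-18';
declare @to_time   datetime = '2011-07-19';

select
      LogCategoryName
    , LogTime
    , p1_Value
    , p2_Value
from dbo.vLogs
where LogCategoryName = 'some_category'
  and LogTime between @from_time and @to_time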

The same holds if you need something simple, like this:

select max(p1_Value)
from dbo.vLogs
where LogTime between '2011-07-18' and '2011-07-19'

Here is the execution plan; as you can see, only two tables are involved.

[Image: execution plan]

This is called table (join) elimination, and you need SQL Server, Oracle, PostgreSQL 9.x, ... for it to work; it will not work on MySQL (yet).
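To check whether elimination actually happens on your own system, one option in SQL Server is to ask for the estimated plan as text. This is only a sketch: `go` is a batch separator understood by tools such as SSMS and sqlcmd rather than a T-SQL statement, and the date range is just the example used above.

```sql
-- SQL Server: return the estimated plan as text instead of executing the query.
-- SET SHOWPLAN_TEXT must be the only statement in its batch.
set showplan_text on;
go

select max(p1_Value)
from dbo.vLogs
where LogTime between '2011-07-18' and '2011-07-19';
go

set showplan_text off;
go
```

Property tables that the query does not reference should be absent from the output. In general the optimizer can drop a left-joined table only when the join cannot change the number of rows returned, which usually means the join column is unique in that table, for example when LogEntryId is the primary key of each property table.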

Each time a property is added, you would have to add a new table and modify the view.
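For example, adding a sixth property might look roughly like this. Property_6, p6_Value, and the decimal type are hypothetical placeholders; the rest of the view is repeated from the definition above.

```sql
-- Hypothetical new property table; the name and data type are placeholders.
create table dbo.Property_6 (
      LogEntryId integer        not null primary key
                                references dbo.LogEntry (LogEntryId)
    , p6_Value   decimal(19, 4) not null
);
go

-- ALTER VIEW must be the first statement in its batch, hence the separator above.
alter view dbo.vLogs as
select
      LogCategoryName
    , LogTime
    , p1_Value
    , p2_Value
    , p3_Value
    , p4_Value
    , p5_Value
    , p6_Value
from LogEntry              as e
left join Property_1       as p1 on p1.LogEntryId   = e.LogEntryId
left join Property_2       as p2 on p2.LogEntryId   = e.LogEntryId
left join Property_3       as p3 on p3.LogEntryId   = e.LogEntryId
left join Property_4       as p4 on p4.LogEntryId   = e.LogEntryId
left join Property_5       as p5 on p5.LogEntryId   = e.LogEntryId
left join Property_6       as p6 on p6.LogEntryId   = e.LogEntryId
left join LogEntryCategory as x  on x.LogEntryId    = e.LogEntryId
left join LogCategory      as c  on c.LogCategoryID = x.LogCategoryID;
go
```

Existing queries against dbo.vLogs keep working unchanged, since they simply do not reference the new column.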

answered Sep 23 '22 01:09

Damir Sudarevic
