Database design for user settings

Which of the following options, if any, is considered best practice when designing a table used to store user settings?

(OPTION 1)

USER_SETTINGS
-Id
-Code (example "Email_LimitMax")
-Value (example "5")
-UserId

(OPTION 2)

Create a new table for each group of settings. For example, notification settings would require you to create:

"USER_ALERT_SETTINGS"
-Id
-UserId
-EmailAdded (e.g. true)
-EmailRemoved
-PasswordChanged
...

"USER_EMAIL_SETTINGS"
-Id
-UserId
-EmailLimitMax
...

(OPTION 3)

"USER"
-Name
...
-ConfigXML

576

asked Apr 18 '12 07:04

001

2 Answers

Other answers have ably outlined the pros and cons of your various options.

I believe that your Option 1 (property bag) is the best overall design for most applications, especially if you build in some protections against the weaknesses of property bags.

See the following ERD:

Property Bag ERD

In the above ERD, the USER_SETTING table is very similar to OP's. The difference is that instead of varchar Code and Value columns, this design has a FK to a SETTING table which defines the allowable settings (Codes) and two mutually exclusive columns for the value. One option is a varchar field that can take any kind of user input, the other is a FK to a table of legal values.

The SETTING table also has a flag that indicates whether user settings should be defined by the FK or by unconstrained varchar input. You can also add a data_type to the SETTING to tell the system how to encode and interpret the USER_SETTING.unconstrained_value. If you like, you can also add the SETTING_GROUP table to help organize the various settings for user-maintenance.

This design allows you to table-drive the rules around what your settings are. This is convenient, flexible and easy to maintain, while avoiding a free-for-all.
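As a rough illustration of the description above, here is a minimal sketch of how the three core tables could be declared so that the database itself enforces the foreign keys and the "two mutually exclusive columns" rule. It uses Python's built-in sqlite3 module only so that it runs anywhere. The constraint details, the UNIQUE key, and the helper names open_db and try_insert are my assumptions for the example, not something taken from the ERD.

```python
import sqlite3

# Hypothetical schema sketch: table and column names follow the ERD description,
# but the exact constraints are assumptions made for this example.
SCHEMA = """
CREATE TABLE setting (
    id          INTEGER PRIMARY KEY,
    description TEXT NOT NULL,
    constrained INTEGER NOT NULL CHECK (constrained IN (0, 1))
);
CREATE TABLE allowed_setting_value (
    id         INTEGER PRIMARY KEY,
    setting_id INTEGER NOT NULL REFERENCES setting(id),
    item_value TEXT NOT NULL
);
CREATE TABLE user_setting (
    id                       INTEGER PRIMARY KEY,
    user_id                  INTEGER NOT NULL,
    setting_id               INTEGER NOT NULL REFERENCES setting(id),
    allowed_setting_value_id INTEGER REFERENCES allowed_setting_value(id),
    unconstrained_value      TEXT,
    -- exactly one of the two value columns must be filled in
    CHECK ((allowed_setting_value_id IS NULL) <> (unconstrained_value IS NULL)),
    UNIQUE (user_id, setting_id)
);
"""


def open_db():
    """Create an in-memory database with the sketch schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK checks off by default
    conn.executescript(SCHEMA)
    return conn


def try_insert(conn, row):
    """Return True if the USER_SETTING row is accepted, False if a constraint rejects it.

    row is (user_id, setting_id, allowed_setting_value_id, unconstrained_value).
    """
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO user_setting "
                "(user_id, setting_id, allowed_setting_value_id, unconstrained_value) "
                "VALUES (?, ?, ?, ?)",
                row,
            )
        return True
    except sqlite3.IntegrityError:
        return False
```

With this sketch, a row that fills in both value columns, neither of them, or an unknown setting or allowed value is rejected by the database. It does not check that the chosen allowed value belongs to the same setting, or that the column used matches SETTING.constrained; those rules would still need a composite key, a trigger, or application code.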

EDIT: A few more details, including some examples...

Note that the ERD, above, has been augmented with more column details (range values on SETTING and columns on ALLOWED_SETTING_VALUE).

Here are some sample records for illustration.

SETTING:
+----+------------------+-------------+--------------+-----------+-----------+
| id | description      | constrained | data_type    | min_value | max_value |
+----+------------------+-------------+--------------+-----------+-----------+
| 10 | Favourite Colour | true        | alphanumeric | {null}    | {null}    |
| 11 | Item Max Limit   | false       | integer      | 0         | 9001      |
| 12 | Item Min Limit   | false       | integer      | 0         | 9000      |
+----+------------------+-------------+--------------+-----------+-----------+

ALLOWED_SETTING_VALUE:
+-----+------------+--------------+-----------+
| id  | setting_id | item_value   | caption   |
+-----+------------+--------------+-----------+
| 123 | 10         | #0000FF      | Blue      |
| 124 | 10         | #FFFF00      | Yellow    |
| 125 | 10         | #FF00FF      | Pink      |
+-----+------------+--------------+-----------+

USER_SETTING:
+------+---------+------------+--------------------------+---------------------+
| id   | user_id | setting_id | allowed_setting_value_id | unconstrained_value |
+------+---------+------------+--------------------------+---------------------+
| 5678 | 234     | 10         | 124                      | {null}              |
| 7890 | 234     | 11         | {null}                   | 100                 |
| 8901 | 234     | 12         | {null}                   | 1                   |
+------+---------+------------+--------------------------+---------------------+

From these tables, we can see that some of the user settings which can be determined are Favourite Colour, Item Max Limit and Item Min Limit. Favourite Colour is a pick list of alphanumerics. Item min and max limits are numerics with allowable range values set. The SETTING.constrained column determines whether users are picking from the related ALLOWED_SETTING_VALUEs or whether they need to enter a USER_SETTING.unconstrained_value. The GUI that allows users to work with their settings needs to understand which option to offer and how to enforce both the SETTING.data_type and the min_value and max_value limits, if they exist.

Using this design, you can table-drive the allowable settings, including enough metadata to enforce some rudimentary constraints/sanity checks on the values selected (or entered) by users.
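To make the "rudimentary sanity checks" a little more concrete, here is a minimal sketch of the kind of check the GUI or application layer could run before saving an unconstrained value. The Setting class and the check_unconstrained_value function are hypothetical names invented for this example. The sketch assumes that min_value and max_value are stored as strings (as in the sample data) and that only the 'integer' data type needs range checking.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Setting:
    """In-memory copy of one SETTING row (hypothetical helper type)."""
    description: str
    constrained: bool
    data_type: str                   # 'integer' or 'alphanumeric', as in the sample rows
    min_value: Optional[str] = None  # stored as text, like the sample data
    max_value: Optional[str] = None


def check_unconstrained_value(setting: Setting, raw: str) -> Optional[str]:
    """Return None if raw is acceptable for the setting, else a short error message."""
    if setting.constrained:
        # Constrained settings are pick lists; free text is not allowed.
        return "value must be chosen from ALLOWED_SETTING_VALUE"
    if setting.data_type == "integer":
        try:
            number = int(raw)
        except ValueError:
            return "not an integer"
        if setting.min_value is not None and number < int(setting.min_value):
            return "below min_value"
        if setting.max_value is not None and number > int(setting.max_value):
            return "above max_value"
    # Other data types are accepted as-is in this sketch.
    return None
```

For the sample "Item Max Limit" row (integer, 0 to 9001), an input of "100" passes, while "9002" or "abc" is rejected with a message the GUI can show to the user.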

EDIT: Example Query

Here is some sample SQL using the above data to list the setting values for a given user ID:

-- DDL and sample data population...
CREATE TABLE SETTING
    (`id` int, `description` varchar(16)
     , `constrained` varchar(5), `data_type` varchar(12)
     , `min_value` varchar(6) NULL , `max_value` varchar(6) NULL)
;

INSERT INTO SETTING
    (`id`, `description`, `constrained`, `data_type`, `min_value`, `max_value`)
VALUES
    (10, 'Favourite Colour', 'true', 'alphanumeric', NULL, NULL),
    (11, 'Item Max Limit', 'false', 'integer', '0', '9001'),
    (12, 'Item Min Limit', 'false', 'integer', '0', '9000')
;

CREATE TABLE ALLOWED_SETTING_VALUE
    (`id` int, `setting_id` int, `item_value` varchar(7)
     , `caption` varchar(6))
;

INSERT INTO ALLOWED_SETTING_VALUE
    (`id`, `setting_id`, `item_value`, `caption`)
VALUES
    (123, 10, '#0000FF', 'Blue'),
    (124, 10, '#FFFF00', 'Yellow'),
    (125, 10, '#FF00FF', 'Pink')
;

-- allowed_setting_value_id is an int so that it matches ALLOWED_SETTING_VALUE.id
CREATE TABLE USER_SETTING
    (`id` int, `user_id` int, `setting_id` int
     , `allowed_setting_value_id` int NULL
     , `unconstrained_value` varchar(6) NULL)
;

INSERT INTO USER_SETTING
    (`id`, `user_id`, `setting_id`, `allowed_setting_value_id`, `unconstrained_value`)
VALUES
    (5678, 234, 10, 124, NULL),
    (7890, 234, 11, NULL, '100'),
    (8901, 234, 12, NULL, '1')
;

And now the DML to extract a user's settings:

-- Show settings for a given user
select
  US.user_id
, S1.description
, S1.data_type
, case when S1.constrained = 'true'
  then AV.item_value
  else US.unconstrained_value
  end value
, AV.caption
from USER_SETTING US
  inner join SETTING S1
    on US.setting_id = S1.id
  left outer join ALLOWED_SETTING_VALUE AV
    on US.allowed_setting_value_id = AV.id
where US.user_id = 234

See this in SQL Fiddle.

121

answered Sep 30 '22 15:09

Joel Brown

Option 1 (as noted, "property bag") is easy to implement - very little up-front analysis. But it has a bunch of downsides.

If you want to constrain the valid values for UserSettings.Code, you need an auxiliary table for the list of valid tags. So you have either (a) no validation on UserSettings.Code, which means your application code can dump any value in and you miss the chance to catch bugs, or (b) extra maintenance on the new list of valid tags.
UserSettings.Value probably has a string data type to accommodate all the different values that might go into it. So you have lost the true data type (integer, Boolean, float, etc.) and the data type checking that the RDBMS would do on insert of an incorrect value. Again, you have bought yourself a potential QA problem. Even for string values, you have lost the ability to constrain the length of the column.
You cannot define a DEFAULT value on the column based on the Code. So if you wanted EmailLimitMax to default to 5, you can’t do it.
Similarly, you can’t put a CHECK constraint on the Value column to prevent invalid values.
The property bag approach loses validation of SQL code. In the named column approach, a query that says “select Blah from UserSettings where UserID = x” will get a SQL error if Blah does not exist. If the SELECT is in a stored procedure or view, you will get the error when you apply the proc/view, well before the code goes to production. In the property bag approach, you just get NULL. So you have lost another automatic QA feature provided by the database, and introduced a possible undetected bug.
As noted, a query to find a UserID where conditions apply on multiple tags becomes harder to write: it requires one join into the table for each condition being tested.
Unfortunately, the Property Bag is an invitation for application developers to just stick a new Code into the property bag without analysis of how it will be used in the rest of the application. For a large application, this becomes a source of “hidden” properties because they are not formally modeled. It’s like doing your object model with pure tag-value instead of named attributes: it provides an escape valve, but you’re missing all the help the compiler would give you on strongly-typed, named attributes. Or like doing production XML with no schema validation.
The column-name approach is self-documenting. The list of columns in the table tells any developer what the possible user settings are.
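Two of the points above (the silent failure on a misspelled Code and the one-join-per-condition queries) can be seen in a small runnable sketch. It uses Python's built-in sqlite3 module purely for convenience. The table layouts, the sample rows, and the use of 'EmailAdded' as a Code are assumptions made up for the illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Property bag layout (Option 1); sample rows are invented for this sketch
CREATE TABLE user_settings (user_id INTEGER, code TEXT, value TEXT);
INSERT INTO user_settings VALUES
    (1, 'Email_LimitMax', '5'), (1, 'EmailAdded', 'true'),
    (2, 'Email_LimitMax', '5'), (2, 'EmailAdded', 'false');

-- Named column layout (Option 2 style) holding the same information
CREATE TABLE user_settings_wide
    (user_id INTEGER, email_limit_max INTEGER, email_added INTEGER);
INSERT INTO user_settings_wide VALUES (1, 5, 1), (2, 5, 0);
""")

# Two conditions on the property bag need the table joined to itself once
# per condition being tested.
PROPERTY_BAG_QUERY = """
SELECT a.user_id
FROM user_settings a
JOIN user_settings b ON b.user_id = a.user_id
WHERE a.code = 'Email_LimitMax' AND a.value = '5'
  AND b.code = 'EmailAdded'     AND b.value = 'true'
"""

# The same question against named columns is a plain WHERE clause.
NAMED_COLUMN_QUERY = """
SELECT user_id
FROM user_settings_wide
WHERE email_limit_max = 5 AND email_added = 1
"""

bag_users = [row[0] for row in conn.execute(PROPERTY_BAG_QUERY)]
wide_users = [row[0] for row in conn.execute(NAMED_COLUMN_QUERY)]

# A misspelled Code is not an error: the query simply finds nothing.
typo_rows = conn.execute(
    "SELECT value FROM user_settings WHERE user_id = 1 AND code = 'Email_LimitMx'"
).fetchall()

# A misspelled column name is rejected by the database straight away.
try:
    conn.execute("SELECT email_limit_mx FROM user_settings_wide")
    typo_column_rejected = False
except sqlite3.OperationalError:
    typo_column_rejected = True
```

Both queries find the same user, but the property bag version grows by one join for every extra condition, and only the named column version fails loudly when a name is mistyped.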

I have used property bags, but only as an escape valve, and I have often regretted it. I have never said “gee, I wish I had made that explicit column be a property bag.”

answered Sep 30 '22 14:09

Tom Wilson

Tags: database, sql-server, mysql, relational-database, database-design
