I wondering what will be the right approach to build EAV on jsonb. I have <code>Attribute</code> -> <code>Values</code> tables as like in standard EAV. <pre class="prettyprint"><code>CREATE TABLE attribute_values ( id INTEGER, attribute_id INTEGER, value VARCHAR(255) ); CREATE TABLE attributes ( id INTEGER, name VARCHAR(255) ); </code></pre> Values will saved in <code>attributes</code> filed of <code>Entity</code> <pre class="prettyprint"><code> CREATE TABLE entity ( id INTEGER, title TEXT, attributes JSONB ); </code></pre> Tables <code>Attribute</code> created to control duplicate attributes their types and better determine what it's a attribute is. For example to avoid: <code>{weight: 100}</code> and <code>{Weight: 100}</code> or <code>{weigh: 100}</code>. <code>Values</code> for work with unique values and contain avaliable list of values like color (green, red, white etc.) Values can be preloaded and using for faseted search. I see several options: 1. Store format like <pre class="prettyprint"><code>[{"attribute_id":1, "value":5},{"attribute_id":1, value:"text"}] </code></pre> where <code>value_id</code> will be <code>custom value</code> like text or <code>id</code> from <code>Values</code> table. But I can't understand how to build indexing on this format, for example if <code>Attribute 10</code> will <code>integer</code> 2. leave only <code>Attribute</code> table (for controlling attribute <code>name</code>) and store data like: <pre class="prettyprint"><code>{"price": 105, "weight": 100, "color": "white"} </code></pre> . This approach much better for indexing <pre class="prettyprint"><code>CREATE INDEX entity_index ON entity (((attributes ->> 'price')::int)); </code></pre> but I will have problem with translation of text property and controlling of unique values. Also I can't add additional key like in option <code>1</code>: <code>{"attribute_id":1, "value":5, "values": []}</code> What will be the best approach to store extra field with unique control (for unique attributes) and with the opportunity to indexing.

<h3>Objective: You want to store attribute related to a given entity.</h3> I do not recommend a separate table for attribute values like we might have done in years gone by. Put a <code>jsonb</code> field right on the appropriate table and call it <code>Attributes</code>. Add a <code>GIN</code> index to it so you can query the values quickly. Or use the other techniques described within. Read this: https://dba.stackexchange.com/a/174421/7762 The biggest question here is if you intend to pre-define attribute values. If you do, there is an extremely efficient way to store them. If not, then I recommend a standard JSON object. <h3>If you can pre-define your attributes names AND values:</h3> This gives you the most control, speed, and still provides flexibility. Create a table <code>Attribute</code> which has these fields: <ul> <li><code>AttributeID int4 unsigned not null primary key</code></li> <li><code>ParentAttributeID int4 unsigned null</code></li> <li><code>Name varchar(64) not null</code></li> <li> <code>Deleted</code> bool not null default false</li> <li>Add an index on <code>ParentAttributeID</code> </li> <li>Add a trigger to prevent <code>AttributeID</code> from changing</li> <li>Add a rule on delete do instead set Deleted=True</li> </ul> Then in any table you want to attribute, add this field: <ul> <li><code>AttributeSet" int[] not null default</code></li> <li>Add a GIN index on that array field</li> <li>Also enable the <code>intarray</code> extension from https://www.postgresql.org/docs/current/static/intarray.html </li> </ul> What has this accomplished? You've create a tree of attributes. It might look like this: <pre class="prettyprint"><code>ID Parent Name ---------------------------- 100 NULL Color 101 100 Blue 102 100 Red 103 100 Green 110 NULL Size 111 110 Large 112 110 Medium 113 110 Small </code></pre> Say you have a table called <code>Items</code> and on it you've added <code>AttributeSet</code>: <pre class="prettyprint"><code> ItemID: 1234 Name: Tee Shirt AttributeSet: [100, 103, 110, 112] </code></pre> When translated, this means that it has the <code>Color=Green</code> attribute, and the <code>Size=Medium</code> attribute. <code>103</code> and <code>112</code> were enough to store that, but sometimes it's nice to be able to say "Show me all items that have any Size defined", that's why 110 was included. You can make this lightning fast and ultra flexible. <pre class="prettyprint"><code>SELECT "ItemID", "Name" FROM "Items" WHERE "AttributeMap" @> ARRAY[103,112] </code></pre> Will return all items that have <code>Size=Medium</code> and <code>Color=Green</code> Or you can use the other operators on https://www.postgresql.org/docs/10/static/functions-array.html to come up with some awesome queries. <h3>When you don't know the attribute values but it's a small set:</h3> This gives you the most speed, control, and is even more flexible. You can flag new attributes for review if needed. You can use the above technique and just dynamically add values to the <code>Attribute</code> table if they don't exist. <h3>When you don't know the attribute values and the values are diverse</h3> This gives you the most flexibility, but at the expense of control. In this case just add this to any table: <ul> <li><code>AttributeMap jsonb not null default '{}'::jsonb</code></li> <li>Add a GIN index to that field</li> </ul> Write code to validate the values against your <code>Attribute</code> table. Have an indicator there if it is a single or multi-value... Store like this in the <code>AttributeMap</code> field: <pre class="prettyprint"><code>{ "Color": "Green", "Size": "Medium", "Categories": ["Sports", "Leisure"] } </code></pre> Notice that Categories is a multi-attribute. In your <code>Attribute</code> table you should have a field that is <code>IsMulti bool not null</code> which will allow you to know how to query for it.

Looking for a right EAV structure based on jsonb

Tags:

postgresql

jsonb

entity-attribute-value

I wondering what will be the right approach to build EAV on jsonb. I have Attribute -> Values tables as like in standard EAV.

CREATE TABLE attribute_values
(
  id           INTEGER,
  attribute_id INTEGER,
  value        VARCHAR(255)
);

CREATE TABLE attributes
(
  id   INTEGER,
  name VARCHAR(255)
);

Values will saved in attributes filed of Entity

 CREATE TABLE entity
    (
      id   INTEGER,
      title TEXT,
      attributes JSONB
    );

Tables Attribute created to control duplicate attributes their types and better determine what it's a attribute is. For example to avoid: {weight: 100} and {Weight: 100} or {weigh: 100}. Values for work with unique values and contain avaliable list of values like color (green, red, white etc.) Values can be preloaded and using for faseted search.

I see several options:

1. Store format like

[{"attribute_id":1, "value":5},{"attribute_id":1, value:"text"}]

where value_id will be custom value like text or id from Values table. But I can't understand how to build indexing on this format, for example if Attribute 10 will integer

2. leave only Attribute table (for controlling attribute name) and store data like:

{"price": 105, "weight": 100, "color": "white"}

. This approach much better for indexing

CREATE INDEX entity_index ON entity (((attributes ->> 'price')::int));

but I will have problem with translation of text property and controlling of unique values. Also I can't add additional key like in option 1: {"attribute_id":1, "value":5, "values": []}

What will be the best approach to store extra field with unique control (for unique attributes) and with the opportunity to indexing.

419

asked Apr 01 '18 14:04

Ivanov

1 Answers

Objective: You want to store attribute related to a given entity.

I do not recommend a separate table for attribute values like we might have done in years gone by. Put a jsonb field right on the appropriate table and call it Attributes. Add a GIN index to it so you can query the values quickly. Or use the other techniques described within.

Read this: https://dba.stackexchange.com/a/174421/7762

The biggest question here is if you intend to pre-define attribute values. If you do, there is an extremely efficient way to store them. If not, then I recommend a standard JSON object.

If you can pre-define your attributes names AND values:

This gives you the most control, speed, and still provides flexibility.

Create a table Attribute which has these fields:

AttributeID int4 unsigned not null primary key
ParentAttributeID int4 unsigned null
Name varchar(64) not null
Deleted bool not null default false
Add an index on ParentAttributeID
Add a trigger to prevent AttributeID from changing
Add a rule on delete do instead set Deleted=True

Then in any table you want to attribute, add this field:

AttributeSet" int[] not null default
Add a GIN index on that array field
Also enable the intarray extension from https://www.postgresql.org/docs/current/static/intarray.html

What has this accomplished?

You've create a tree of attributes. It might look like this:

ID   Parent  Name
----------------------------
100  NULL    Color
101  100       Blue
102  100       Red
103  100       Green
110  NULL    Size
111  110       Large
112  110       Medium 
113  110       Small

Say you have a table called Items and on it you've added AttributeSet:

      ItemID: 1234
        Name: Tee Shirt
AttributeSet: [100, 103, 110, 112]

When translated, this means that it has the Color=Green attribute, and the Size=Medium attribute. 103 and 112 were enough to store that, but sometimes it's nice to be able to say "Show me all items that have any Size defined", that's why 110 was included.

You can make this lightning fast and ultra flexible.

SELECT
  "ItemID", "Name"
FROM
  "Items"
WHERE "AttributeMap" @> ARRAY[103,112]

Will return all items that have Size=Medium and Color=Green

Or you can use the other operators on https://www.postgresql.org/docs/10/static/functions-array.html to come up with some awesome queries.

When you don't know the attribute values but it's a small set:

This gives you the most speed, control, and is even more flexible. You can flag new attributes for review if needed.

You can use the above technique and just dynamically add values to the Attribute table if they don't exist.

When you don't know the attribute values and the values are diverse

This gives you the most flexibility, but at the expense of control.

In this case just add this to any table:

AttributeMap jsonb not null default '{}'::jsonb
Add a GIN index to that field

Write code to validate the values against your Attribute table. Have an indicator there if it is a single or multi-value...

Store like this in the AttributeMap field:

{
    "Color": "Green", 
    "Size": "Medium", 
    "Categories": ["Sports", "Leisure"]
}

Notice that Categories is a multi-attribute. In your Attribute table you should have a field that is IsMulti bool not null which will allow you to know how to query for it.

188

answered Nov 15 '22 03:11

gahooa

Related questions
                            
                                How to enable postgresql-9.6 for remote connectivity
                            
                                Escape single quote in Postgres query inside node js app
                            
                                How to typecast a join column in Arel
                            
                                postgresql search text into array of text
                            
                                How to use IN clause with multiple columns on same data in postgresql?
                            
                                Speed up INSERT of 1 million+ rows into Postgres via R using COPY?
                            
                                Using pg_restore to create or overwrite tables
                            
                                Postgresql: date format and local language output
                            
                                syntax error at or near "SERIAL" with autoIncrement only
                            
                                Connection to postgres server on Azure fails when I use "sslmode=verify-full"
                            
                                How to connect to Postgresql service inside Docker Swarm?
                            
                                How to get data from postgresql json array field in an array
                            
                                Automated Testing with Databases
                            
                                Postgresql installation: command not found: createdb, psql
                            
                                PostgreSQL subquery using like
                            
                                Upsert if on conflict occurs on multiple columns in Postgres db
                            
                                RDS logging not appearing for PostgreSQL
                            
                                How do I access postgresql within Docker with sqlalchemy?
                            
                                How to delete certain amount of last rows in the table in PostgreSQL?
                            
                                postgres pgagent job status

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With