How to implement a many-to-many relationship in PostgreSQL?

People also ask

How do you create a many-to-many relationship in SQL?

When you need to establish a many-to-many relationship between two or more tables, the simplest way is to use a Junction Table. A Junction table in a database, also referred to as a Bridge table or Associative Table, bridges the tables together by referencing the primary keys of each data table.

Can you have a many-to-many relationship in SQL?

Many-to-Many RelationMany-to-Many relationship lets you relate each row in one table to many rows in another table and vice versa. As an example, an employee in the Employee table can have many skills from the EmployeeSkill table and also, one skill can be associated with one or more employees.

The SQL DDL (data definition language) statements could look like this:

CREATE TABLE product (
  product_id serial PRIMARY KEY  -- implicit primary key constraint
, product    text NOT NULL
, price      numeric NOT NULL DEFAULT 0
);

CREATE TABLE bill (
  bill_id  serial PRIMARY KEY
, bill     text NOT NULL
, billdate date NOT NULL DEFAULT CURRENT_DATE
);

CREATE TABLE bill_product (
  bill_id    int REFERENCES bill (bill_id) ON UPDATE CASCADE ON DELETE CASCADE
, product_id int REFERENCES product (product_id) ON UPDATE CASCADE
, amount     numeric NOT NULL DEFAULT 1
, CONSTRAINT bill_product_pkey PRIMARY KEY (bill_id, product_id)  -- explicit pk
);

I made a few adjustments:

The n:m relationship is normally implemented by a separate table - bill_product in this case.
I added serial columns as surrogate primary keys. In Postgres 10 or later consider an IDENTITY column instead. See:
- Safely rename tables using serial primary key columns
- Auto increment table column
- https://www.2ndquadrant.com/en/blog/postgresql-10-identity-columns/
I highly recommend that, because the name of a product is hardly unique (not a good "natural key"). Also, enforcing uniqueness and referencing the column in foreign keys is typically cheaper with a 4-byte integer (or even an 8-byte bigint) than with a string stored as text or varchar.
Don't use names of basic data types like date as identifiers. While this is possible, it is bad style and leads to confusing errors and error messages. Use legal, lower case, unquoted identifiers. Never use reserved words and avoid double-quoted mixed case identifiers if you can.
"name" is not a good name. I renamed the column of the table product to be product (or product_name or similar). That is a better naming convention. Otherwise, when you join a couple of tables in a query - which you do a lot in a relational database - you end up with multiple columns named "name" and have to use column aliases to sort out the mess. That's not helpful. Another widespread anti-pattern would be just "id" as column name.
I am not sure what the name of a bill would be. bill_id will probably suffice in this case.
price is of data type numeric to store fractional numbers precisely as entered (arbitrary precision type instead of floating point type). If you deal with whole numbers exclusively, make that integer. For example, you could save prices as Cents.
The amount ("Products" in your question) goes into the linking table bill_product and is of type numeric as well. Again, integer if you deal with whole numbers exclusively.
You see the foreign keys in bill_product? I created both to cascade changes: ON UPDATE CASCADE. If a product_id or bill_id should change, the change is cascaded to all depending entries in bill_product and nothing breaks. Those are just references without significance of their own.
I also used ON DELETE CASCADE for bill_id: If a bill gets deleted, its details die with it.
Not so for products: You don't want to delete a product that's used in a bill. Postgres will throw an error if you attempt this. You would add another column to product to mark obsolete rows ("soft-delete") instead.
All columns in this basic example end up to be NOT NULL, so NULL values are not allowed. (Yes, all columns - primary key columns are defined UNIQUE NOT NULL automatically.) That's because NULL values wouldn't make sense in any of the columns. It makes a beginner's life easier. But you won't get away so easily, you need to understand NULL handling anyway. Additional columns might allow NULL values, functions and joins can introduce NULL values in queries etc.
Read the chapter on CREATE TABLE in the manual.
Primary keys are implemented with a unique index on the key columns, that makes queries with conditions on the PK column(s) fast. However, the sequence of key columns is relevant in multicolumn keys. Since the PK on bill_product is on (bill_id, product_id) in my example, you may want to add another index on just product_id or (product_id, bill_id) if you have queries looking for a given product_id and no bill_id. See:
- PostgreSQL composite primary key
- Is a composite index also good for queries on the first field?
- Working of indexes in PostgreSQL
Read the chapter on indexes in the manual.

Related questions
                            
                                How do I delete from multiple tables using INNER JOIN in SQL server
                            
                                Display names of all constraints for a table in Oracle SQL
                            
                                Do I need to create indexes on foreign keys on Oracle?
                            
                                How do I use variables in Oracle SQL Developer?
                            
                                Can we have multiple "WITH AS" in single sql - Oracle SQL
                            
                                How to get the first and last date of the current year?
                            
                                MySQL JOIN the most recent row only?
                            
                                How to delete the top 1000 rows from a table using Sql Server 2008?
                            
                                A beginner's guide to SQL database design [closed]
                            
                                How to return result of a SELECT inside a function in PostgreSQL?
                            
                                What's the best way to join on the same table twice?
                            
                                Database Design for Revisions?
                            
                                Favourite performance tuning tricks [closed]
                            
                                How to use multiple WITH statements in one PostgreSQL query?
                            
                                Is the NOLOCK (Sql Server hint) bad practice?
                            
                                How to change column order in a table using sql query in sql server 2005?
                            
                                SQLAlchemy: how to filter date field?
                            
                                Is there a version control system for database structure changes?
                            
                                Generate sql insert script from excel worksheet
                            
                                SQL Server 2008: How to query all databases sizes?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to implement a many-to-many relationship in PostgreSQL?

Tags:

sql

database

postgresql

database-design

many-to-many

People also ask

Recent Activity

Donate For Us