I have 2 tables: <pre class="prettyprint"><code>table "person" with columns: person_id, person_name table "pet" with columns: pet_id, owner_id, pet_name person data: 1, 'John' 2, 'Jill' 3, 'Mary' pet data: 1, 1, 'Fluffy' 2, 1, 'Buster' 3, 2, 'Doggy' </code></pre> How to write select query from <code>person</code> left join <code>pet</code> on <code>person_id = owner_id</code> with aggregate functions so my result data looks like: <pre class="prettyprint"><code>1,[{pet_id:1,pet_name:'Fluffy'},{pet_id:2,pet_name:'Buster'}],'John' 2,[{pet_id:3,pet_name:'Doggy'}],'Jill' 3,[],'Mary' </code></pre>

Use <code>LEFT JOIN LATERAL</code> and aggregate in the subquery: <pre class="prettyprint lang-sql prettyprint-override"><code>SELECT p.person_id, COALESCE(pet.pets, '[]') AS pets, p.person_name FROM person p LEFT JOIN LATERAL ( SELECT json_agg(json_build_object('pet_id', pet.pet_id , 'pet_name', pet.pet_name)) AS pets FROM pet WHERE pet.owner_id = p.person_id ) pet ON true ORDER BY p.person_id; -- optional, Q suggests ordered results </code></pre> db<>fiddle here This way you do not need to aggregate results from the outer query. Simpler and cleaner when your outer query is more complex than the example in the question. When aggregating multiple related tables, it even becomes a necessity: <ul> <li>Multiple array_agg() calls in a single query</li> <li>Two SQL LEFT JOINS produce incorrect result</li> </ul> It is also typically much faster when there are selective predicates on the outer table <code>person</code> - which is the typical use case. Make sure there is an index on <code>pet(owner_id)</code> to make it fast. Or even one on <code>pet(owner_id, pet_id, pet_name)</code> or <code>pet(owner_id) INCLUDE (pet_id, pet_name)</code> in Postgres 11 or later, if your row isn't wide like in your example, and if you get index-only scans out of it. Oh, and use <code>json_build_object()</code> to preserve attribute names for arbitrary selections: <ul> <li>Return multiple columns of the same row as JSON array of objects</li> </ul> Related: <ul> <li>What is the difference between a LATERAL JOIN and a subquery in PostgreSQL?</li> </ul>

demo:db<>fiddle <pre class="prettyprint"><code>select COALESCE( json_agg(row_to_json(row(p2.pet_id::text, p2.pet_name))) FILTER (WHERE pet_id IS NOT NULL), '[]' ) as json, p1.person_name from person p1 left join pet p2 on p1.person_id = p2.owner_id group by p1.person_name; </code></pre> <ol> <li> <code>FILTER</code> clause to filter out <code>NULL</code> values. That creates a <code>NULL</code> value for Mary.</li> <li>If you want to add an empty JSON array: Use <code>COALESCE</code>, which replaces <code>NULL</code> with a default value</li> </ol>

PostgreSQL left join query object array aggregate

Tags:

json

sql

postgresql

left-join

aggregate-functions

I have 2 tables:

table "person" with columns: person_id, person_name
table "pet" with columns: pet_id, owner_id, pet_name

person data:
1, 'John'
2, 'Jill'
3, 'Mary'

pet data:
1, 1, 'Fluffy'
2, 1, 'Buster'
3, 2, 'Doggy'

How to write select query from person left join pet on person_id = owner_id with aggregate functions so my result data looks like:

1,[{pet_id:1,pet_name:'Fluffy'},{pet_id:2,pet_name:'Buster'}],'John'
2,[{pet_id:3,pet_name:'Doggy'}],'Jill'
3,[],'Mary'

460

asked Oct 07 '19 10:10

Tom Berghuis

3 Answers

Use LEFT JOIN LATERAL and aggregate in the subquery:

SELECT p.person_id, COALESCE(pet.pets, '[]') AS pets, p.person_name
FROM   person p
LEFT   JOIN LATERAL (
   SELECT json_agg(json_build_object('pet_id', pet.pet_id
                                   , 'pet_name', pet.pet_name)) AS pets
   FROM   pet
   WHERE  pet.owner_id = p.person_id
   ) pet ON true
ORDER  BY p.person_id;  -- optional, Q suggests ordered results

db<>fiddle here

This way you do not need to aggregate results from the outer query. Simpler and cleaner when your outer query is more complex than the example in the question. When aggregating multiple related tables, it even becomes a necessity:

Multiple array_agg() calls in a single query
Two SQL LEFT JOINS produce incorrect result

It is also typically much faster when there are selective predicates on the outer table person - which is the typical use case.

Make sure there is an index on pet(owner_id) to make it fast.
Or even one on pet(owner_id, pet_id, pet_name) or pet(owner_id) INCLUDE (pet_id, pet_name) in Postgres 11 or later, if your row isn't wide like in your example, and if you get index-only scans out of it.

Oh, and use json_build_object() to preserve attribute names for arbitrary selections:

Return multiple columns of the same row as JSON array of objects

What is the difference between a LATERAL JOIN and a subquery in PostgreSQL?

answered Oct 17 '22 00:10

Erwin Brandstetter

select
    person_id,
    jsonb_agg(to_jsonb(pet) - 'owner_id'),
    person_name
from person
left join pet on person_id = owner_id
group by person_id;

 person_id |                                 jsonb_agg                                  | person_name 
-----------+----------------------------------------------------------------------------+-------------
         1 | [{"pet_id": 1, "pet_name": "Fluffy"}, {"pet_id": 2, "pet_name": "Buster"}] | John
         2 | [{"pet_id": 3, "pet_name": "Doggy"}]                                       | Jill
         3 | [null]                                                                     | Mary
(3 rows)

Db<>fiddle.

answered Oct 17 '22 02:10

klin

demo:db<>fiddle

select
    COALESCE(
        json_agg(row_to_json(row(p2.pet_id::text, p2.pet_name))) FILTER (WHERE pet_id IS NOT NULL), 
       '[]'
    ) as json,
    p1.person_name
from person p1
left join pet p2
    on p1.person_id = p2.owner_id
group by
    p1.person_name;

FILTER clause to filter out NULL values. That creates a NULL value for Mary.
If you want to add an empty JSON array: Use COALESCE, which replaces NULL with a default value

answered Oct 17 '22 00:10

S-Man

Related questions
                            
                                query " ALTER TABLE test_posts ADD sticky boolean NOT NULL default = false" = error
                            
                                count columns group by
                            
                                Is there a portable way to have "SELECT FIRST 10 * FROM T" semantic?
                            
                                How to drop more than one constraint at once (Oracle, SQL)
                            
                                Select * with specific alias [syntax]
                            
                                How do I identify views with broken dependencies in SQL Server?
                            
                                Querying last 5 years
                            
                                SQL Data Type for System.Drawing.Color
                            
                                Column level vs table level constraints in sql server?
                            
                                SQL - Select newest record when there's a duplicate
                            
                                How to check my data in SQL Server have carriage return and line feed? [duplicate]
                            
                                How do I perform a simple string mapping as part of a t-sql select?
                            
                                pseudo_encrypt() function in plpgsql that takes bigint
                            
                                simple select query in linq
                            
                                JDBC Derby driver not found
                            
                                Can I use MySQL Workbench to create MariaDB?
                            
                                SQL Server: stored procedure become very slow, raw SQL query is still very fast
                            
                                Laravel Eloquent OR WHERE IS NOT NULL
                            
                                INSERT INTO using a query, and add a default value
                            
                                SQL Select Distinct column and latest date

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With