I am trying to get the following to return a count for every organization using a left join in PostgreSQL, but I cannot figure out why it's not working:
select o.name as organisation_name, coalesce(COUNT(exam_items.id)) as total_used from organisations o left join exam_items e on o.id = e.organisation_id where e.item_template_id = #{sanitize(item_template_id)} and e.used = true group by o.name order by o.name
Using coalesce
doesn't seem to work. I'm at my wit's end! Any help would certainly be appreciated!
To clarify what's not working, at the moment the query only returns values for organisations that have a count greater than 0. I would like it to return a line for every organisation, regardless of the count.
Table definitions:
TABLE exam_items id serial NOT NULL exam_id integer item_version_id integer used boolean DEFAULT false question_identifier character varying(255) organisation_id integer created_at timestamp without time zone NOT NULL updated_at timestamp without time zone NOT NULL item_template_id integer stem_id integer CONSTRAINT exam_items_pkey PRIMARY KEY (id) TABLE organisations id serial NOT NULL slug character varying(255) name character varying(255) code character varying(255) address text organisation_type integer created_at timestamp without time zone NOT NULL updated_at timestamp without time zone NOT NULL super boolean DEFAULT false CONSTRAINT organisations_pkey PRIMARY KEY (id)
An ugly workaround, if you want your original query to return a row with 0's, when no records are present, is to add something like this to your query: UNION SELECT NULL AS [Month], 0 AS [COUNT], 0 AS [GRAMS], 0 AS [PRINCIPAL] WHERE (SELECT COUNT(*) FROM #AllExpired) = 0 , but a better solution would be to have your ...
The SQL LEFT JOIN returns all rows from the left table, even if there are no matches in the right table. This means that if the ON clause matches 0 (zero) records in the right table; the join will still return a row in the result, but with NULL in each column from the right table.
Left joins can increase the number of rows in the left table if there are multiple matches in the right table.
The JOIN or INNER JOIN does not return any non-matching rows at all. It returns only the rows that match in both of the tables you join. If you want to get any unmatched rows, you shouldn't use it. The LEFT JOIN and the RIGHT JOIN get you both matched and unmatched rows.
LEFT JOIN
This should work:
SELECT o.name AS organisation_name, count(e.id) AS total_used FROM organisations o LEFT JOIN exam_items e ON e.organisation_id = o.id AND e.item_template_id = #{sanitize(item_template_id)} AND e.used GROUP BY o.name ORDER BY o.name;
You had a LEFT [OUTER] JOIN
but the later WHERE
conditions made it act like a plain [INNER] JOIN
.
Move the condition(s) to the JOIN
clause to make it work as intended. This way, only rows that fulfill all these conditions are joined in the first place (or columns from the right table are filled with NULL). Like you had it, joined rows are tested for additional conditions virtually after the LEFT JOIN
and removed if they don't pass, just like with a plain JOIN
.
count()
never returns NULL to begin with. It's an exception among aggregate functions in this respect. Therefore, never makes sense, even with additional parameters. The manual:COALESCE(COUNT(col))
It should be noted that except for
count
, these functions return a null value when no rows are selected.
Bold emphasis mine. See:
count()
must be on a column defined NOT NULL
(like e.id
), or where the join condition guarantees NOT NULL
(e.organisation_id
, e.item_template_id
, or e.used
) in the example.
Since used
is type boolean
, the expression e.used = true
is noise that burns down to just e.used
.
Since o.name
is not defined UNIQUE NOT NULL
, you may want to GROUP BY o.id
instead (id
being the PK) - unless you intend to fold rows with the same name (including NULL).
If most or all rows of exam_items
are counted in the process, this equivalent query is typically considerably faster / cheaper:
SELECT o.id, o.name AS organisation_name, e.total_used FROM organisations o LEFT JOIN ( SELECT organisation_id AS id -- alias to simplify join syntax , count(*) AS total_used -- count(*) = fastest to count all FROM exam_items WHERE item_template_id = #{sanitize(item_template_id)} AND used GROUP BY 1 ) e USING (id) ORDER BY o.name, o.id;
(This is assuming that you don't want to fold rows with the same name like mentioned above - the typical case.)
Now we can use the faster / simpler count(*)
in the subquery, and we need no GROUP BY
in the outer SELECT
.
See:
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With