correct way to create a pivot table in postgresql using CASE WHEN

Tags:

I am trying to create a pivot table type view in postgresql and am nearly there! Here is the basic query:

select 
acc2tax_node.acc, tax_node.name, tax_node.rank 
from 
tax_node, acc2tax_node 
where 
tax_node.taxid=acc2tax_node.taxid and acc2tax_node.acc='AJ012531';

And the data:

   acc    |          name           |     rank     
----------+-------------------------+--------------
 AJ012531 | Paromalostomum fusculum | species
 AJ012531 | Paromalostomum          | genus
 AJ012531 | Macrostomidae           | family
 AJ012531 | Macrostomida            | order
 AJ012531 | Macrostomorpha          | no rank
 AJ012531 | Turbellaria             | class
 AJ012531 | Platyhelminthes         | phylum
 AJ012531 | Acoelomata              | no rank
 AJ012531 | Bilateria               | no rank
 AJ012531 | Eumetazoa               | no rank
 AJ012531 | Metazoa                 | kingdom
 AJ012531 | Fungi/Metazoa group     | no rank
 AJ012531 | Eukaryota               | superkingdom
 AJ012531 | cellular organisms      | no rank

What I am trying to get is the following:

acc      | species                  | phylum
AJ012531 | Paromalostomum fusculum  | Platyhelminthes

I am trying to do this with CASE WHEN, so I've got as far as the following:

select 
acc2tax_node.acc, 
CASE tax_node.rank WHEN 'species' THEN tax_node.name ELSE NULL END as species, 
CASE tax_node.rank WHEN 'phylum' THEN tax_node.name ELSE NULL END as phylum 
from 
tax_node, acc2tax_node 
where 
tax_node.taxid=acc2tax_node.taxid and acc2tax_node.acc='AJ012531';

Which gives me the output:

   acc    |         species         |     phylum      
----------+-------------------------+-----------------
 AJ012531 | Paromalostomum fusculum | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | Platyhelminthes
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         | 
 AJ012531 |                         |

Now I know that I have to group by acc at some point, so I try

select 
acc2tax_node.acc, 
CASE tax_node.rank WHEN 'species' THEN tax_node.name ELSE NULL END as sp, 
CASE tax_node.rank WHEN 'phylum' THEN tax_node.name ELSE NULL END as ph 
from 
tax_node, acc2tax_node 
where 
tax_node.taxid=acc2tax_node.taxid and acc2tax_node.acc='AJ012531' 
group by acc2tax_node.acc;

But I get the dreaded

ERROR:  column "tax_node.rank" must appear in the GROUP BY clause or be used in an aggregate function

All the previous examples I've been able to find use something like SUM() around the CASE statements, so I guess that is the aggregate function. I have tried using FIRST():

select 
acc2tax_node.acc, 
FIRST(CASE tax_node.rank WHEN 'species' THEN tax_node.name ELSE NULL END) as sp, 
FIRST(CASE tax_node.rank WHEN 'phylum' THEN tax_node.name ELSE NULL END) as ph 
from tax_node, acc2tax_node where tax_node.taxid=acc2tax_node.taxid and acc2tax_node.acc='AJ012531' group by acc2tax_node.acc;

but get the error:

ERROR:  function first(character varying) does not exist

Can anyone offer any hints?

450

asked Mar 19 '10 12:03

mojones

2 Answers

Use MAX() or MIN(), not FIRST(). In this scenario, you will have all NULLs in the column per each group value except for, at most, one with a not null value. By definition, this is both the MIN and the MAX of that set of values (all nulls are excluded).

answered Nov 03 '22 01:11

Matthew Wood

PostgreSQL does have a couple of functions for pivot queries, see this article at Postgresonline. You can find these functions in the contrib.

answered Nov 03 '22 02:11

Frank Heikens

Related questions
                            
                                ON DELETE CASCADE for multiple foreign keys with Sequelize
                            
                                Equivalent of string contains in google bigquery
                            
                                Sequelize: escape string in a literal string
                            
                                How to create a unique index containing multiple fields where one is a foreign key
                            
                                What are the consequences of not closing database connection after an error?
                            
                                Dates with no time or timezone component in Java/MySQL
                            
                                in SQL, or Django ORM, what's the conventional way to have an ordered one-to-many?
                            
                                Why does the SqlServer optimizer get so confused with parameters?
                            
                                Preferred database design method for assigning user roles? (Hats vs. Groups)
                            
                                SQL Server 2005, turn columns into rows
                            
                                Should CONTROL permission be given on a Stored Procedure in SQL Server 2005?
                            
                                is it better to put more logic in your ON clause or should it only have the minimum necessary?
                            
                                What are the best practices on formatting inline sql using ADO.NET in C#
                            
                                Using IsolationLevel.Snapshot but DB is still locking
                            
                                complex sql order by
                            
                                SQL Query, Selecting 5 most recent in each group
                            
                                Need help with the Merge statement
                            
                                Oracle bug? SELECT returns no dupes, INSERT from SELECT has duplicate rows
                            
                                SQL Count(*) on multiple tables
                            
                                Stored procedure executing another stored procedure

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

correct way to create a pivot table in postgresql using CASE WHEN

Tags:

sql

postgresql

case-when

pivot

pivot-table

mojones

People also ask

2 Answers

Matthew Wood

Frank Heikens

Recent Activity

Donate For Us