In PostgreSQL, I want to use an SQL statement to combine two columns and create a new column from them. I'm thinking about using <code>concat(...)</code>, but is there a better way? What's the best way to do this?

Generally, I agree with @kgrittn's advice. Go for it. But to address your basic question about <code>concat()</code>: The new function <code>concat()</code> is useful if you need to deal with null values - and null has neither been ruled out in your question nor in the one you refer to. If you can rule out null values, the good old (SQL standard) concatenation operator <code>||</code> is still the best choice, and @luis' answer is just fine: <pre class="prettyprint"><code>SELECT col_a || col_b; </code></pre> If either of your columns can be null, the result would be null in that case. You could defend with <code>COALESCE</code>: <pre class="prettyprint"><code>SELECT COALESCE(col_a, '') || COALESCE(col_b, ''); </code></pre> But that get tedious quickly with more arguments. That's where <code>concat()</code> comes in, which never returns null, not even if all arguments are null. Per documentation: <blockquote> NULL arguments are ignored. </blockquote> <pre class="prettyprint"><code>SELECT concat(col_a, col_b); </code></pre> The remaining corner case for both alternatives is where all input columns are null in which case we still get an empty string <code>''</code>, but one might want null instead (at least I would). One possible way: <pre class="prettyprint"><code>SELECT CASE WHEN col_a IS NULL THEN col_b WHEN col_b IS NULL THEN col_a ELSE col_a || col_b END; </code></pre> This gets more complex with more columns quickly. Again, use <code>concat()</code> but add a check for the special condition: <pre class="prettyprint"><code>SELECT CASE WHEN (col_a, col_b) IS NULL THEN NULL ELSE concat(col_a, col_b) END; </code></pre> How does this work? <code>(col_a, col_b)</code> is shorthand notation for a row type expression <code>ROW (col_a, col_b)</code>. And a row type is only null if all columns are null. Detailed explanation: <ul> <li>NOT NULL constraint over a set of columns</li> </ul> Also, use <code>concat_ws()</code> to add separators between elements (<code>ws</code> for "with separator"). <hr> An expression like the one in Kevin's answer: <pre class="prettyprint"><code>SELECT $1.zipcode || ' - ' || $1.city || ', ' || $1.state; </code></pre> is tedious to prepare for null values in PostgreSQL 8.3 (without <code>concat()</code>). One way (of many): <pre class="prettyprint"><code>SELECT COALESCE( CASE WHEN $1.zipcode IS NULL THEN $1.city WHEN $1.city IS NULL THEN $1.zipcode ELSE $1.zipcode || ' - ' || $1.city END, '') || COALESCE(', ' || $1.state, ''); </code></pre> <h3>Function volatility is only <code>STABLE</code> </h3> <code>concat()</code> and <code>concat_ws()</code> are <code>STABLE</code> functions, not <code>IMMUTABLE</code> because they can invoke datatype output functions (like <code>timestamptz_out</code>) that depend on locale settings. Explanation by Tom Lane. This prohibits their direct use in index expressions. If you know that the result is actually immutable in your case, you can work around this with an <code>IMMUTABLE</code> function wrapper. Example here: <ul> <li>Does PostgreSQL support "accent insensitive" collations?</li> </ul>

Did you check the string concatenation function? Something like: <pre class="prettyprint"><code>update table_c set column_a = column_b || column_c </code></pre> should work. More here

Combine two columns and add into one new column

3 Answers

Generally, I agree with @kgrittn's advice. Go for it.

But to address your basic question about concat(): The new function concat() is useful if you need to deal with null values - and null has neither been ruled out in your question nor in the one you refer to.

If you can rule out null values, the good old (SQL standard) concatenation operator || is still the best choice, and @luis' answer is just fine:

SELECT col_a || col_b;

If either of your columns can be null, the result would be null in that case. You could defend with COALESCE:

SELECT COALESCE(col_a, '') || COALESCE(col_b, '');

But that get tedious quickly with more arguments. That's where concat() comes in, which never returns null, not even if all arguments are null. Per documentation:

NULL arguments are ignored.

SELECT concat(col_a, col_b);

The remaining corner case for both alternatives is where all input columns are null in which case we still get an empty string '', but one might want null instead (at least I would). One possible way:

SELECT CASE           WHEN col_a IS NULL THEN col_b           WHEN col_b IS NULL THEN col_a           ELSE col_a || col_b        END;

This gets more complex with more columns quickly. Again, use concat() but add a check for the special condition:

SELECT CASE WHEN (col_a, col_b) IS NULL THEN NULL             ELSE concat(col_a, col_b) END;

How does this work?
(col_a, col_b) is shorthand notation for a row type expression ROW (col_a, col_b). And a row type is only null if all columns are null. Detailed explanation:

NOT NULL constraint over a set of columns

Also, use concat_ws() to add separators between elements (ws for "with separator").

An expression like the one in Kevin's answer:

SELECT $1.zipcode || ' - ' || $1.city || ', ' || $1.state;

is tedious to prepare for null values in PostgreSQL 8.3 (without concat()). One way (of many):

SELECT COALESCE(          CASE             WHEN $1.zipcode IS NULL THEN $1.city             WHEN $1.city    IS NULL THEN $1.zipcode             ELSE $1.zipcode || ' - ' || $1.city          END, '')        || COALESCE(', ' || $1.state, '');

Function volatility is only `STABLE`

concat() and concat_ws() are STABLE functions, not IMMUTABLE because they can invoke datatype output functions (like timestamptz_out) that depend on locale settings.
Explanation by Tom Lane.

This prohibits their direct use in index expressions. If you know that the result is actually immutable in your case, you can work around this with an IMMUTABLE function wrapper. Example here:

Does PostgreSQL support "accent insensitive" collations?

answered Sep 21 '22 15:09

Erwin Brandstetter

You don't need to store the column to reference it that way. Try this:

To set up:

CREATE TABLE tbl   (zipcode text NOT NULL, city text NOT NULL, state text NOT NULL); INSERT INTO tbl VALUES ('10954', 'Nanuet', 'NY');

We can see we have "the right stuff":

\pset border 2 SELECT * FROM tbl;

 +---------+--------+-------+ | zipcode |  city  | state | +---------+--------+-------+ | 10954   | Nanuet | NY    | +---------+--------+-------+

Now add a function with the desired "column name" which takes the record type of the table as its only parameter:

CREATE FUNCTION combined(rec tbl)   RETURNS text   LANGUAGE SQL AS $$   SELECT $1.zipcode || ' - ' || $1.city || ', ' || $1.state; $$;

This creates a function which can be used as if it were a column of the table, as long as the table name or alias is specified, like this:

SELECT *, tbl.combined FROM tbl;

Which displays like this:

 +---------+--------+-------+--------------------+ | zipcode |  city  | state |      combined      | +---------+--------+-------+--------------------+ | 10954   | Nanuet | NY    | 10954 - Nanuet, NY | +---------+--------+-------+--------------------+

This works because PostgreSQL checks first for an actual column, but if one is not found, and the identifier is qualified with a relation name or alias, it looks for a function like the above, and runs it with the row as its argument, returning the result as if it were a column. You can even index on such a "generated column" if you want to do so.

Because you're not using extra space in each row for the duplicated data, or firing triggers on all inserts and updates, this can often be faster than the alternatives.

answered Sep 17 '22 15:09

kgrittn

Did you check the string concatenation function? Something like:

update table_c set column_a = column_b || column_c

should work. More here

answered Sep 21 '22 15:09

luis

Related questions
                            
                                Delete all but one duplicate record
                            
                                Auto increment table column
                            
                                What is the purpose of using WHERE 1=1 in SQL statements? [duplicate]
                            
                                Get previous and next row from rows selected with (WHERE) conditions
                            
                                How can I make a MySQL SUM query return zero instead of null if there are no records?
                            
                                Sql Server trigger insert values from new row into another table
                            
                                SQL Server 2008: How to find trailing spaces
                            
                                String to timestamp in MySQL
                            
                                How to perform a LEFT JOIN in SQL Server between two SELECT statements?
                            
                                How to add static value when doing INSERT INTO with SELECT in a MySQL query?
                            
                                SQL best practice to deal with default sort order
                            
                                how to find out number of days in month in mysql
                            
                                How do I perform a batch insert in Django?
                            
                                How can I create a blank/hardcoded column in a sql query?
                            
                                SQLite ORDER BY string containing number starting with 0
                            
                                How to grant the database owner (DBO) the EXTERNAL ACCESS ASSEMBLY permission?
                            
                                How can I solve ORA-00911: invalid character error?
                            
                                Sleep function in ORACLE
                            
                                why would you use WHERE 1=0 statement in SQL?
                            
                                Enterprise Reporting Solutions [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Combine two columns and add into one new column

Tags:

sql

null

concatenation

postgresql

Rock

People also ask

3 Answers

Function volatility is only `STABLE`

Erwin Brandstetter

kgrittn

luis

Recent Activity

Donate For Us

Combine two columns and add into one new column

Tags:

sql

null

concatenation

postgresql

Rock

People also ask

3 Answers

Function volatility is only STABLE

Erwin Brandstetter

kgrittn

luis

Related questions

Recent Activity

Donate For Us

Function volatility is only `STABLE`