I posted a related question, but this is another part of my puzzle. I would like to get the OLD value of a column from a row that was UPDATEd - WITHOUT using triggers (nor stored procedures, nor any other extra, non -SQL/-query entities). I have a query like this: <pre class="prettyprint"><code> UPDATE my_table SET processing_by = our_id_info -- unique to this worker WHERE trans_nbr IN ( SELECT trans_nbr FROM my_table GROUP BY trans_nbr HAVING COUNT(trans_nbr) > 1 LIMIT our_limit_to_have_single_process_grab ) RETURNING row_id; </code></pre> If I could do <code>FOR UPDATE ON my_table</code> at the end of the subquery, that'd be divine (and fix my other question/problem). But that won't work: can't combine this with <code>GROUP BY</code> (which is necessary for figuring out the count). Then I could just take those trans_nbr's and do a query first to get the (soon-to-be-) former <code>processing_by</code> values. I've tried doing like: <pre class="prettyprint"><code> UPDATE my_table SET processing_by = our_id_info -- unique to this worker FROM my_table old_my_table JOIN ( SELECT trans_nbr FROM my_table GROUP BY trans_nbr HAVING COUNT(trans_nbr) > 1 LIMIT our_limit_to_have_single_process_grab ) sub_my_table ON old_my_table.trans_nbr = sub_my_table.trans_nbr WHERE my_table.trans_nbr = sub_my_table.trans_nbr AND my_table.processing_by = old_my_table.processing_by RETURNING my_table.row_id, my_table.processing_by, old_my_table.processing_by </code></pre> But that can't work; <code>old_my_table</code> is not visible outside the join; the <code>RETURNING</code> clause is blind to it. I've long since lost count of all the attempts I've made; I have been researching this for literally hours. If I could just find a bullet-proof way to lock the rows in my subquery - and ONLY those rows, and WHEN the subquery happens - all the concurrency issues I'm trying to avoid would disappear ... <hr> UPDATE: I had a typo in the non-generic code of the above. I retried after Erwin Brandstetter suggested it should work. Since it took me so long to find this sort of solution, perhaps my embarrassment is worth it? At least this is on SO for posterity now... :> What I now have (that works) is like this: <pre class="prettyprint"><code> UPDATE my_table SET processing_by = our_id_info -- unique to this worker FROM my_table AS old_my_table WHERE trans_nbr IN ( SELECT trans_nbr FROM my_table GROUP BY trans_nbr HAVING COUNT(*) > 1 LIMIT our_limit_to_have_single_process_grab ) AND my_table.row_id = old_my_table.row_id RETURNING my_table.row_id, my_table.processing_by, old_my_table.processing_by AS old_processing_by </code></pre> The <code>COUNT(*)</code> is per a suggestion from Flimzy in a comment on my other (linked above) question. Please see my other question for correctly implementing concurrency and even a non-blocking version; THIS query merely shows how to get the old and new values from an update, ignore the bad/wrong concurrency bits.

The CTE variant as proposed by @MattDiPasquale should work too. With the comfortable means of a CTE I would be more explicit, though: <pre class="prettyprint"><code>WITH sel AS ( SELECT tbl_id, name FROM tbl WHERE tbl_id = 3 -- assuming unique tbl_id ) , upd AS ( UPDATE tbl SET name = 'New Guy' WHERE tbl_id = 3 RETURNING tbl_id, name ) SELECT s.tbl_id AS old_id, s.name As old_name , u.tbl_id, u.name FROM sel s, upd u; </code></pre> Without testing I claim this works: <code>SELECT</code> and <code>UPDATE</code> see the same snapshot of the database. The <code>SELECT</code> is bound to return the old values (even if you place the CTE after the CTE with the <code>UPDATE</code>), while the <code>UPDATE</code> returns the new values by definition. Voilá. But it will be slower than my first answer.

You can use a <code>SELECT</code> subquery. Example: Update a user's email <code>RETURNING</code> the old value. <ol> <li> <code>RETURNING</code> Subquery <pre class="prettyprint"><code>UPDATE users SET email = 'new@gmail.com' WHERE id = 1 RETURNING (SELECT email FROM users WHERE id = 1); </code></pre> </li> <li> PostgreSQL WITH Query (Common Table Expressions) <pre class="prettyprint"><code>WITH u AS ( SELECT email FROM users WHERE id = 1 ) UPDATE users SET email = 'new@gmail.com' WHERE id = 1 RETURNING (SELECT email FROM u); </code></pre> This has worked several times on my local database without fail, but I'm not sure if the <code>SELECT</code> in <code>WITH</code> is guaranteed to consistently execute before the <code>UPDATE</code> since "the sub-statements in WITH are executed concurrently with each other and with the main query." </li> </ol>

Return pre-UPDATE column values using SQL only

Tags:

sql

postgresql

sql-update

concurrency

subquery

I posted a related question, but this is another part of my puzzle.

I would like to get the OLD value of a column from a row that was UPDATEd - WITHOUT using triggers (nor stored procedures, nor any other extra, non -SQL/-query entities).

I have a query like this:

   UPDATE my_table
      SET processing_by = our_id_info  -- unique to this worker
    WHERE trans_nbr IN (
                        SELECT trans_nbr
                          FROM my_table
                         GROUP BY trans_nbr
                        HAVING COUNT(trans_nbr) > 1
                         LIMIT our_limit_to_have_single_process_grab
                       )
RETURNING row_id;

If I could do FOR UPDATE ON my_table at the end of the subquery, that'd be divine (and fix my other question/problem). But that won't work: can't combine this with GROUP BY (which is necessary for figuring out the count). Then I could just take those trans_nbr's and do a query first to get the (soon-to-be-) former processing_by values.

I've tried doing like:

   UPDATE my_table
      SET processing_by = our_id_info -- unique to this worker
     FROM my_table old_my_table
     JOIN (
             SELECT trans_nbr
               FROM my_table
           GROUP BY trans_nbr
             HAVING COUNT(trans_nbr) > 1
              LIMIT our_limit_to_have_single_process_grab
          ) sub_my_table
       ON old_my_table.trans_nbr = sub_my_table.trans_nbr
    WHERE     my_table.trans_nbr = sub_my_table.trans_nbr
      AND my_table.processing_by = old_my_table.processing_by
RETURNING my_table.row_id, my_table.processing_by, old_my_table.processing_by

But that can't work; old_my_table is not visible outside the join; the RETURNING clause is blind to it.

I've long since lost count of all the attempts I've made; I have been researching this for literally hours.

If I could just find a bullet-proof way to lock the rows in my subquery - and ONLY those rows, and WHEN the subquery happens - all the concurrency issues I'm trying to avoid would disappear ...

UPDATE: I had a typo in the non-generic code of the above. I retried after Erwin Brandstetter suggested it should work. Since it took me so long to find this sort of solution, perhaps my embarrassment is worth it? At least this is on SO for posterity now... :>

What I now have (that works) is like this:

   UPDATE my_table
      SET processing_by = our_id_info -- unique to this worker
     FROM my_table AS old_my_table
    WHERE trans_nbr IN (
                          SELECT trans_nbr
                            FROM my_table
                        GROUP BY trans_nbr
                          HAVING COUNT(*) > 1
                           LIMIT our_limit_to_have_single_process_grab
                       )
      AND my_table.row_id = old_my_table.row_id
RETURNING my_table.row_id, my_table.processing_by, old_my_table.processing_by AS old_processing_by

The COUNT(*) is per a suggestion from Flimzy in a comment on my other (linked above) question.

Please see my other question for correctly implementing concurrency and even a non-blocking version; THIS query merely shows how to get the old and new values from an update, ignore the bad/wrong concurrency bits.

294

asked Oct 27 '11 22:10

pythonlarry

3 Answers

The CTE variant as proposed by @MattDiPasquale should work too.
With the comfortable means of a CTE I would be more explicit, though:

WITH sel AS (
   SELECT tbl_id, name FROM tbl WHERE tbl_id = 3  -- assuming unique tbl_id
   )
, upd AS (
   UPDATE tbl SET name = 'New Guy' WHERE tbl_id = 3
   RETURNING tbl_id, name
   )
SELECT s.tbl_id AS old_id, s.name As old_name
     , u.tbl_id, u.name
FROM   sel s, upd u;

Without testing I claim this works: SELECT and UPDATE see the same snapshot of the database. The SELECT is bound to return the old values (even if you place the CTE after the CTE with the UPDATE), while the UPDATE returns the new values by definition. Voilá.

But it will be slower than my first answer.

141

answered Oct 05 '22 04:10

Erwin Brandstetter

You can use a SELECT subquery.

Example: Update a user's email RETURNING the old value.

RETURNING Subquery

UPDATE users SET email = '[email protected]' WHERE id = 1
RETURNING (SELECT email FROM users WHERE id = 1);

PostgreSQL WITH Query (Common Table Expressions)
```
WITH u AS (
    SELECT email FROM users WHERE id = 1
)
UPDATE users SET email = '[email protected]' WHERE id = 1
RETURNING (SELECT email FROM u);
```
This has worked several times on my local database without fail, but I'm not sure if the SELECT in WITH is guaranteed to consistently execute before the UPDATE since "the sub-statements in WITH are executed concurrently with each other and with the main query."

answered Oct 05 '22 03:10

ma11hew28

when faced with this dilemma I added junk columns to the table and then I copy the old values into the junk columns (which I then return) when I update the record. this bloats the table a bit but avoids the need for joins.

answered Oct 05 '22 04:10

Jasen

Related questions
                            
                                How to add plus one (+1) to a SQL Server column in a SQL Query
                            
                                SQL SELECT LIKE (Insensitive casing)
                            
                                SQL IN Clause 1000 item limit
                            
                                C# Version Of SQL LIKE
                            
                                Which sql server data type best represents a double in C#? [duplicate]
                            
                                When to use an auto-incremented primary key and when not to?
                            
                                What is the purpose for using OPTION(MAXDOP 1) in SQL Server?
                            
                                MySQL COUNT with LIMIT
                            
                                oracle - what statements need to be committed?
                            
                                SELECT * EXCEPT
                            
                                Is there a coalesce-like function in Excel?
                            
                                Using the DISTINCT keyword causes this error: not a SELECTed expression
                            
                                GROUP BY and COUNT using ActiveRecord
                            
                                Passing Output parameters to stored procedure using dapper in c# code
                            
                                What is the linq equivalent to the SQL IN operator
                            
                                MySQL: View with Subquery in the FROM Clause Limitation
                            
                                How to create a new database with the hstore extension already installed?
                            
                                Are left outer joins and left joins the same? [duplicate]
                            
                                What is the meaning of <> in mysql query?
                            
                                PL/SQL block problem: No data found error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With