I'm trying to randomly insert values from a list of pre-defined values into a table for testing. I tried using the solution found on this StackOverflow question: <code>stackoverflow.com/.../update-sql-table-with-random-value-from-other-table</code> When I I tried this, all of my "random" values that are inserted are exactly the same for all 3000 records. When I run the part of the query that actually selects the random row, it does select a random record every time I run it by hand, so I know the query works. My best guesses as to what is happening are: <ul> <li>SQL Server is optimizing the <code>SELECT</code> somehow, not allowing the subquery to be evaluated more than once</li> <li>The random value's seed is the same on every record the query updates</li> </ul> I'm stuck on what my options are. Am I doing something wrong, or is there another way I should be doing this? This is the code I'm using: <pre class="prettyprint"><code>DECLARE @randomStuff TABLE ([id] INT, [val] VARCHAR(100)) INSERT INTO @randomStuff ([id], [val]) VALUES ( 1, 'Test Value 1' ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 2, 'Test Value 2' ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 3, 'Test Value 3' ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 4, 'Test Value 4' ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 5, 'Test Value 5' ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 6, null ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 7, null ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 8, null ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 9, null ) INSERT INTO @randomStuff ([id], [val]) VALUES ( 10, null ) UPDATE MyTable SET MyColumn = (SELECT TOP 1 [val] FROM @randomStuff ORDER BY NEWID()) </code></pre>

When the query engine sees this... <pre class="prettyprint"><code>(SELECT TOP 1 [val] FROM @randomStuff ORDER BY NEWID()) </code></pre> ... it's all like, "ooooh, a cachable scalar subquery, I'm gonna cache that!" You need to trick the query engine into thinking it's non-cachable. jfar's answer was close, but the query engine was smart enough to see the tautalogy of <code>MyTable.MyColumn = MyTable.MyColumn</code>, but it ain't smart enough to see through this. <pre class="prettyprint"><code>UPDATE MyTable SET MyColumn = (SELECT TOP 1 val FROM @randomStuff r INNER JOIN MyTable _MT ON M.Id = _MT.Id ORDER BY NEWID()) FROM MyTable M </code></pre> By bringing in the outer table (MT) into the subquery, the query engine assumes subquery will need to be re-evaluated. Anything will work really, but I went with the (assumed) primary key of MyTable.Id since it'd be indexed and would add very little overhead. A cursor would probably be just as fast, but is most certainly not as fun.

How can I insert random values into a SQL Server table?

Tags:

sql

sql-server

tsql

random

I'm trying to randomly insert values from a list of pre-defined values into a table for testing. I tried using the solution found on this StackOverflow question:

stackoverflow.com/.../update-sql-table-with-random-value-from-other-table

When I I tried this, all of my "random" values that are inserted are exactly the same for all 3000 records.

When I run the part of the query that actually selects the random row, it does select a random record every time I run it by hand, so I know the query works. My best guesses as to what is happening are:

SQL Server is optimizing the SELECT somehow, not allowing the subquery to be evaluated more than once
The random value's seed is the same on every record the query updates

I'm stuck on what my options are. Am I doing something wrong, or is there another way I should be doing this?

This is the code I'm using:

DECLARE @randomStuff TABLE ([id] INT, [val] VARCHAR(100))

INSERT INTO @randomStuff ([id], [val]) 
VALUES ( 1,  'Test Value 1' )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 2,  'Test Value 2' )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 3,  'Test Value 3' )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 4,  'Test Value 4' )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 5,  'Test Value 5' )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 6,  null )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 7,  null )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 8,  null )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 9,  null )
INSERT INTO @randomStuff ([id], [val])
VALUES ( 10, null )

UPDATE MyTable
SET MyColumn = (SELECT TOP 1 [val] FROM @randomStuff ORDER BY NEWID())

852

asked Sep 23 '09 19:09

Dan Herbert

1 Answers

When the query engine sees this...

(SELECT TOP 1 [val] FROM @randomStuff ORDER BY NEWID())

... it's all like, "ooooh, a cachable scalar subquery, I'm gonna cache that!"

You need to trick the query engine into thinking it's non-cachable. jfar's answer was close, but the query engine was smart enough to see the tautalogy of MyTable.MyColumn = MyTable.MyColumn, but it ain't smart enough to see through this.

UPDATE MyTable
   SET MyColumn = (SELECT TOP 1 val
                     FROM @randomStuff r
                          INNER JOIN MyTable _MT
                                  ON M.Id = _MT.Id
                    ORDER BY NEWID())
 FROM MyTable M

By bringing in the outer table (MT) into the subquery, the query engine assumes subquery will need to be re-evaluated. Anything will work really, but I went with the (assumed) primary key of MyTable.Id since it'd be indexed and would add very little overhead.

A cursor would probably be just as fast, but is most certainly not as fun.

answered Oct 25 '22 20:10

Alex Papadimoulis

Related questions
                            
                                When to use Money data or Decimal Data type in sql server to store costing values? [duplicate]
                            
                                how to bind a list of tuples using Spring JDBCTemplate?
                            
                                mysql query "SHOW COLUMNS FROM table like 'colmunname'":questions
                            
                                SQLServer get top 1 row from subquery
                            
                                Convert a BINARY stored as VARCHAR to BINARY
                            
                                Difference between executeBatch() and executeLargeBatch() while using PreparedStatement
                            
                                PostgreSQL integer array value join to integer in other table with desc string
                            
                                check chars in varchar
                            
                                Why does the CAST() function return the wrong date?
                            
                                sql Check if column is substring of another column
                            
                                How to use subquery in the join function of Yii framework 2 ActiveRecord?
                            
                                SQL - If Select returns nothing then do another Select
                            
                                How to iterate through the result of a PLSQL Select
                            
                                dovecot password hashing with mysql 8 SHA2
                            
                                Inserting nested entities with autoincrement using SQL Server
                            
                                md5 in bigquery
                            
                                MySQL - Set default value for field as a string concatenation function
                            
                                Escape percentage sign DB2 SQL
                            
                                SQL Dump from DB2
                            
                                SQL Insert one row or multiple rows data?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With