Random record from a database table (T-SQL)

Tags:

Is there a succinct way to retrieve a random record from a sql server table?

I would like to randomize my unit test data, so am looking for a simple way to select a random id from a table. In English, the select would be "Select one id from the table where the id is a random number between the lowest id in the table and the highest id in the table."

I can't figure out a way to do it without have to run the query, test for a null value, then re-run if null.

Ideas?

999

asked Oct 10 '08 13:10

Jeremy

2 Answers

Is there a succinct way to retrieve a random record from a sql server table?

Yes

SELECT TOP 1 * FROM table ORDER BY NEWID()

Explanation

A NEWID() is generated for each row and the table is then sorted by it. The first record is returned (i.e. the record with the "lowest" GUID).

Notes

GUIDs are generated as pseudo-random numbers since version four:
The version 4 UUID is meant for generating UUIDs from truly-random or pseudo-random numbers.

The algorithm is as follows:
- Set the two most significant bits (bits 6 and 7) of the clock_seq_hi_and_reserved to zero and one, respectively.
- Set the four most significant bits (bits 12 through 15) of the time_hi_and_version field to the 4-bit version number from Section 4.1.3.
- Set all the other bits to randomly (or pseudo-randomly) chosen values.
—A Universally Unique IDentifier (UUID) URN Namespace - RFC 4122
The alternative SELECT TOP 1 * FROM table ORDER BY RAND() will not work as one would think. RAND() returns one single value per query, thus all rows will share the same value.
While GUID values are pseudo-random, you will need a better PRNG for the more demanding applications.
Typical performance is less than 10 seconds for around 1,000,000 rows — of course depending on the system. Note that it's impossible to hit an index, thus performance will be relatively limited.

answered Sep 18 '22 21:09

Sklivvz

On larger tables you can also use TABLESAMPLE for this to avoid scanning the whole table.

SELECT  TOP 1 * FROM YourTable TABLESAMPLE (1000 ROWS) ORDER BY NEWID()

The ORDER BY NEWID is still required to avoid just returning rows that appear first on the data page.

The number to use needs to be chosen carefully for the size and definition of table and you might consider retry logic if no row is returned. The maths behind this and why the technique is not suited to small tables is discussed here

answered Sep 20 '22 21:09

Martin Smith

Related questions
                            
                                SQL Server Plans : difference between Index Scan / Index Seek
                            
                                Oracle: is there a tool to trace queries, like Profiler for sql server? [closed]
                            
                                How to Use UTF-8 Collation in SQL Server database?
                            
                                How to export SQL Server database to MySQL? [duplicate]
                            
                                SQL Server: PRINT output doesn't appear immediately
                            
                                Best data store for billions of rows
                            
                                SQL Server ORDER BY date and nulls last
                            
                                Functions vs Stored Procedures
                            
                                The database owner SID recorded in the master database differs from the database owner SID
                            
                                Subquery using Exists 1 or Exists *
                            
                                Is there a LastIndexOf in SQL Server?
                            
                                Get month and year from a datetime in SQL Server 2005
                            
                                Maximum size of a varchar(max) variable
                            
                                Why does Sql Server keep executing after raiserror when xact_abort is on?
                            
                                Must declare the scalar variable
                            
                                Does the order of columns matter in a group by clause?
                            
                                How to install SQL Server Management Studio 2012 (SSMS) Express?
                            
                                Is it necessary to use # for creating temp tables in SQL server?
                            
                                VarBinary vs Image SQL Server Data Type to Store Binary Data?
                            
                                Is there a way to suppress "x rows affected" in SQLCMD from the command line?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Random record from a database table (T-SQL)

Tags:

sql-server

tsql

random