Pseudo Random Repeatable Sort in SQL Server (not NEWID() and not RAND())

Q: Is SQL Newid random?

SQL Server NewId() generates a random GUID or unique identifier which can be used to return randomized rows from a SELECT query. T-SQL developers will realize that the return list of a SQL SELECT query is sorted randomly when they place "NEWID() function in the "ORDER BY" clause of the SELECT statement.

Q: What is Newid function in SQL?

Using NEWID in a CREATE TABLE statement. Applies to: SQL Server. The following example creates the cust table with a uniqueidentifier data type, and uses NEWID to fill the table with a default value. In assigning the default value of NEWID() , each new and existing row has a unique value for the CustomerID column.

Q: Can newid () be duplicate?

Long story in short: NEWID() can generate duplicate value, however, the probability is one in a billion which is very negligible.

Q: Which procedure is used for random number generation in SQL?

SQL Server RAND() Function The RAND() function returns a random number between 0 (inclusive) and 1 (exclusive).

Tags:

sql

sql-server

tsql

random

paging

I would like to randomly sort a result in a repeatable fashion for purposes such as paging. For this NEWID() is too random in that the same results cannot be re-obtained. Order by Rand(seed) would be ideal as with the same seed the same random collection would result. Unfortunately, the Rand() state resets with every row, does anyone have a solution?

declare @seed as int;
set @seed = 1000;

create table temp (
id int,
date datetime)

insert into temp (id, date) values (1,'20090119')
insert into temp (id, date) values (2,'20090118')
insert into temp (id, date) values (3,'20090117')
insert into temp (id, date) values (4,'20090116')
insert into temp (id, date) values (5,'20090115')
insert into temp (id, date) values (6,'20090114')

-- re-seeds for every item
select *, RAND(), RAND(id+@seed) as r from temp order by r
--1 2009-01-19 00:00:00.000 0.277720118060575   0.732224964471124
--2 2009-01-18 00:00:00.000 0.277720118060575   0.732243597442382
--3 2009-01-17 00:00:00.000 0.277720118060575   0.73226223041364
--4 2009-01-16 00:00:00.000 0.277720118060575   0.732280863384898
--5 2009-01-15 00:00:00.000 0.277720118060575   0.732299496356156
--6 2009-01-14 00:00:00.000 0.277720118060575   0.732318129327415
-- Note how the last column is +=~0.00002

drop table temp

-- interestingly this works:
select RAND(@seed), RAND()
--0.732206331499865 0.306382810665955

Note, I tried Rand(ID) but that just turns out to be sorted. Apparently Rand(n) < Rand(n+1)

628

asked Jan 19 '09 16:01

ccook

2 Answers

Building off of gkrogers hash suggestion this works great. Any thoughts on performance?

declare @seed as int;
set @seed = 10;

create table temp (
id int,
date datetime)

insert into temp (id, date) values (1,'20090119')
insert into temp (id, date) values (2,'20090118')
insert into temp (id, date) values (3,'20090117')
insert into temp (id, date) values (4,'20090116')
insert into temp (id, date) values (5,'20090115')
insert into temp (id, date) values (6,'20090114')

-- re-seeds for every item
select *, HASHBYTES('md5',cast(id+@seed as varchar)) r
from temp order by r
--1 2009-01-19 00:00:00.000 0x6512BD43D9CAA6E02C990B0A82652DCA
--5 2009-01-15 00:00:00.000 0x9BF31C7FF062936A96D3C8BD1F8F2FF3
--4 2009-01-16 00:00:00.000 0xAAB3238922BCC25A6F606EB525FFDC56
--2 2009-01-18 00:00:00.000 0xC20AD4D76FE97759AA27A0C99BFF6710
--3 2009-01-17 00:00:00.000 0xC51CE410C124A10E0DB5E4B97FC2AF39
--6 2009-01-14 00:00:00.000 0xC74D97B01EAE257E44AA9D5BADE97BAF

drop table temp

EDIT: Note, the declaration of @seed as it's use in the query could be replace with a parameter or with a constant int if dynamic SQL is used. (declaration of @int in a TSQL fashion is not necessary)

178

answered Oct 21 '22 14:10

ccook

You can use a value from each row to re-evaluate the rand function:

Select *, Rand(@seed + id) as r from temp order by r

adding the ID ensures that the rand is reseeded for each row. But for a value of seed you will always get back the same sequence of rows (provided that the table does not change)

answered Oct 21 '22 12:10

Jack Ryan

Related questions
                            
                                Calling DB Function with Entity Framework 6
                            
                                SQL Server: Replace invalid XML characters from a VARCHAR(MAX) field
                            
                                Using Reactive MySQL Databases in Meteor (An Update?)
                            
                                ORDER BY alphanumeric characters only in SQLite
                            
                                Oracle SQL - can I return the "before" state of a column value
                            
                                Delete all tables in Derby DB
                            
                                How do I make DBIx::Class join tables using other operators than `=`?
                            
                                Using a table to provide enum values in MySQL?
                            
                                Greatest not null column
                            
                                SQL indexes for "not equal" searches
                            
                                What is the best way to implement a substring search in SQL?
                            
                                Postgres UPSERT (INSERT or UPDATE) only if value is different
                            
                                How can I make cx-oracle bind the results of a query to a dictionary rather than a tuple?
                            
                                Is varchar(128) better than varchar(100)
                            
                                MySQL LEFT JOIN with GROUP BY and WHERE IN (sub query)
                            
                                SQL Server table to json
                            
                                Postgresql JSON has key
                            
                                Is it possible for me to include a sub report in a tablix row that is grouped by an ID?
                            
                                Using SQL LocalDB in a Windows Service
                            
                                What's the reason / usefulness is to use ENABLE keyword in oracle database statements

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pseudo Random Repeatable Sort in SQL Server (not NEWID() and not RAND())

Tags:

sql

sql-server

tsql

random

paging

ccook

People also ask

2 Answers

ccook

Jack Ryan

Recent Activity

Donate For Us