How to generate random numbers with each random number having a difference of at least x with all other elements?

Tags:

I know this goes against the definition of random numbers, but still I require this for my project. For instance, I want to generate an array with 5 random elements in range(0, 200).

Now, I want each of the elements to have a difference of at least 15 between them. So the random array should look something like this:

[15, 45, 99, 132, 199]

I can generate random numbers using numpy:

np.random.uniform(low=0, high=200, size=5)

However, I am not able to keep a consistent difference of at least 15.

887

asked Nov 30 '18 21:11

3 Answers

How about trial-and-error? e.g. throw some random numbers, sort, compute differences... and if too small repeat?

import random as r

def spreadRandom(theRange, howMany, minSpacing):
    while True:
        candidate = sorted([r.randint(*theRange) for _ in range(howMany)])
        minDiff = min([ candidate[i+1]-candidate[i] for i, _ in enumerate(candidate[:-1])])
        if minDiff >= minSpacing:
            return candidate

spreadRandom([0,200], 5, 15)

You're not guaranteed to ever get an answer, but you're not biasing your numbers at all as you might be by enforcing ranges based on neighboring numbers.

answered Oct 26 '22 09:10

It would be nice if the question showed more effort towards solving the problem (i.e. from the Stack Overflow Tour: "Don't ask about... Questions you haven't tried to find an answer for (show your work!)"), but sometimes a question triggers an itch you just have to scratch...

Here's one way you could do it, written as the function random_spaced:

import numpy as np


def random_spaced(low, high, delta, n, size=None):
    """
    Choose n random values between low and high, with minimum spacing delta.

    If size is None, one sample is returned.
    Set size=m (an integer) to return m samples.

    The values in each sample returned by random_spaced are in increasing
    order.
    """
    empty_space = high - low - (n-1)*delta
    if empty_space < 0:
        raise ValueError("not possible")

    if size is None:
        u = np.random.rand(n)
    else:
        u = np.random.rand(size, n)
    x = empty_space * np.sort(u, axis=-1)
    return low + x + delta * np.arange(n)

For example,

In [27]: random_spaced(0, 200, 15, 5)
Out[27]: array([ 30.3524969 ,  97.4773284 , 140.38221631, 161.9276264 , 189.3404236 ])

In [28]: random_spaced(0, 200, 15, 5)
Out[28]: array([ 81.01616136, 103.11710522, 118.98018499, 141.68196775, 169.02965952])

The size argument lets you generate more than one sample at a time:

In [29]: random_spaced(0, 200, 15, 5, size=3)
Out[29]: 
array([[ 52.62401348,  80.04494534,  96.21983265, 138.68552066, 178.14784825],
       [  7.57714106,  33.05818556,  62.59831316,  81.86507168, 180.30946733],
       [ 24.16367913,  40.37480075,  86.71321297, 148.24263974, 195.89405713]])

This code generates a histogram for each component using 100000 samples, and plots the corresponding theoretical marginal PDFs of each component:

import matplotlib.pyplot as plt
from scipy.stats import beta

low = 0
high = 200
delta = 15 
n = 5
s = random_spaced(low, high, delta, n, size=100000)

for k in range(s.shape[1]):
    plt.hist(s[:, k], bins=100, density=True, alpha=0.25)
plt.title("Normalized marginal histograms and marginal PDFs")
plt.grid(alpha=0.2)

# Plot the PDFs of the marginal distributions of each component.
# These are beta distributions.
for k in range(n):
    left = low + k*delta
    right = high - (n - k - 1)*delta
    xx = np.linspace(left, right, 400)
    yy = beta.pdf(xx, k + 1, n - k, loc=left, scale=right - left)
    plt.plot(xx, yy, 'k--', linewidth=1, alpha=0.25)
    if n > 1:
        # Mark the mode with a dot.
        mode0 = k/(n-1)
        mode = (right-left)*mode0 + left
        plt.plot(mode, beta.pdf(mode, k + 1, n - k, loc=left, scale=right - left),
                 'k.', alpha=0.25)

plt.show()

Here's the plot that it generates:

plot

As can be seen in the plot, the marginal distributions are beta distributions. The modes of the marginal distributions correspond to the positions of n evenly spaced points on the interval [low, high].

By fiddling with how u is generated in random_spaced, distributions with different marginals can be generated (an old version of this answer had an example), but the distribution that random_spaced currently generates seems to be a natural choice. As mentioned above, the modes of the marginals occur in "meaningful" positions. Moreover, in the trivial case where n is 1, the distribution simplifies to the uniform distribution on [low, high].

156

answered Oct 26 '22 09:10

Warren Weckesser

Try shuffling the numbers 0-200:

import random
numbers = list(range(200))
random.shuffle(numbers)
distant_numbers = [numbers[0]]
for number in numbers:
    if any(abs(number - x) < 15 for x in distant_numbers):
        continue
    distant_numbers.append(number)
    if len(distant_numbers) >= 5: break

Edit:

Here's a solution that uses z3 for massive overkill:

def spaced_randoms(n, d, R, first=None):
    solver = z3.SolverFor("QF_FD")
    numbers = [z3.Int("x{}".format(x)) for x in range(n)]
    for number in numbers:
        solver.add(number >= 0)
        solver.add(number <= R)
    for ii in range(n):
        for jj in range(ii+1,n):
            solver.add(z3.Or(numbers[ii] - numbers[jj] > d, numbers[ii] - numbers[jj] < -d))
    if first is not None:
        solver.add(numbers[0] == first)
    result = solver.check()
    if str(result) != "sat":
        raise Exception("Unsatisfiable")
    model = solver.model()
    return [model.get_interp(number) for number in numbers]

Call it like this for a random result:

import random
spaced_randoms(n, d, R, random.randint(0,R))

answered Oct 26 '22 08:10

Matthew Woodruff

Related questions
                            
                                python spark alternative to explode for very large data
                            
                                What does keras normalize axis argument does?
                            
                                Pandas delete first n rows until condition on columns is fulfilled
                            
                                Is pytest supposed to collect tests from dependency modules in a virtual environtment?
                            
                                Getting a list from Django into Javascript as an array
                            
                                Comparing two columns of pandas dataframe by np.char.find gives TypeError: string operation on non-string array
                            
                                Multiple inputs to Keras Sequential model
                            
                                How to send both image(ndarray) and string data in single ZMQ send request
                            
                                Python2 vs Python3: Different result when converting to datetime from timestamp
                            
                                group and classify words as well as characters
                            
                                pretty printing numpy ndarrays using unicode characters
                            
                                Tensorflow: ValueError: The last dimension of the inputs to `Dense` should be defined. Found `None`
                            
                                How to get the screen position of `QMainWindow` and print it?
                            
                                Python re.split on "," or " " [duplicate]
                            
                                Tooltips in Altair line charts
                            
                                Python 3 equivalent of Python 2 str.decode('hex') [duplicate]
                            
                                Google Analytics, Server-Side Tracking & Bot Filter
                            
                                Issues after Apache Airflow migration from 1.9.0 to 1.10.1
                            
                                Convert list of pyodbc.rows to pandas Dataframe takes very long time
                            
                                Can Python recognize changes to a file that it is running interactively?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to generate random numbers with each random number having a difference of at least x with all other elements?

Tags:

python

random

numpy

user5229506

People also ask

3 Answers

Joshua R.

Warren Weckesser

Matthew Woodruff

Recent Activity

Donate For Us