I am faced with the following programming problem. I need to generate <code>n</code> <code>(a, b)</code> tuples for which the sum of all <code>a</code>'s is a given <code>A</code> and sum of all <code>b</code>'s is a given <code>B</code> and for each tuple the ratio of <code>a / b</code> is in the range <code>(c_min, c_max)</code>. <code>A / B</code> is within the same range, too. I am also trying to make sure there is no bias in the result other than what is introduced by the constraints and the <code>a / b</code> values are more-or-less uniformly distributed in the given range. Some clarifications and meta-constraints: <ul> <li> <code>A</code>, <code>B</code>, <code>c_min</code>, and <code>c_max</code> are given. </li> <li>The ratio <code>A / B</code> is in the <code>(c_min, c_max)</code> range. This has to be so if the problem is to have a solution given the other constraints.</li> <li>a and b are <code>>0</code> and non-integer.</li> </ul> I am trying to implement this in Python but ideas in any language (English included) are much appreciated.

Lots of good ideas here. Thanks! Rossum's idea seemed the most straightforward implementation-wise so I went for it. Here is the code for posterity: <pre class="prettyprint"><code>c_min = 0.25 c_max = 0.75 a_sum = 100.0 b_sum = 200.0 n = 1000 a = [a_sum / n] * n b = [b_sum / n] * n while not good_enough(a, b): i, j = random.sample(range(n), 2) li, ui = c_min * b[i] - a[i], c_max * b[i] - a[i] lj, uj = a[j] - c_min * b[j], a[j] - c_max * b[j] llim = max((li, uj)) ulim = min((ui, lj)) q = random.uniform(llim, ulim) a[i] += q a[j] -= q i, j = random.sample(range(n), 2) li, ui = a[i] / c_max - b[i], a[i] / c_min - b[i] lj, uj = b[j] - a[j] / c_max, b[j] - a[j] / c_min llim = max((li, uj)) ulim = min((ui, lj)) q = random.uniform(llim, ulim) b[i] += q b[j] -= q </code></pre> The <code>good_enough(a, b)</code> function can be a lot of things. I tried: <ul> <li>Standard deviation, which is hit or miss, as you don't know what is a good enough value.</li> <li>Kurtosis, where a large negative value would be nice. However, it is relatively slow to calculate and is undefined with the seed values of <code>(a_sum / n, b_sum / n)</code> (though that's trivial to fix).</li> <li>Skewness, where a value close to <code>0</code> is desirable. But it has the same drawbacks as kurtosis.</li> <li>A number of iterations proportional to <code>n</code>. <code>2n</code> sometimes wasn't enough, <code>n ^ 2</code> is a little bit of overkill and is, well, exponential.</li> </ul> Ideally, a heuristic using a combination of skewness and kurtosis would be best but I settled for making sure each value has been changed from the initial (again, as rossum suggested in a comment). Though there is no theoretical guarantee that the loop will complete, it seemed to work well enough for me.

Generating random numbers under very specific constraints

Tags:

python

algorithm

random

I am faced with the following programming problem. I need to generate n (a, b) tuples for which the sum of all a's is a given A and sum of all b's is a given B and for each tuple the ratio of a / b is in the range (c_min, c_max). A / B is within the same range, too. I am also trying to make sure there is no bias in the result other than what is introduced by the constraints and the a / b values are more-or-less uniformly distributed in the given range.

Some clarifications and meta-constraints:

A, B, c_min, and c_max are given.
The ratio A / B is in the (c_min, c_max) range. This has to be so if the problem is to have a solution given the other constraints.
a and b are >0 and non-integer.

I am trying to implement this in Python but ideas in any language (English included) are much appreciated.

855

asked Oct 26 '11 20:10

ktdrv

1 Answers

Lots of good ideas here. Thanks! Rossum's idea seemed the most straightforward implementation-wise so I went for it. Here is the code for posterity:

Click to copy

c_min = 0.25
c_max = 0.75
a_sum = 100.0
b_sum = 200.0
n = 1000 

a = [a_sum / n] * n
b = [b_sum / n] * n

while not good_enough(a, b):
    i, j = random.sample(range(n), 2)
    li, ui = c_min * b[i] - a[i], c_max * b[i] - a[i]
    lj, uj = a[j] - c_min * b[j], a[j] - c_max * b[j]
    llim = max((li, uj))
    ulim = min((ui, lj))
    q = random.uniform(llim, ulim)
    a[i] += q
    a[j] -= q

    i, j = random.sample(range(n), 2)
    li, ui = a[i] / c_max - b[i], a[i] / c_min - b[i]
    lj, uj = b[j] - a[j] / c_max, b[j] - a[j] / c_min
    llim = max((li, uj))
    ulim = min((ui, lj))
    q = random.uniform(llim, ulim)
    b[i] += q
    b[j] -= q

The good_enough(a, b) function can be a lot of things. I tried:

Standard deviation, which is hit or miss, as you don't know what is a good enough value.
Kurtosis, where a large negative value would be nice. However, it is relatively slow to calculate and is undefined with the seed values of (a_sum / n, b_sum / n) (though that's trivial to fix).
Skewness, where a value close to 0 is desirable. But it has the same drawbacks as kurtosis.
A number of iterations proportional to n. 2n sometimes wasn't enough, n ^ 2 is a little bit of overkill and is, well, exponential.

Ideally, a heuristic using a combination of skewness and kurtosis would be best but I settled for making sure each value has been changed from the initial (again, as rossum suggested in a comment). Though there is no theoretical guarantee that the loop will complete, it seemed to work well enough for me.

answered Oct 11 '22 02:10

ktdrv

Related questions
                            
                                Flask admin remember form value
                            
                                Despite installing the torch vision pytorch library, I am getting an error saying that there is no module named torch vision
                            
                                Getting coordinates of the closest data point on matplotlib plot
                            
                                Google cloud storage python client AttributeError: 'ClientOptions' object has no attribute 'scopes' occurs after deployment
                            
                                Merge two files and add computation and sorting the updated data in python
                            
                                Replacement for for... if array iteration
                            
                                Python UPnP/IGD Client Implementation?
                            
                                Why am I leaking memory with this python loop?
                            
                                How to display locale sensitive time format without seconds in python
                            
                                How do I create a named temporary file on windows in Python?
                            
                                Matplotlib: one line, plotted against two related x axes in different units?
                            
                                Overriding __cmp__, __eq__, and __hash__ for SQLAlchemy Declarative Base
                            
                                Python: Pickling a dict with some unpicklable items
                            
                                Twisted and Websockets: Beyond Echo
                            
                                Is there a Google Insights API? [closed]
                            
                                recursive crawling with Python and Scrapy
                            
                                Python not able to open file with non-english characters in path
                            
                                Python Idle and KeyboardInterrupts
                            
                                set python recursion limit for a function
                            
                                What Python accessible tools can you use to generate XSD from an XML document?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Generating random numbers under very specific constraints

Tags:

python

algorithm

random

ktdrv

People also ask

1 Answers

ktdrv

Recent Activity

Donate For Us