Seeded Python RNG showing non-deterministic behavior with sets

Tags: python, random, set, python-2.7, non-deterministic

I'm seeing non-deterministic behavior when trying to select a pseudo-random element from sets, even though the RNG is seeded (example code shown below). Why is this happening, and should I expect other Python data types to show similar behavior?

Notes: I've only tested this on Python 2.7, but it's been reproducible on two different Windows computers.

Similar Issue: The question "Python random seed not working with Genetic Programming example code" may be related. Based on my testing, my hypothesis is that run-to-run differences in memory allocation within the sets lead to different elements being picked for the same RNG state.

So far I haven't found any mention of this kind of caveat/issue in the Python docs for set or random.

Example Code (randTest produces different output run-to-run):

import random


# Class containing a large set of pseudo-random numbers.
class bigSet:
    def __init__(self):
        self.a = set()
        for n in range(2000):
            self.a.add(random.random())


# Main test function.
def randTest():
    # Seed the PRNG.
    random.seed(0)

    # Create a set of bigSet instances, presumably many memory allocations.
    b = set()
    for n in range(2000):
        b.add(bigSet())

    # Pick a random value from a random bigSet. Would have expected this to be deterministic.
    c = random.sample(b, 1)[0]
    print('randVal: ' + str(random.random()))             # This value is always the same
    print('setSample: ' + str(random.sample(c.a, 1)[0]))  # This value can change run-to-run


asked Mar 30 '16 19:03

Amac26629

1 Answer

`OrderedSet` is the ideal choice.

Neither set nor frozenset should be used here, since nothing in their specification says that either is ordered. If another answer appears to work, that is only an accident of implementation. Sets are unordered, and relying on their iteration order couples your code to the Python version (and possibly the machine).
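A minimal sketch of why the order is not stable, assuming CPython, where an object without a custom `__hash__` is hashed by its identity. `Item` is a hypothetical stand-in for the question's `bigSet`, and `random.choice` on a list is swapped in for `random.sample` on a set.

```python
import random


class Item:
    """Hypothetical stand-in for bigSet: no __hash__, so hashing is identity-based."""

    def __init__(self, label):
        self.label = label


random.seed(0)
items = [Item(n) for n in range(2000)]

# Iteration order of this set follows the identity-based hashes, which can
# differ between interpreter runs, so these labels may change from run to run.
unordered = set(items)
print([item.label for item in unordered][:5])

# The list keeps insertion order, so the same seed always picks the same item.
picked = random.choice(items)
print(picked.label)
```

Running the script several times should show the second line staying fixed while the first may vary, which matches the behavior described in the question.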

I get a different order from Roland's answer in Python 3.8.6 (although the order between two runs happens to be the same). This is in spite of the fact that the random numbers generated are the same.

To preserve the order, and therefore determinism based on a random seed, you must use an ordered data structure such as OrderedSet.

OrderedSet is not part of the standard library. If you do not have it available, or if profiling shows it is too slow, you can use a collections.OrderedDict and ignore its values.
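The OrderedDict fallback could look roughly like this. It is a sketch under assumptions: `Box` and `pick` are hypothetical names, a private `random.Random` instance replaces the module-level RNG, and `None` is used as the ignored value.

```python
import random
from collections import OrderedDict


class Box:
    """Hypothetical element type that is hashed by identity, like bigSet."""

    def __init__(self, value):
        self.value = value


def pick(seed):
    """Fill an insertion-ordered 'set' and sample one element from it."""
    rng = random.Random(seed)
    boxes = OrderedDict()
    for _ in range(2000):
        boxes[Box(rng.random())] = None  # the value is a placeholder and is ignored
    # random.choice needs a sequence, so list the keys in insertion order.
    return rng.choice(list(boxes)).value


print(pick(0) == pick(0))  # True
```

Because the keys are iterated in insertion order rather than hash order, the chosen element depends only on the seed.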

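In CPython 3.6, regular dicts already preserve insertion order as a side effect of a more compact implementation, and from Python 3.7 onward this ordering is guaranteed by the language, so a plain dict works as well.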
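With a plain dict the same idea becomes very short. A minimal sketch, assuming Python 3.7 or later; the `values` list is made up for illustration.

```python
import random

values = ["c", "a", "b", "a", "c"]

# dict.fromkeys drops duplicates while keeping first-seen order.
ordered_unique = list(dict.fromkeys(values))
print(ordered_unique)  # ['c', 'a', 'b']

# Sampling from the ordered list is reproducible for a given seed.
random.seed(0)
first = random.choice(ordered_unique)
random.seed(0)
second = random.choice(ordered_unique)
print(first == second)  # True
```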


answered Oct 01 '22 07:10

danuker
