The documentation states:
Deterministic mode can have a performance impact, depending on your model.
My question is: what is meant by performance here, processing speed or model quality (i.e., minimal loss)? In other words, when I set manual seeds and make the model behave deterministically, does that lead to a longer training time until the minimal loss is reached, or is that minimal loss worse than when the model is non-deterministic?
For completeness' sake, I manually make the model deterministic by setting all of these properties:
import os, random
import numpy as np
import torch

def set_seed(seed):
    torch.manual_seed(seed)                     # seed the PyTorch CPU RNG
    torch.cuda.manual_seed_all(seed)            # seed every CUDA device
    torch.backends.cudnn.deterministic = True   # only deterministic cuDNN kernels
    torch.backends.cudnn.benchmark = False      # disable cuDNN autotuning
    np.random.seed(seed)                        # seed NumPy
    random.seed(seed)                           # seed Python's built-in RNG
    os.environ['PYTHONHASHSEED'] = str(seed)    # fix Python hash randomization
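A minimal usage sketch, assuming the function above and an arbitrary seed of 42; the call has to happen once, before the model, optimizer, and data loaders are constructed:

set_seed(42)  # example seed; keep it identical across runs you want to compare
# ... build the model, optimizer, and DataLoader, then train as usual ...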
Note that torch.backends.cudnn.deterministic = True controls only the cuDNN behavior, unlike torch.use_deterministic_algorithms(), which makes other PyTorch operations behave deterministically, too.
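If you want the stricter, framework-wide guarantee, here is a sketch under the assumption of CUDA 10.2 or later (the CUBLAS_WORKSPACE_CONFIG value is the one the PyTorch reproducibility notes require for deterministic cuBLAS):

import os
import torch

# Required for deterministic cuBLAS behavior on CUDA 10.2+; must be set
# before the first cuBLAS call (":16:8" is the lower-memory alternative).
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"

# Raise an error whenever an op has no deterministic implementation
# (pass warn_only=True to get a warning instead).
torch.use_deterministic_algorithms(True)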
A related point that comes up on the PyTorch forums: DataLoader is not automatically deterministic when num_workers > 0, because each worker process has its own RNG state.
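The reproducibility notes recommend seeding the workers and the sampling generator explicitly; a sketch, where my_dataset is an assumed Dataset instance and the batch size, worker count, and seed are arbitrary examples:

import random
import numpy as np
import torch
from torch.utils.data import DataLoader

def seed_worker(worker_id):
    # Each worker derives its seed from the base seed PyTorch assigns to it.
    worker_seed = torch.initial_seed() % 2**32
    np.random.seed(worker_seed)
    random.seed(worker_seed)

g = torch.Generator()
g.manual_seed(0)  # example seed; reuse the same value across runs

loader = DataLoader(
    my_dataset,              # assumed Dataset instance
    batch_size=32,
    num_workers=4,
    worker_init_fn=seed_worker,
    generator=g,
)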
From the PyTorch Reproducibility notes: completely reproducible results are not guaranteed across PyTorch releases, individual commits, or different platforms. Furthermore, results may not be reproducible between CPU and GPU executions, even when using identical seeds. However, there are some steps you can take to limit the number of sources of nondeterministic behavior for a specific platform, device, and PyTorch release.
Also note that PyTorch Lightning has its own deterministic=True flag on the Trainer. If you only plan to set a random seed for Python, NumPy, and PyTorch and you do not use PyTorch Lightning to train the model, the manual set_seed() above is sufficient; Lightning's seed_everything() covers the same libraries in a single call.
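If you do use Lightning, a minimal sketch (the seed value and max_epochs are arbitrary examples; seed_everything() seeds Python's random module, NumPy, and PyTorch, and workers=True also seeds DataLoader workers):

import pytorch_lightning as pl

pl.seed_everything(42, workers=True)  # example seed; also seeds dataloader workers

trainer = pl.Trainer(
    deterministic=True,  # ask PyTorch for deterministic algorithms where available
    max_epochs=10,       # example value
)
# trainer.fit(model, datamodule=dm)   # model and dm are assumed to exist already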
Performance here refers to the run time. cuDNN has several implementations for many operations; when cudnn.deterministic is set to True, you are telling cuDNN that you only want the deterministic implementations (or the ones believed to be deterministic). In a nutshell, when you do this you should expect the same results on the same system and device when feeding the same inputs. Why would it affect performance? cuDNN uses heuristics to choose an implementation, so how it behaves depends on your model; restricting the choice to deterministic implementations can increase the runtime, because a faster, non-deterministic implementation might otherwise have been picked at that point in the run.
Concerning your snippet: I use exactly that seeding, and it has worked well (in terms of reproducibility) across 100+ deep learning experiments.
"performance" in this context refer to run-time