What is a chain in PyMC3?

Tags:

I am learning PyMC3 for Bayesian modeling. You can create a model and sample with:

import pandas as pd
import pymc3 as pm

# obs is a DataFrame with a single column, containing
# the observed values for variable height
obs = pd.DataFrame(...)

# we create a pymc3 model
with pm.Model() as m:
    mu = pm.Normal('mu', mu=178, sd=20)
    sigma = pm.Uniform('sigma', lower=0, upper=50)
    height = pm.Normal('height', mu=mu, sd=sigma, observed=obs)
    trace = pm.sample(1000, tune=1000)

pm.traceplot(trace)

pymc3 output

When I check the trace (in this case 1000 samples from the posterior probability), I notice that 2 chains are created:

>>> trace.nchains
2

I read the tutorial on PyMC3 and looked through the API but it is unclear to me what a chain represents (in this case I asked for 1000 samples from the posterior but I got 2 chains, each one with 1000 samples from the posterior).

Are the chains different runs of the sampler with the same parameters or do they have some other meaning/purpose?

963

asked Apr 13 '18 21:04

gc5

1 Answers

A chain is a single run of MCMC. So if you have six 2-d parameters in your model and ask for 1000 samples, you will get six 2x1000 arrays for each chain.

When running MCMC, it is a best practice to use multiple chains, as they can help diagnose problems. For example, the Gelman-Rubin diagnostic requires multiple chains, and runs automatically (using joblib, which tries to use multiple cores if possible) if you use more than 1 chain in PyMC3.

As a concrete example of when you might want multiple chains, consider sampling from a multimodal distribution. Even the NUTS sampler may not visit both modes in a single chain, but you could diagnose this using multiple chains.

multiple chains

Note that PyMC3 usually combines the chains when you work with them (e.g., using trace.get_values('my_var')), since they are all valid MCMC samples. This does lead to some confusing behavior in that asking for 1000 samples actually gets you 4000 on most systems, where you get 4 chains by default.

145

answered Oct 20 '22 16:10

colcarroll

Related questions
                            
                                python sqlite insert named parameters or null
                            
                                Creating a tree/deeply nested dict from an indented text file in python
                            
                                How do I crop to largest interior bounding box in OpenCV?
                            
                                Pip doesn't install latest available version from pypi (argparse in this case)
                            
                                Creating same random number sequence in Python, NumPy and R
                            
                                How to get SQLite result/error codes in Python
                            
                                How to solve the 10054 error
                            
                                Retrieve the command line arguments of the Python interpreter
                            
                                Most efficient way to remove multiple substrings from string?
                            
                                Customize location of .so file generated by Cython
                            
                                How to cope with the performance of generating signed URLs for accessing private content via CloudFront?
                            
                                In locust How to get a response from one task and pass it to other task
                            
                                np.isnan on arrays of dtype "object"
                            
                                Difference between web-based and executable installers for Python 3 on Windows
                            
                                docker python custom module not found
                            
                                Connect MySQL with Python 3.6 [closed]
                            
                                Removing cached files after a pytest run
                            
                                Write to /tmp directory in aws lambda with python
                            
                                pandas rolling window & datetime indexes: What does `offset` mean?
                            
                                Tesseract OCR fails to detect varying font size and letters that are not horizontally aligned

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is a chain in PyMC3?

Tags:

python

markov-chains

bayesian

pymc3

gc5

People also ask

1 Answers

colcarroll

Recent Activity

Donate For Us