Simulation of t copula in Python

Tags:

I am trying to simulate a t-copula using Python, but my code yields strange results (is not well-behaving):

I followed the approach suggested by Demarta & McNeil (2004) in "The t Copula and Related Copulas", which states:

t copula simulation

By intuition, I know that the higher the degrees of freedom parameter, the more the t copula should resemble the Gaussian one (and hence the lower the tail dependency). However, given that I sample from scipy.stats.invgamma.rvs or alternatively from scipy.stats.chi2.rvs, yields higher values for my parameter s the higher my parameter df. This does not made any sense, as I found multiple papers stating that for df--> inf, t-copula --> Gaussian copula.

Here is my code, what am I doing wrong? (I'm a beginner in Python fyi).

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from scipy.stats import invgamma, chi2, t

#Define number of sampling points
n_samples = 1000
df = 10

calib_correl_matrix = np.array([[1,0.8,],[0.8,1]]) #I just took a bivariate correlation matrix here
mu = np.zeros(len(calib_correl_matrix))
s = chi2.rvs(df)
#s = invgamma.pdf(df/2,df/2) 
Z = np.random.multivariate_normal(mu, calib_correl_matrix,n_samples)
X = np.sqrt(df/s)*Z #chi-square method
#X = np.sqrt(s)*Z #inverse gamma method

U = t.cdf(X,df)

My outcomes behave exactly oppisite to what I am (should be) expecting: Higher df create much higher tail-dependency, here also visually:

 U_pd = pd.DataFrame(U)
 fig = plt.gcf()
 fig.set_size_inches(14.5, 10.5)
 pd.plotting.scatter_matrix(U_pd, figsize=(14,10), diagonal = 'kde')
 plt.show()

df=4: scatter_plot

df=100: enter image description here

It gets especially worse when using the invgamma.rvs directly, even though they should yield the same. For dfs>=30 I often receive a ValueError ("ValueError: array must not contain infs or NaNs")

Thank you very much for your help, much appreciated!

889

asked Jul 26 '18 10:07

rhonsprudel

1 Answers

There is one obvious problem in your code. Namely, this:

s = chi2.rvs(df)

Has to be changed to something like that:

s = chi2.rvs(df, size=n_samples)[:, np.newaxis]

Otherwise the variable s is just a single constant and your X ends up being a sample from the multivariate normal (scaled by np.sqrt(df/s)), rather than the t-distrubution that you need.

You most probably obtained your "tail-heavy" charts simply because you were unlucky and your sampled value of s ended up being too small. This has nothing to do with df, though, yet it seems that it is easier to hit the "unlucky" values when df is smaller.

179

answered Sep 27 '22 21:09

KT.

Related questions
                            
                                Plotting with scientific axis, changing the number of significant figures
                            
                                Algorithm to exchange the roles of two randomly chosen nodes from a tree moving pointers
                            
                                TensorFlow: Performing this loss computation
                            
                                Django Proxy Field
                            
                                Can we use serializer_class attribute with APIView(django rest framework)?
                            
                                How to save plots from multiple python scripts using an interactive C# process command?
                            
                                Python scraping of javascript web pages fails for https pages only
                            
                                Providing SSL Connections in Python using PKCS#11
                            
                                Efficient way to set elements to zero where mask is True on scipy sparse matrix
                            
                                Pandas uses substantially more memory for storage than asked for
                            
                                Debugging a Neural Network
                            
                                Numpy Apply Along Axis and Get Row Index
                            
                                (Installing Python 3.6.1) SSLError: SSL: TLSV1_ALERT_UNKNOWN_CA tlsv1 alert unknown ca
                            
                                Text[Multi-Level] Classification with many outputs
                            
                                Temporary images with Pyglet
                            
                                How to use the latest sqlite3 version in python
                            
                                Proxy Pooling System for Scrapy to temporarily stop using slow/timing out proxies
                            
                                How to use py_func with a function that returns dict
                            
                                What does "Broker transport failure" mean in kafka?
                            
                                Weird behaviour with groupby on ordered categorical columns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Simulation of t copula in Python

Tags:

python

simulation

scipy

rhonsprudel

People also ask

1 Answers

KT.

Recent Activity

Donate For Us