I need some help using the scipy.stats.t.interval() function: http://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.t.html?highlight=stats.t#scipy.stats.t I am looking at the documentation, and it doesn't make sense to me. What are loc and scale? I'm used to Student's t intervals requiring a mean, standard deviation, degrees of freedom, and confidence level. If you know the answer and can help, please post. Also, if you could tell me how you learned it, that would be great; I've been having no luck with this documentation.


Interpreting SciPy function's meaning and usage: t.interval()

2 Answers

The docs page you linked has a link to the source code, which even has a nicely formatted formula for the distribution in the comments (search for class t_gen).

loc and scale are how all the continuous distributions in scipy.stats are parametrized. Basically, if X follows the standard distribution with density f(x), specifying loc and scale gives you the distribution of loc + scale*X, whose density is f((x - loc)/scale)/scale (line 1208 in the source linked above).

>>> import scipy.stats as stats
>>> stats.t.pdf(2, 2) 
0.06804138174397717
>>> stats.t.pdf(2, 2, loc=0, scale=1) 
0.06804138174397717
>>> stats.t.pdf(2+42, 2, loc=42, scale=1) 
0.06804138174397717

>>> stats.t.stats(9, moments='mvsk')
(array(0.0), array(1.2857142857142858), array(0.0), array(1.2))
>>> stats.t.stats(8, loc=1, moments='mvsk')
(array(1.0), array(1.3333333333333333), array(0.0), array(1.5))

>>> stats.t.interval(0.95, 4, loc=0)
(-2.7764451051977987, 2.7764451051977987)
>>> stats.t.interval(0.95, 4, loc=3)
(0.22355489480220125, 5.7764451051977987)

Yes, this is a little baffling at first sight :-).
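
The examples above only shift the distribution (scale=1). Here is a minimal sketch of what a non-unit scale does, assuming the usual loc + scale*X convention. The values of df, loc, scale and x are made up for illustration and are not from the original answer.

```python
import numpy as np
from scipy import stats

# Illustrative values only (assumptions, not from the original post).
df, loc, scale = 4, 3.0, 2.0
x = 5.0

# Density of loc + scale*T at x ...
shifted = stats.t.pdf(x, df, loc=loc, scale=scale)
# ... equals the standard density at the standardized point, divided by scale.
standard = stats.t.pdf((x - loc) / scale, df) / scale

# Interval endpoints transform the same way: loc + scale * (standard endpoint).
lo0, hi0 = stats.t.interval(0.95, df)
lo1, hi1 = stats.t.interval(0.95, df, loc=loc, scale=scale)

print(np.isclose(shifted, standard))
print(np.allclose([lo1, hi1], [loc + scale * lo0, loc + scale * hi0]))
```

The confidence level is passed positionally here because, as far as I know, the keyword name for that argument has changed between SciPy versions.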

answered Nov 07 '22 11:11

ev-br

Since the previous answer is not explicit, I did some research and verified that, when you want a confidence interval for a mean:

loc is the sample mean.

scale is the standard error of the mean.

Such that the interval is M ± t·sM

where M is the sample mean, t is the critical t value for the chosen confidence level and degrees of freedom (df = n - 1), and sM = √(std^2/n) = std/√n is the standard error of the mean.
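
To make this concrete, here is a small sketch that computes a 95% interval for a mean both ways: once with stats.t.interval and once by hand with M ± t·sM. The sample values are made up for illustration, and the variable names are my own.

```python
import numpy as np
from scipy import stats

# Made-up sample, purely for illustration.
sample = np.array([4.1, 5.3, 4.8, 5.9, 5.0, 4.6])

n = sample.size
mean = sample.mean()          # M, passed as loc
sem = stats.sem(sample)       # sM = std/sqrt(n) with ddof=1, passed as scale
dof = n - 1                   # degrees of freedom

# Interval straight from scipy.
ci_low, ci_high = stats.t.interval(0.95, dof, loc=mean, scale=sem)

# Same interval by hand: M ± t * sM, with t the two-sided critical value.
t_crit = stats.t.ppf(0.975, dof)
manual_low, manual_high = mean - t_crit * sem, mean + t_crit * sem

print(ci_low, ci_high)
print(manual_low, manual_high)
```

Both print statements should show the same pair of numbers, which is the check that loc and scale play the roles of the mean and the standard error here.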

answered Nov 07 '22 12:11

rossberto

Tags:

scipy

SwimBikeRun
