How to calculate 95% confidence intervals using Bootstrap method

Tags:

I'm trying to calculate the confidence interval for the mean value using the method of bootstrap in python. Let say I have a vector a with 100 entries and my aim is to calculate the mean value of these 100 values and its 95% confidence interval using bootstrap. So far I have manage to resample 1000 times from my vector using the np.random.choice function. Then for each bootstrap vector with 100 entries I calculated the mean. So now I have 1000 bootstrap mean values and a single sample mean value from my initial vector but I'm not sure how to proceed from here. How could I use these mean values to find the confidence interval for the mean value of my initial vector? I'm relatively new in python and it's the first time I came across with the method of bootstrap so any help would be much appreciated.

209

asked Nov 08 '16 15:11

Andriana

2 Answers

You could sort the array of 1000 means and use the 50th and 950th elements as the 90% bootstrap confidence interval.

Your set of 1000 means is basically a sample of the distribution of the mean estimator (the sampling distribution of the mean). So, any operation you could do on a sample from a distribution you can do here.

172

answered Sep 30 '22 01:09

Horia Coman

I have a simple statistical solution : Confidence intervals are based on the standard error. The standard error in your case is the standard deviation of your 1000 bootstrap means. Assuming a normal distribution of the sampling distribution of your parameter(mean), which should be warranted by the properties of the Central Limit Theorem, just multiply the equivalent z-score of the desired confidence interval with the standard deviation. Therefore:

lower boundary = mean of your bootstrap means - 1.96 * std. dev. of your bootstrap means

upper boundary = mean of your bootstrap means + 1.96 * std. dev. of your bootstrap means

95% of cases in a normal distribution sit within 1.96 standard deviations from the mean

hope this helps

answered Sep 30 '22 01:09

Bogdan Lalu

Related questions
                            
                                Python: Pandas Dataframe AttributeError: 'numpy.ndarray' object has no attribute 'fillna'
                            
                                ReportLab Paragraph and text formatting
                            
                                Can I use more than 26 letters in `numpy.einsum`?
                            
                                Hive Data to Pandas Data frame
                            
                                Celery beat not starting on Heroku
                            
                                python filter doesn't work
                            
                                How to append a tuple to a numpy array without it being preformed element-wise?
                            
                                Provide temporary PYTHONPATH on the commandline?
                            
                                AttributeError: probability estimates are not available for loss='hinge'
                            
                                PIP how escape character # in password?
                            
                                Using scipy.interpolate.interpn to interpolate a N-Dimensional array
                            
                                python pandas.Series.str.contains WHOLE WORD
                            
                                Image loses quality with cv2.warpPerspective
                            
                                Adding an extra hidden layer using Google's TensorFlow
                            
                                What does (n,) mean in the context of numpy and vectors?
                            
                                Redis locking for a KEY
                            
                                How to view the source code of numpy.random.exponential?
                            
                                How to get current user from a Django Channels web socket packet?
                            
                                how to save/crop detected faces in dlib python
                            
                                Mean of data scaled with sklearn StandardScaler is not zero

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to calculate 95% confidence intervals using Bootstrap method

Tags:

python

statistics

Andriana

People also ask

2 Answers

Horia Coman

Bogdan Lalu

Recent Activity

Donate For Us