What scipy statistical test do I use to compare sample means?

Assuming sample sizes are not equal, what test do I use to compare sample means under the following circumstances (please correct if any of the following are incorrect):

Normal Distribution = True and Homogeneity of Variance = True

scipy.stats.ttest_ind(sample_1, sample_2)

Normal Distribution = True and Homogeneity of Variance = False

scipy.stats.ttest_ind(sample_1, sample_2, equal_var = False)

Normal Distribution = False and Homogeneity of Variance = True

scipy.stats.mannwhitneyu(sample_1, sample_2)

Normal Distribution = False and Homogeneity of Variance = False

???

asked Jul 31 '14 16:07

blahblahblah

1 Answer

Fast answer:

Normal Distribution = False and Homogeneity of Variance = False, with each sample larger than roughly 30-50 observations

scipy.stats.ttest_ind(sample1, sample2, equal_var=False)

Good answer:

If you check the Central limit theorem, it says (from Wikipedia): "In probability theory, the central limit theorem (CLT) states that, given certain conditions, the arithmetic mean of a sufficiently large number of iterates of independent random variables, each with a well-defined (finite) expected value and finite variance, will be approximately normally distributed, regardless of the underlying distribution"
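As a rough illustration of that statement (not a proof), here is a small simulation sketch. The exponential distribution, the sample sizes 2 and 50, and the helper names `sample_means` and `skewness` are all my own choices for the example, not something from the question. It draws many samples from a clearly skewed distribution and checks how skewed the distribution of their means is.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the run is repeatable


def sample_means(n, n_repeats=20_000):
    """Means of n_repeats independent samples of size n from an exponential distribution."""
    draws = rng.exponential(scale=1.0, size=(n_repeats, n))
    return draws.mean(axis=1)


def skewness(x):
    """Sample skewness: third central moment divided by variance**1.5."""
    centered = x - x.mean()
    return (centered**3).mean() / (centered**2).mean() ** 1.5


# A normal distribution has skewness 0; a single exponential draw has skewness 2.
skew_small = skewness(sample_means(2))
skew_large = skewness(sample_means(50))

print(f"skewness of the mean, n=2:  {skew_small:.2f}")
print(f"skewness of the mean, n=50: {skew_large:.2f}")
```

The skewness of the mean should shrink toward 0 as the sample size grows, which is the sense in which the mean becomes "approximately normal". How large a sample is needed depends on how far the underlying distribution is from normal.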

So, although your population is not normally distributed, if each sample is big enough (a common rule of thumb is more than 30 to 50 observations), then the sample mean will be approximately normally distributed. So, you can use:

scipy.stats.ttest_ind(sample1, sample2, equal_var=False)

This is a two-sided test for the null hypothesis that two independent samples have identical average (expected) values. With the option equal_var=False it performs Welch’s t-test, which does not assume equal population variances.
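A minimal sketch of that call on made-up data, assuming two skewed (exponential) samples of unequal size and unequal variance. The data, the seed, and the sizes 80 and 120 are illustrative only. The manual calculation at the end is there just to show what the statistic is: the difference in means divided by the unpooled standard error.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)

# Hypothetical data: non-normal, different spreads, different sample sizes.
sample1 = rng.exponential(scale=1.0, size=80)
sample2 = rng.exponential(scale=1.5, size=120)

# Welch's t-test: equal_var=False drops the equal-variance assumption.
result = stats.ttest_ind(sample1, sample2, equal_var=False)
t_stat, p_value = result.statistic, result.pvalue

# The same t statistic by hand, using unpooled sample variances (ddof=1).
se = np.sqrt(sample1.var(ddof=1) / len(sample1) + sample2.var(ddof=1) / len(sample2))
t_manual = (sample1.mean() - sample2.mean()) / se

print(f"t = {t_stat:.3f}, p = {p_value:.4f}, manual t = {t_manual:.3f}")
```

A small p-value is evidence against the hypothesis that the two population means are equal. With small or heavily skewed samples, the normal approximation for the means may be poor and the p-value should be read with care.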

answered Sep 19 '22 09:09

ivangtorre

Tags:

python

numpy

statistics

scipy