Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How (and why) do you use contrasts?

Under what cases do you create contrasts in your analysis? How is it done and what is it used for?

I checked ?contrasts and ?C - both lead to "Chapter 2 of Statistical Models in S", which is not readily available to me.

like image 626
Tal Galili Avatar asked Feb 28 '10 20:02

Tal Galili


People also ask

How do you use contrasts?

1 : to show noticeable differences Red contrasts with black. 2 : to compare two persons or things so as to show the differences between them Contrast the styles of these two authors. 1 : something that is different from another Today's weather is quite a contrast to yesterday's.

What does contrasts () do in R?

A contrast is a linear combination of variables that allows comparison of different treatments. Categorical variables are entered into a regression analysis as a sequence of \(n-1\) variables. Generally, these are dummy variables. There are four built-in contrast coding schemes in R.

What is contrast in design of experiments?

Experiments | Contrast. The contrast is an important part of the experimental design. It describes the key differences that are being compared within an experiment. A general example would be "Task > Rest", where the contrast is the difference between the active task and rest.

What are contrasts in regression?

In statistics, particularly in analysis of variance and linear regression, a contrast is a linear combination of variables (parameters or statistics) whose coefficients add up to zero, allowing comparison of different treatments.


2 Answers

Contrasts are needed when you fit linear models with factors (i.e. categorical variables) as explanatory variables. The contrast specifies how the levels of the factors will be coded into a family of numeric dummy variables for fitting the model.

Here are some good notes for the different varieties of contrasts used: http://www.unc.edu/courses/2006spring/ecol/145/001/docs/lectures/lecture26.htm

When the contrasts used are changed, the model remains the same in terms of the underlying joint probability distributions allowed. Only its parametrization changes. The fitted values remain the same as well. Also, once you have the value of the parameters for one choice of contrasts, it is easy to derive what the value of the parameters for another choice of contrasts would have been.

Therefore the choice of contrasts has no statistical consequence. It is purely a matter of making coefficients and hypothesis tests easier to interpret.

like image 106
Jyotirmoy Bhattacharya Avatar answered Sep 24 '22 06:09

Jyotirmoy Bhattacharya


Take a look here (pages 365-370, which are free to view). On page 364 starts a one-way analysis of variance of a plant competition experiment. The code from page 364 that is missing is:

comp<-read.table("c:\\temp\\competition.txt",header=T) attach(comp) names(comp) [1] "biomass" "clipping" The categorical explanatory variable is clipping and it has five levels as follows: levels(clipping) ... 

The definition of contrasts is given later, on page 368 (in the homonym paragraph). If you want to follow the examples, you can download the datasets. See my post here.

It is out of the question that the book is highly recommended.

like image 35
George Dontas Avatar answered Sep 26 '22 06:09

George Dontas