
Ordered factors use orthogonal polynomial contrasts by default. In model output, the L and Q suffixes stand for the linear and quadratic terms. Unordered factors use "treatment" contrasts by default (although, strictly speaking, these are not contrasts).

Difference between ordered and unordered factor variables in R

Tags:

r

I have been trying to find the difference between ordered and unordered factor variables in R. This line in the documentation of ?factor confuses me in particular:

Ordered factors differ from factors only in their class, but methods and
the model-fitting functions treat the two classes quite differently.

The closest I have come to an answer is in the answers to these three questions:

Factors ordered vs. levels
Is there an advantage to ordering a categorical variable?
factor() command in R is for categorical variables with hierarchy level only?

In an answer to the first question above, @joran said that "A detailed summary of the statistical differences is probably way beyond the scope of a StackOverflow answer."

I'm not looking for a detailed summary here, but can anyone give a small and simple example demonstrating how ordered and unordered factors differ when used in methods and model-fitting functions?


asked Sep 30 '14 11:09

StrikeR


2 Answers

For background, read:
http://r.789695.n4.nabble.com/Models-with-ordered-and-unordered-factors-td4072225.html
http://www.stat.berkeley.edu/~s133/factors.html

answered Sep 18 '22 22:09

Suchit kumar

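The most immediately visible difference is in printing. An ordered factor prints its levels in the console joined by "<", which makes the ordering explicit. Level order also sets the order of labels in ggplot2 plots, although that holds for unordered factors too.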
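As a rough illustration of how methods treat the two classes, here is a minimal sketch. The data and the variable names (sizes, f_unordered, f_ordered) are made up for this example.

```r
# Hypothetical example data: the same values stored both ways
sizes <- c("small", "large", "medium", "small")
lv    <- c("small", "medium", "large")

f_unordered <- factor(sizes, levels = lv)
f_ordered   <- factor(sizes, levels = lv, ordered = TRUE)

print(f_unordered)  # Levels line lists: small medium large
print(f_ordered)    # Levels line shows: small < medium < large

class(f_unordered)  # "factor"
class(f_ordered)    # "ordered" "factor"

# Comparison and range methods are meaningful only for ordered factors
f_ordered < "large"   # TRUE FALSE TRUE TRUE
max(f_ordered)        # large

# For an unordered factor the same comparison returns NA for every
# element and raises a warning (suppressed here to keep output clean)
suppressWarnings(f_unordered < "large")
```

Calling max() on the unordered version stops with an error rather than a warning, which is why it is left out of the sketch.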

In terms of modelling, the contrasts generated for the two kinds of factor when fitting linear models are different. If you are looking for simple examples, I would suggest http://www.ats.ucla.edu/stat/r/library/contrast_coding.htm. Two items in that article give examples of the two schemes: item 1, Dummy Coding (unordered R factors), and item 4, Orthogonal Polynomial Coding (ordered R factors).
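The contrast matrices themselves can be inspected directly. This is a small sketch that assumes R's default settings for options("contrasts") have not been changed; the level names are made up.

```r
# Hypothetical three-level variable, stored both ways
lv <- c("low", "medium", "high")
f_unordered <- factor(lv, levels = lv)
f_ordered   <- factor(lv, levels = lv, ordered = TRUE)

# Default contrast functions: the first entry applies to unordered
# factors, the second to ordered factors
options("contrasts")

# Treatment (dummy) coding: one 0/1 column per non-reference level,
# with "low" as the reference
contrasts(f_unordered)

# Orthogonal polynomial coding: columns .L (linear) and .Q (quadratic)
contrasts(f_ordered)
```

The column names of these matrices are what end up appended to the variable name in a model's coefficient table.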

To summarise, dummy coding compares each level to a reference level when fitting a model (suited to variables such as gender or race), whereas polynomial coding tests for trends across the levels (suited to a variable such as income or education).
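To see the effect in a fitted model, here is a minimal sketch using simulated data. The variable names and the numbers are invented for illustration, and default contrast options are assumed.

```r
set.seed(1)  # hypothetical simulated data, for illustration only
lv   <- c("low", "medium", "high")
dose <- rep(lv, each = 10)
y    <- rep(c(1, 2, 3), each = 10) + rnorm(30, sd = 0.5)

d <- data.frame(
  y              = y,
  dose_unordered = factor(dose, levels = lv),
  dose_ordered   = factor(dose, levels = lv, ordered = TRUE)
)

fit_unordered <- lm(y ~ dose_unordered, data = d)
fit_ordered   <- lm(y ~ dose_ordered,   data = d)

# Treatment coding: "medium" and "high" are each compared with "low"
names(coef(fit_unordered))
# "(Intercept)" "dose_unorderedmedium" "dose_unorderedhigh"

# Polynomial coding: linear (.L) and quadratic (.Q) trend terms
names(coef(fit_ordered))
# "(Intercept)" "dose_ordered.L" "dose_ordered.Q"

# Only the parameterisation differs: both models fit the group means,
# so the fitted values agree
all.equal(fitted(fit_unordered), fitted(fit_ordered))
```

The two fits describe the same model. What changes is how the coefficients are interpreted, and therefore which hypotheses the individual t-tests in summary() address.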

The examples at the above link are in R, so they should illustrate your query well.

answered Sep 17 '22 22:09

Ankur Kanoria
