I have two factors. factor A have 2 level, factor B have 3 level. How to create the following design matrix? <pre class="prettyprint"><code> factorA1 factorA2 factorB1 factorB2 factorB3 [1,] 1 0 1 0 0 [2,] 1 0 0 1 0 [3,] 1 0 0 0 1 [4,] 0 1 1 0 0 [5,] 0 1 0 1 0 [6,] 0 1 0 0 1 </code></pre>

You have a couple of options: Use base and piece it together yourself: <pre class="prettyprint"><code>(iris.dummy<-with(iris,model.matrix(~Species-1))) (IRIS<-data.frame(iris,iris.dummy)) </code></pre> Or use the ade4 package as follows: <pre class="prettyprint"><code>dummy <- function(df) { require(ade4) ISFACT <- sapply(df, is.factor) FACTS <- acm.disjonctif(df[, ISFACT, drop = FALSE]) NONFACTS <- df[, !ISFACT,drop = FALSE] data.frame(NONFACTS, FACTS) } dat <-data.frame(eggs = c("foo", "foo", "bar", "bar"), ham = c("red","blue","green","red"), x=rnorm(4)) dummy(dat) ## x eggs.bar eggs.foo ham.blue ham.green ham.red ## 1 0.3365302 0 1 0 0 1 ## 2 1.1341354 0 1 1 0 0 ## 3 2.0489741 1 0 0 1 0 ## 4 1.1019108 1 0 0 0 1 </code></pre>

Assuming your data in in a data.frame called <code>dat</code>, let's say the two factors are given as in this example: <pre class="prettyprint"><code>> dat <- data.frame(f1=sample(LETTERS[1:3],20,T),f2=sample(LETTERS[4:5],20,T),id=1:20) > dat f1 f2 id 1 C D 1 2 B E 2 3 B E 3 4 A D 4 5 C E 5 6 C E 6 7 C D 7 8 B E 8 9 C D 9 10 A D 10 11 B E 11 12 C E 12 13 B D 13 14 B E 14 15 A D 15 16 C E 16 17 C D 17 18 C D 18 19 B D 19 20 C D 20 > dat$f1 [1] C B B A C C C B C A B C B B A C C C B C Levels: A B C > dat$f2 [1] D E E D E E D E D D E E D E D E D D D D Levels: D E </code></pre> You can use <code>outer</code> to get a matrix as you showed, for each factor: <pre class="prettyprint"><code>> F1 <- with(dat, outer(f1, levels(f1), `==`)*1) > colnames(F1) <- paste("f1",sep="=",levels(dat$f1)) > F1 f1=A f1=B f1=C [1,] 0 0 1 [2,] 0 1 0 [3,] 0 1 0 [4,] 1 0 0 [5,] 0 0 1 [6,] 0 0 1 [7,] 0 0 1 [8,] 0 1 0 [9,] 0 0 1 [10,] 1 0 0 [11,] 0 1 0 [12,] 0 0 1 [13,] 0 1 0 [14,] 0 1 0 [15,] 1 0 0 [16,] 0 0 1 [17,] 0 0 1 [18,] 0 0 1 [19,] 0 1 0 [20,] 0 0 1 </code></pre> Now do the same for the second factor: <pre class="prettyprint"><code>> F2 <- with(dat, outer(f2, levels(f2), `==`)*1) > colnames(F2) <- paste("f2",sep="=",levels(dat$f2)) </code></pre> And <code>cbind</code> them to get the final result: <pre class="prettyprint"><code>> cbind(F1,F2) </code></pre>

How to create design matrix in r

Q: What is a model matrix R?

In R, 'model. matrix' is a useful tool for seeing the design matrices that are in play when you build regression models. Build a simple data frame. First, build a simple data frame with time as a factor and Time as a continuous, numeric variable. The two variables look alike when you print the data frame.

Q: What is design matrix example?

For example, suppose an experiment is run where 10 people are pulled off the street and asked four questions. The data matrix M would be a 10×4 matrix (meaning 10 rows and 4 columns). The datum in row i and column j of this matrix would be the answer of the i th person to the j th question.

Q: What does a design matrix do?

The purpose of the design matrix is to allow models that further constrain parameter sets. These constraints provide additional flexibility in modeling and allows researchers to build models that cannot be derived using the simple PIMs in .

Tags:

r

I have two factors. factor A have 2 level, factor B have 3 level.

How to create the following design matrix?

Click to copy

     factorA1 factorA2 factorB1 factorB2 factorB3
[1,]        1        0        1        0        0
[2,]        1        0        0        1        0
[3,]        1        0        0        0        1
[4,]        0        1        1        0        0
[5,]        0        1        0        1        0
[6,]        0        1        0        0        1

376

asked Mar 30 '13 19:03

Colin

2 Answers

You have a couple of options:

Use base and piece it together yourself:

Click to copy

(iris.dummy<-with(iris,model.matrix(~Species-1))) 
(IRIS<-data.frame(iris,iris.dummy))

Or use the ade4 package as follows:

Click to copy

dummy <- function(df) {
    require(ade4)
    ISFACT <- sapply(df, is.factor)
    FACTS <- acm.disjonctif(df[, ISFACT, drop = FALSE])
    NONFACTS <- df[, !ISFACT,drop = FALSE]
    data.frame(NONFACTS, FACTS)
}

dat <-data.frame(eggs = c("foo", "foo", "bar", "bar"),  
    ham = c("red","blue","green","red"), x=rnorm(4)) 
dummy(dat)



##           x eggs.bar eggs.foo ham.blue ham.green ham.red
## 1 0.3365302        0        1        0         0       1
## 2 1.1341354        0        1        1         0       0
## 3 2.0489741        1        0        0         1       0
## 4 1.1019108        1        0        0         0       1

191

answered Nov 15 '22 06:11

Tyler Rinker

Assuming your data in in a data.frame called dat, let's say the two factors are given as in this example:

Click to copy

> dat <- data.frame(f1=sample(LETTERS[1:3],20,T),f2=sample(LETTERS[4:5],20,T),id=1:20)
> dat
   f1 f2 id
1   C  D  1
2   B  E  2
3   B  E  3
4   A  D  4
5   C  E  5
6   C  E  6
7   C  D  7
8   B  E  8
9   C  D  9
10  A  D 10
11  B  E 11
12  C  E 12
13  B  D 13
14  B  E 14
15  A  D 15
16  C  E 16
17  C  D 17
18  C  D 18
19  B  D 19
20  C  D 20
> dat$f1
 [1] C B B A C C C B C A B C B B A C C C B C
Levels: A B C
> dat$f2
 [1] D E E D E E D E D D E E D E D E D D D D
Levels: D E

You can use outer to get a matrix as you showed, for each factor:

Click to copy

> F1 <- with(dat, outer(f1, levels(f1), `==`)*1)
> colnames(F1) <- paste("f1",sep="=",levels(dat$f1))
> F1
      f1=A f1=B f1=C
 [1,]    0    0    1
 [2,]    0    1    0
 [3,]    0    1    0
 [4,]    1    0    0
 [5,]    0    0    1
 [6,]    0    0    1
 [7,]    0    0    1
 [8,]    0    1    0
 [9,]    0    0    1
[10,]    1    0    0
[11,]    0    1    0
[12,]    0    0    1
[13,]    0    1    0
[14,]    0    1    0
[15,]    1    0    0
[16,]    0    0    1
[17,]    0    0    1
[18,]    0    0    1
[19,]    0    1    0
[20,]    0    0    1

Now do the same for the second factor:

Click to copy

> F2 <- with(dat, outer(f2, levels(f2), `==`)*1)
> colnames(F2) <- paste("f2",sep="=",levels(dat$f2))

And cbind them to get the final result:

Click to copy

> cbind(F1,F2)

answered Nov 15 '22 05:11

Ferdinand.kraft

Related questions
                            
                                Comparing two linear models with anova() in R [closed]
                            
                                How are BRR weights used in the survey package for R?
                            
                                shading certain area in graph - a combination of lines and point plots in base R
                            
                                Rpy2 Installation issue, windows 7 [duplicate]
                            
                                R - how do I change font style and size on my two x-axes which I have moved to the top of the graph?
                            
                                Draw multiple Bootstrap curves in R
                            
                                Why doesn't rle accept a factor as input?
                            
                                Linebreak in R plot legend behavior
                            
                                How to avoid vector search in data.table
                            
                                inset \footnote{} into header with xtable and tabular.environment
                            
                                R does ggplot2 have interactivity option?
                            
                                Determining derivatives from GAM smooth object
                            
                                How to put square brackets and a subscript next to each other in R expression?
                            
                                Reshaping wide dataset in interval format
                            
                                Find all possible ways to split a list of elements into a a given number of group of the same size
                            
                                ggplot2/colorbrewer qualitative pallette with 125 categories
                            
                                Producing RNG vectors in R that have pre-defined sum of pdf or sum of cdf
                            
                                combining column number having same values
                            
                                Wrapping plots in another html container within an Rmd file
                            
                                How to set up a shortcut for different versions of R in RStudio?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to create design matrix in r

Tags:

r

Colin

People also ask

2 Answers

Tyler Rinker

Ferdinand.kraft

Recent Activity

Donate For Us