Mutating dummy variables in dplyr

Tags:

r

dplyr

dummy-variable

I want to create 7 dummy variables, one for each day, using dplyr.

So far I have managed to do it with the to_dummy function from the sjmisc package, but it takes two steps: (1) create a data frame of dummies, (2) append it to the original data frame.

library(dplyr)  # provides bind_cols()

# Sample data frame
mydf <- data.frame(x = letters[1:9],
                   day = c("Mon","Tues","Wed","Thurs","Fri","Sat","Sun","Fri","Mon"))

# 1. Create the 7 dummy variables separately
daysdummy <- sjmisc::to_dummy(mydf$day, suffix = "label")

# 2. Append them to the data frame
mydf <- bind_cols(mydf, daysdummy)


> mydf
  x   day day_Fri day_Mon day_Sat day_Sun day_Thurs day_Tues day_Wed
1 a   Mon       0       1       0       0         0        0       0
2 b  Tues       0       0       0       0         0        1       0
3 c   Wed       0       0       0       0         0        0       1
4 d Thurs       0       0       0       0         1        0       0
5 e   Fri       1       0       0       0         0        0       0
6 f   Sat       0       0       1       0         0        0       0
7 g   Sun       0       0       0       1         0        0       0
8 h   Fri       1       0       0       0         0        0       0
9 i   Mon       0       1       0       0         0        0       0

My question is whether I can do this in a single workflow using dplyr, with to_dummy inside the pipe, perhaps using mutate?

*to_dummy documentation

asked Mar 14 '18 11:03

Lefkios Paikousis

1 Answer

If you want to do this with the pipe, you can do something like:

library(dplyr)
library(sjmisc)

mydf %>% 
  to_dummy(day, suffix = "label") %>% 
  bind_cols(mydf) %>% 
  select(x, day, everything())

Returns:

# A tibble: 9 x 9
  x     day   day_Fri day_Mon day_Sat day_Sun day_Thurs day_Tues day_Wed
  <fct> <fct>   <dbl>   <dbl>   <dbl>   <dbl>     <dbl>    <dbl>   <dbl>
1 a     Mon        0.      1.      0.      0.        0.       0.      0.
2 b     Tues       0.      0.      0.      0.        0.       1.      0.
3 c     Wed        0.      0.      0.      0.        0.       0.      1.
4 d     Thurs      0.      0.      0.      0.        1.       0.      0.
5 e     Fri        1.      0.      0.      0.        0.       0.      0.
6 f     Sat        0.      0.      1.      0.        0.       0.      0.
7 g     Sun        0.      0.      0.      1.        0.       0.      0.
8 h     Fri        1.      0.      0.      0.        0.       0.      0.
9 i     Mon        0.      1.      0.      0.        0.       0.      0.
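
The question also asks whether mutate alone could do this. Below is a minimal sketch, assuming only dplyr and rlang (a dplyr dependency). The names days, dummy_exprs and result are illustrative and not part of sjmisc; this is one possible way, not the to_dummy approach itself:

```r
library(dplyr)

# Sample data from the question (strings kept as character, an assumption)
mydf <- data.frame(
  x = letters[1:9],
  day = c("Mon", "Tues", "Wed", "Thurs", "Fri", "Sat", "Sun", "Fri", "Mon"),
  stringsAsFactors = FALSE
)

# Distinct day labels, one dummy column per label
days <- sort(unique(mydf$day))

# Build one unevaluated expression per day, e.g. as.integer(day == "Fri")
dummy_exprs <- lapply(days, function(d) rlang::expr(as.integer(day == !!d)))
names(dummy_exprs) <- paste0("day_", days)

# Splice the named expressions into a single mutate() call
result <- mydf %>% mutate(!!!dummy_exprs)
```

Each new column is an integer 0/1 indicator, and the original x and day columns stay in place, so no bind_cols or select step is needed.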

With dplyr and tidyr we could do:

library(dplyr)
library(tidyr)

mydf %>% 
  mutate(var = 1) %>% 
  spread(day, var, fill = 0, sep = "_") %>% 
  left_join(mydf, by = "x") %>% 
  select(x, day, everything())
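
In current tidyr, spread is superseded by pivot_wider. A rough equivalent might look like the sketch below, assuming tidyr 1.0.0 or later. The helper column day_name and the object dummies are illustrative names; copying day first keeps the original column, so no join is needed:

```r
library(dplyr)
library(tidyr)

# Sample data from the question
mydf <- data.frame(
  x = letters[1:9],
  day = c("Mon", "Tues", "Wed", "Thurs", "Fri", "Sat", "Sun", "Fri", "Mon"),
  stringsAsFactors = FALSE
)

dummies <- mydf %>%
  # var holds the indicator value; day_name is a copy that pivot_wider consumes
  mutate(var = 1, day_name = day) %>%
  pivot_wider(
    names_from   = day_name,
    values_from  = var,
    names_prefix = "day_",
    values_fill  = list(var = 0)  # absent combinations become 0 instead of NA
  )
```

Here the dummy columns appear in order of first occurrence of each day rather than alphabetically.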

And with base R we could do something like:

as.data.frame.matrix(table(rep(mydf$x, lengths(mydf$day)), unlist(mydf$day)))

Returns:

  Fri Mon Sat Sun Thurs Tues Wed
a   0   1   0   0     0    0   0
b   0   0   0   0     0    1   0
c   0   0   0   0     0    0   1
d   0   0   0   0     1    0   0
e   1   0   0   0     0    0   0
f   0   0   1   0     0    0   0
g   0   0   0   1     0    0   0
h   1   0   0   0     0    0   0
i   0   1   0   0     0    0   0
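
The table above contains only the dummies. If they should sit next to the original columns, another base R option is model.matrix. This is a sketch of a different technique than the table call above; the names dummies and result are illustrative:

```r
# Sample data from the question
mydf <- data.frame(
  x = letters[1:9],
  day = c("Mon", "Tues", "Wed", "Thurs", "Fri", "Sat", "Sun", "Fri", "Mon"),
  stringsAsFactors = FALSE
)

# "- 1" drops the intercept so every day gets its own 0/1 column
dummies <- model.matrix(~ day - 1, data = mydf)

# model.matrix names the columns dayFri, dayMon, ...; insert an underscore
colnames(dummies) <- sub("^day", "day_", colnames(dummies))

# Append the dummy columns to the original data frame
result <- cbind(mydf, dummies)
```

Without the "- 1", model.matrix would treat the first level as the reference category and return an intercept plus six columns instead of seven.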

answered Oct 20 '22 16:10

tyluRp
