Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

String Manipulation/Agregation in dplyr

Tags:

r

dplyr

I want to perform group_by and do a string operation for a data frame using dplyr

df<-data.frame(varx=c("x1","x1","x2","x2","x2"),vary=c("y1","y2","y3","y4","y5"))

I want the output (newdf) to look like this:

newdf <- data.frame(varx=c("x1","x2"),catY=c("y1,y2","y3,y4,y5"))

I tried the following in dplyr

df %>% group_by(varx)%>%summarise(catY=paste(vary))
Error: expecting a single value

Also tried the following:

df %>% group_by(varx)%>%mutate(catY=paste(vary))

Source: local data frame [5 x 3]
Groups: varx

I can do it using basic data frame operation. Need help in understanding a way out in dplyr.

like image 405
Pradeep Avatar asked Sep 11 '14 11:09

Pradeep


1 Answers

The slightly shorter version of David's comment would be:

library(dplyr)
df %>% group_by(varx) %>% summarise(catY = toString(vary))

#Source: local data frame [2 x 2]
#
#  varx       catY
#1   x1     y1, y2
#2   x2 y3, y4, y5
like image 139
talat Avatar answered Oct 01 '22 11:10

talat