How to delete duplicates in string for each row

Question

Here is my sample data:

V1
"a b c c c d"
"a a b b c d"
"a b c d e f"

I want this output:

V1
"a b c d"
"a b c d"
"a b c d e f"

My attempt,

paste(unique(unlist(strsplit(x, split=" "))))

removes duplicates across the entire data frame, whereas I need them removed row by row.

Ronak Shah · Accepted Answer

Use sapply instead of unlist, so that unique() runs on each row's words separately rather than on all rows pooled together:

df$V2 <- sapply(strsplit(df$V1, " "), function(x) paste0(unique(x), collapse = " "))

df
#           V1          V2
#1 a b c c c d     a b c d
#2 a a b b c d     a b c d
#3 a b c d e f a b c d e f

data

df <- structure(list(V1 = c("a b c c c d", "a a b b c d", "a b c d e f"
)), row.names = c(NA, -3L), class = "data.frame")
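
The same row-by-row idea can also be sketched with vapply, which checks that every row yields exactly one string. This is only a variant of the approach above, not a replacement for it; the helper name dedupe_words and its sep argument are hypothetical and not part of base R.

```r
# Sample data, same values as in the answer above
df <- data.frame(V1 = c("a b c c c d", "a a b b c d", "a b c d e f"),
                 stringsAsFactors = FALSE)

# dedupe_words is a hypothetical helper name used for illustration only
dedupe_words <- function(x, sep = " ") {
  # strsplit returns a list with one character vector of words per row
  tokens <- strsplit(x, sep, fixed = TRUE)
  # unique() is applied to each row's words separately, keeping first
  # occurrences in order; character(1) asserts one string per row
  vapply(tokens,
         function(tok) paste(unique(tok), collapse = sep),
         character(1))
}

df$V2 <- dedupe_words(df$V1)
df
```

By contrast, calling unlist on the result of strsplit flattens all rows into a single vector before unique sees it, which is why the attempt in the question deduplicated across the whole column.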
