I have a data frame like this:
x=c("01","01","01","22","22","03","03","03","35","35")
y=c("f","f","f","m","m","m","m","m","f","f")
df=data.frame(spn=x, sex=y)
seems like:
spn sex
1 01 f
2 01 f
3 01 f
4 22 m
5 22 m
6 03 m
7 03 m
8 03 m
9 35 f
10 35 f
What I'd like to do is to sort the df$spn and let it appears only once. the appropriate df$sex as well, like:
spn sex
1 01 f
2 03 m
3 22 m
4 35 f
How could I do that? many many thanks!
df <- df[order(df$spn), ]
> df[!duplicated(df), ]
spn sex
1 01 f
6 03 m
4 22 m
9 35 f
df2 = df[!duplicated(df), ] # Remove duplicated rows.
df3 = df2[order(df2$spn), ] # Sort by the spn column.
df3
# spn sex
#1 01 f
#6 03 m
#4 22 m
#9 35 f
Use unique then order:
df <- unique(df)
df[order(df$spn), ]
Using dplyr, data.table:
library(dplyr)
unique(df) %>% arrange(spn)
# spn sex
# 1 01 f
# 2 03 m
# 3 22 m
# 4 35 f
library(data.table)
unique(setDT(df))[ order(spn), ]
# spn sex
# 1: 01 f
# 2: 03 m
# 3: 22 m
# 4: 35 f
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With