Subset n number of rows from a dataframe, based on a categorical variable, in R

Tags:

I have a dataframe (say x) in R:

> x
Height  Weight Gender
5     60    m
5     70    m
6     80    m
4     90    m
4     60    m
5     70    f
5     80    f
6     60    f
4     90    f
4     60    f

I need an R code that will produce a new dataframe, say y, that takes the subset of X by Gender and only the first three rows of each gender (1:3) to give the result as follows.

>y
Height  Weight Gender
5       60      m
5       70      m
6       80      m
5       70      f
5       80      f
6       60      f

405

asked Apr 22 '15 16:04

gg-14

1 Answers

Try slice from dplyr

library(dplyr)
x %>%
    group_by(Gender) %>% 
    slice(1:3)

Or using data.table

library(data.table)
setDT(x)[,.SD[1:3] , Gender]

189

answered Nov 03 '22 16:11

akrun

Related questions
                            
                                Determine if data frame is empty
                            
                                Using a vector's print method in a data frame
                            
                                Change font sizes with style sheets for RStudio presentation
                            
                                R set variable equal to what function returns. Re-evaluate variable again each time it is called [duplicate]
                            
                                Topic modelling in R using phrases rather than single words
                            
                                ggplot2 : printing multiple plots in one page with a loop
                            
                                Rvest error: type 'externalptr'
                            
                                tbl_df and data.frame difference when using loops
                            
                                Weird lines appearing in the R graph
                            
                                Separate a column into multiple columns using tidyr::separate with sep=""
                            
                                How to drop columns in a nested data frame in R?
                            
                                Multiple series barplot
                            
                                Which selector to write in rvest package in R?
                            
                                R data.table replace NA with mean for numeric columns and most frequent value for nominal values
                            
                                Doing absolute descending sort of data.table through function?
                            
                                Efficient calling of F95 in R: use .Fortran or .Call?
                            
                                How to calculate dynamic panel models with lfe package
                            
                                Compiling RMarkdown with RStudio: why reading .RProfile?
                            
                                Count based on multiple conditions from other data.frame
                            
                                how to automatically update a slot of S4 class in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Subset n number of rows from a dataframe, based on a categorical variable, in R

Tags:

r

subset

rows

gg-14

People also ask

1 Answers

akrun

Recent Activity

Donate For Us