<p>Given a dataframe <code>df</code> with a column called <code>group</code>, how do you randomly sample <code>k</code> groups from it in dplyr? It should return all rows from <code>k</code> groups (given there are at least <code>k</code> unique values in <code>df$group</code>), and every group in <code>df</code> should be equally likely to be returned.</p>

<p>Just use <code>sample()</code> to choose some number of groups</p> <pre class="prettyprint"><code>iris %>% filter(Species %in% sample(levels(Species),2)) </code></pre>

Randomly sample groups

Tags:

Given a dataframe df with a column called group, how do you randomly sample k groups from it in dplyr? It should return all rows from k groups (given there are at least k unique values in df$group), and every group in df should be equally likely to be returned.

796

asked May 10 '16 21:05

Big Dogg

1 Answers

Just use sample() to choose some number of groups

iris %>% filter(Species %in% sample(levels(Species),2))

167

answered Sep 19 '22 06:09

MrFlick

Related questions
                            
                                R: use of factor
                            
                                Make scale_y_log10 to have the tickmarks at 0.01,0.1,1
                            
                                subset rows with (1) ALL and (2) ANY columns larger than a specific value
                            
                                Is it possible to specify command line parameters to R-script in RStudio?
                            
                                Replicating a dataframe as a whole n times
                            
                                Read FASTA into a dataframe and extract subsequences of FASTA file
                            
                                How to append a whole dataframe to a CSV in R
                            
                                Shifting a column down by one
                            
                                Adding a column with consecutive numbers in R
                            
                                Convert sequence of longitude and latitude to polygon via sf in R
                            
                                How to print 1000 decimals places of pi value?
                            
                                Ping a website in R
                            
                                RCurl: HTTP Authentication When Site Responds With HTTP 401 Code Without WWW-Authenticate
                            
                                R foreach with .combine=rbindlist
                            
                                bigrams instead of single words in termdocument matrix using R and Rweka
                            
                                R error in glmnet: NA/NaN/Inf in foreign function call
                            
                                R/regex with stringi/ICU: why is a '+' considered a non-[:punct:] character?
                            
                                Split character column into several binary (0/1) columns
                            
                                Display only months in dateRangeInput or dateInput for a shiny app [R programming]
                            
                                Add sheet to Excel file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Randomly sample groups

Tags:

r

dplyr

Big Dogg

People also ask

1 Answers

MrFlick

Recent Activity

Donate For Us