R repeat function until condition met

Tags:

I am trying to generate a random sample that excludes certain "bad data." I do not know whether the data is "bad" until after I sample it. Thus, I need to make a random draw from the population and then test it. If the data is "good" then keep it. If the data is "bad" then randomly draw another and test it. I would like to do this until my sample size reaches 25. Below is a simplified example of my attempt to write a function that does this. Can anyone please tell me what I am missing?

df <- data.frame(NAME=c(rep('Frank',10),rep('Mary',10)), SCORE=rnorm(20))
df

random.sample <- function(x) {
  x <- df[sample(nrow(df), 1), ]
  if (x$SCORE > 0) return(x)
 #if (x$SCORE <= 0) run the function again
}

random.sample(df)

899

asked Dec 10 '13 23:12

user1491868

4 Answers

Here is a general use of a while loop:

random.sample <- function(x) {
  success <- FALSE
  while (!success) {
    # do something
    i <- sample(nrow(df), 1)
    x <- df[sample(nrow(df), 1), ]
    # check for success
    success <- x$SCORE > 0
  }
  return(x)
}

An alternative is to use repeat (syntactic sugar for while(TRUE)) and break:

random.sample <- function(x) {
  repeat {
    # do something
    i <- sample(nrow(df), 1)
    x <- df[sample(nrow(df), 1), ]
    # exit if the condition is met
    if (x$SCORE > 0) break
  }
  return(x)
}

where break makes you exit the repeat block. Alternatively, you could have if (x$SCORE > 0) return(x) to exit the function directly.

116

answered Oct 14 '22 09:10

flodel

use this after your first sample

while (any(bad <- (x$SCORE <= 0)))
   x[bad, ] <- df[sample(nrow(df), sum(bad)), ]

answered Oct 14 '22 08:10

Ricardo Saporta

You can just select the rows to sample directly like so (just 5):

> df <- data.frame(NAME=c(rep('Frank',10),rep('Mary',10)), SCORE=rnorm(20))
> df[sample(which(df$SCORE>0), 5),]


 NAME     SCORE
14  Mary 1.0858854
10 Frank 0.7037989
16  Mary 0.7688913
5  Frank 0.2067499
17  Mary 0.4391216

this is without replacement, for bootstrap put in replace=T.

answered Oct 14 '22 10:10

Stephen Henderson

 random.sample <- function(x) {
   x <- df[sample(nrow(df), 1), ]
   if (x$SCORE > 0) return(x)
   Recall(x)# run the function again
 }

 random.sample(df)
#   NAME    SCORE
#14 Mary 1.252566

It seems to me that this should work as well:

 df$SCORE[ df$SCORE > 0 ][ sample(1:sum(df$SCORE > 0), 1) ]
#[1] 0.6579631

answered Oct 14 '22 09:10

IRTFM

Related questions
                            
                                Working with hundreths of a second using the chron package or modifying the precision
                            
                                How can I satisfy my woes with R's `:` operator?
                            
                                Emacs+ESS+R: How to have help page open in new buffer
                            
                                How to setup R with LyX?
                            
                                How to extract a specific frequency range from a .wav file?
                            
                                How to change discrete ratio data into ordinal data in R?
                            
                                R: using package by unzipping it instead of installing it
                            
                                Connect points in qplot by adjacent y value, not x value
                            
                                Lookup values in a vectorized way
                            
                                all x axis labels are not displaying in 45 degree
                            
                                Efficiently replace a fixed position substring with a string of equal or larger length
                            
                                R equivalent of MATLAB's fmincon for constrained optimization?
                            
                                R: specifying color for different facets / panels in lattice
                            
                                Make points "look" under surface in R using lattice and wireframe
                            
                                Permanently replacing a function
                            
                                Newey-West standard errors with Mean Groups/Fama-MacBeth estimator
                            
                                R split numeric vector at position
                            
                                ggplot group by one categorical variable and color by a second one
                            
                                dplyr select using logical
                            
                                Keyboard shortcut for inserting roxygen #' comment start

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

R repeat function until condition met

Tags:

function

r

conditional-statements

repeat