Comparison of two vectors resulted after simulation

Tags:

I would like to apply the Rejection sampling method to simulate a random vector Y=(Y_1, Y_2) of a uniform distribution from a unit disc D = { (X_1 , X_2) \in R^2: \sqrt{x^2_1 + x^2_2} ≤ 1} such that X = (X_1 , X_ 2) is random vector of a uniform distribution in the square S = [−1, 1]^2 and the joint density f(y_1,y_2) = \frac{1}{\pi} 1_{D(y_1,y_2)}.

In the rejection method, we accept a sample generally if f(x) \leq C * g(x). I am using the following code to :

x=runif(100,-1,1)
y=runif(100,-1,1)

d=data.frame(x=x,y=y)
disc_sample=d[(d$x^2+d$y^2)<1,]
plot(disc_sample)

I have two questions:

{Using the above code, logically, the size of d should be greater than the size of disc_sample but when I call both of them I see there are 100 elements in each one of them. How could this be possible. Why the sizes are the same.} THIS PART IS SOLVED, thanks to the comment below.

The question now

Also, how could I reformulate my code to give me the total number of samples needed to get 100 samples follow the condition. i.e to give me the number of samples rejected until I got the 100 needed sample?

Thanks to the answer of r2evans but I am looking to write something simpler, a while loop to store all possible samples inside a matrix or a data frame instead of a list then to call from that data frame just the samples follow the condition. I modified the code from the answer without the use of the lists and without sapply function but it is not giving the needed result, it yields only one row.

i=0
samps <- data.frame()
goods <- data.frame()
nr <- 0L
sampsize <- 100L
needs <- 100L
while (i < needs) {
  samps <- data.frame(x = runif(1, -1, 1), y = runif(1, -1, 1))
  goods <- samps[(samps$x^2+samps$y^2)<1, ]
i = i+1
}

and I also thought about this:

i=0
j=0
samps <- matrix()
goods <- matrix()
needs <- 100

while (j < needs) {
  samps[i,1] <- runif(1, -1, 1)
  samps[i,2] <- runif(1, -1, 1)
  if (( (samps[i,1])**2+(samps[i,2])**2)<1){
  goods[j,1] <- samps[i,1]
  goods[j,2] <- samps[i,2]
}
else{
i = i+1
}
}

but it is not working.

I would be very grateful for any help to modify the code.

638

asked Mar 03 '20 00:03

Sophie Allan

1 Answers

As to your second question ... you cannot reformulate your code to know precisely how many it will take to get (at least) 100 resulting combinations. You can use a while loop and concatenate results until you have at least 100 such rows, and then truncate those over 100. Because using entropy piecewise (at scale) is "expensive", you might prefer to always over-estimate the rows you need and grab all at once.

(Edited to reduce "complexity" based on homework constraints.)

set.seed(42)
samps <- vector(mode = "list")
goods <- vector(mode = "list")
nr <- 0L
iter <- 0L
sampsize <- 100L
needs <- 100L
while (nr < needs && iter < 50) {
  iter <- iter + 1L
  samps[[iter]] <- data.frame(x = runif(sampsize, -1, 1), y = runif(sampsize, -1, 1))
  rows <- (samps[[iter]]$x^2 + samps[[iter]]$y^2) < 1
  goods[[iter]] <- samps[[iter]][rows, ]
  nr <- nr + sum(rows)
}
iter                # number of times we looped
# [1] 2
out <- head(do.call(rbind, goods), n = 100)
NROW(out)
# [1] 100
head(out) ; tail(out)
#            x          y
# 1  0.8296121  0.2524907
# 3 -0.4277209 -0.5668654
# 4  0.6608953 -0.2221099
# 5  0.2834910  0.8849114
# 6  0.0381919  0.9252160
# 7  0.4731766  0.4797106
#               x          y
# 221 -0.65673577 -0.2124462
# 231  0.08606199 -0.7161822
# 251 -0.37263236  0.1296444
# 271 -0.38589120 -0.2831997
# 28  -0.62909284  0.6840144
# 301 -0.50865171  0.5014720

answered Oct 24 '22 09:10

r2evans

Related questions
                            
                                geom_rect missing when converting ggplot2 to ggplotly
                            
                                How to fuzzy join based on multiple columns and conditions?
                            
                                Escape readLines() function in RStudio
                            
                                How to create custom SQL functions with R code in dbplyr?
                            
                                Any workaround to find optimal threshold for filtering raw features based on correlation matrix in R?
                            
                                How to construct arguments for case_when from data frame?
                            
                                Using a token from `googleAuthR` in `googlesheets`
                            
                                How to use your own image for geom_point in gganimate?
                            
                                Why does as.character drop decimal point?
                            
                                How to run "conda ***" in a system command in R
                            
                                Changing behavior for closure stored in data.table between R 3.4.3 and R 3.6.0
                            
                                Is there a way in R to detect how often a package is used?
                            
                                How to add outer track for circlize plot
                            
                                Fitting a local level Poisson (State Space Model)
                            
                                Unable to install various R packages probably after upgrading to Mac OS Catalina
                            
                                RSelenium concurrent users for multiple scenarios on Shiny
                            
                                Error: package or namespace load failed for ‘data.table’ in library.dynam(lib, package, package.lib): shared object ‘datatable.so’ not found
                            
                                Tidying financial data with mixed decimal and grouping digits
                            
                                How to read data from google drive using R in colab?
                            
                                Is there any explicit guarantee that dplyr operations preserve row order?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Comparison of two vectors resulted after simulation

Tags:

r

probability

simulation

sampling

Sophie Allan

People also ask

1 Answers

r2evans

Recent Activity

Donate For Us