I want to generate 10000 integer random numbers between 0 and 10^12. Usually, the code would look like this: <pre class="prettyprint"><code>x <- sample(0:1000000000000,10000,replace=T) </code></pre> But I get following error message: <pre class="prettyprint"><code>Error in 0:1000000000000 : result would be too long a vector </code></pre> Is there a more memory efficient method that doesn't have to put 10^12 integers in a vector just to get a sample of size 10000? If not, is there a way to increase the max size of the vector? I'm working on a 64bit OS with 12GB of free RAM.

The real problem lies in the fact that you cannot store the sequence of <code>0:10^12</code> into memory. By just defining 0 and 10^12 as boundaries of a uniform distribution, you could get what you seek: <pre class="prettyprint"><code>runif(10000, 0, 10^12) [1] 136086417828 280099797063 747063538991 250189170474 589044594904 [6] 65385828028 361086657969 186271687970 338900779840 649082854623 ........ </code></pre> This will draw from the uniform distribution (with replacement, though I doubt that matters). However, what you cannot see is that these are actually floating numbers. You can use <code>ceiling</code> to round them up: <pre class="prettyprint"><code>samp = runif(1, 0, 10^12) samp [1] 19199806033 samp == 19199806033 [1] FALSE ceiling(samp) == 19199806033 [1] TRUE </code></pre> So the full code would be: <pre class="prettyprint"><code>ceiling(runif(10000, 0, 10^12)) </code></pre> Further nitpicking: Note that this technically will not allow 0 to be there (since 0.0001 would be rounded up), so you could just draw from <pre class="prettyprint"><code>ceiling(runif(10000, -1, 10^12)) </code></pre> As Carl Witthoft mentions, numbers that do not fit into the size of an integer will not be integers obviously, so you cannot count on these numbers to be integers. You can still count on them to evaluate to <code>TRUE</code> when compared to the same floating number without decimals though.

Generate integer random numbers from range (0:10^12)

Tags:

I want to generate 10000 integer random numbers between 0 and 10^12. Usually, the code would look like this:

x <- sample(0:1000000000000,10000,replace=T)

But I get following error message:

Error in 0:1000000000000 : result would be too long a vector

Is there a more memory efficient method that doesn't have to put 10^12 integers in a vector just to get a sample of size 10000? If not, is there a way to increase the max size of the vector? I'm working on a 64bit OS with 12GB of free RAM.

301

asked Nov 07 '13 12:11

user1775213

1 Answers

The real problem lies in the fact that you cannot store the sequence of 0:10^12 into memory. By just defining 0 and 10^12 as boundaries of a uniform distribution, you could get what you seek:

runif(10000, 0, 10^12) [1] 136086417828 280099797063 747063538991 250189170474 589044594904 [6]  65385828028 361086657969 186271687970 338900779840 649082854623  ........

This will draw from the uniform distribution (with replacement, though I doubt that matters).

However, what you cannot see is that these are actually floating numbers.

You can use ceiling to round them up:

samp = runif(1, 0, 10^12) samp [1] 19199806033 samp == 19199806033 [1] FALSE ceiling(samp) == 19199806033 [1] TRUE

So the full code would be:

ceiling(runif(10000, 0, 10^12))

Further nitpicking:

Note that this technically will not allow 0 to be there (since 0.0001 would be rounded up), so you could just draw from

ceiling(runif(10000, -1, 10^12))

As Carl Witthoft mentions, numbers that do not fit into the size of an integer will not be integers obviously, so you cannot count on these numbers to be integers. You can still count on them to evaluate to TRUE when compared to the same floating number without decimals though.

answered Nov 03 '22 20:11

PascalVKooten

Related questions
                            
                                How to overcome an HTMLUnit ScriptException?
                            
                                How to pass parameters to mysql query callback in nodejs
                            
                                Check Wordpress Version in a mysql database
                            
                                In meteor is there a way to access array index in spacebars [duplicate]
                            
                                Adding method name in NLog
                            
                                get the last modification data of a file in git repo
                            
                                How to sort 2D array (numpy.ndarray) based to the second column in python? [duplicate]
                            
                                How to "zip" two arrays into hash
                            
                                C++ Vector initial capacity
                            
                                React: Retrieve dynamic child key upon event
                            
                                How can I promisify the MongoDB native Javascript driver using bluebird?
                            
                                Flatten a JSON document using jq

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With