Python random sample of two arrays, but matching indices

Tags:

I have two numpy arrays x and y, which have length 10,000. I would like to plot a random subset of 1,000 entries of both x and y. Is there an easy way to use the lovely, compact random.sample(population, k) on both x and y to select the same corresponding indices? (The y and x vectors are linked by a function y(x) say.)

Thanks.

992

asked Oct 21 '13 03:10

Cokes

4 Answers

You can use np.random.choice on an index array and apply it to both arrays:

idx = np.random.choice(np.arange(len(x)), 1000, replace=False)
x_sample = x[idx]
y_sample = y[idx]

124

answered Oct 20 '22 02:10

Jaime

Just zip the two together and use that as the population:

import random

random.sample(zip(xs,ys), 1000)

The result will be 1000 pairs (2-tuples) of corresponding entries from xs and ys.

answered Oct 20 '22 01:10

DaoWen

After test numpy.random.choice solution, I found out it was very slow for larger array.

numpy.random.randint should be much faster

example

x = np.arange(1e8)
y = np.arange(1e8)
idx = np.random.randint(0, x.shape[0], 10000)
return x[idx], y[idx]

answered Oct 20 '22 01:10

StoneLin

Using the numpy.random.randint function, you generate a list of random numbers, meaning that you can select certain datapoints twice.

answered Oct 20 '22 01:10

bananenpampe

Related questions
                            
                                Python OpenCV cv2 drawing rectangle with text
                            
                                Equivalent of j in NumPy
                            
                                Testing socket connection in Python
                            
                                How to compute the nth root of a very big integer
                            
                                Best way to determine if a sequence is in another sequence?
                            
                                Python urllib2 Progress Hook
                            
                                Python: Why does os.getcwd() sometimes crash with OSError?
                            
                                Convert image to a matrix in python
                            
                                Move an email in GMail with Python and imaplib
                            
                                `goto` in Python
                            
                                Matplotlib table formatting
                            
                                What does "\r" do in the following script?
                            
                                Python- how do I use re to match a whole string [duplicate]
                            
                                How to check for mock calls with wildcards?
                            
                                performing outer addition with numpy
                            
                                python refresh/reload
                            
                                Linewidth is added to the length of a line
                            
                                Compare only time part in datetime - Python
                            
                                What's the error of numpy.polyfit?
                            
                                How do I check for an EXACT word in a string in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python random sample of two arrays, but matching indices

Tags:

python

random

numpy

Cokes

People also ask

4 Answers

Jaime

DaoWen

StoneLin

bananenpampe

Recent Activity

Donate For Us