I'm aware of <code>DataFrame.sample()</code>, but how can I do this and also remove the sample from the dataset? (Note: AFAIK this has nothing to do with sampling with replacement) For example here is the essence of what I want to achieve, this does not actually work: <pre class="prettyprint"><code>len(df) # 1000 df_subset = df.sample(300) len(df_subset) # 300 df = df.remove(df_subset) len(df) # 700 </code></pre>

If your index is unique <pre class="prettyprint"><code>df = df.drop(df_subset.index) </code></pre> <hr> example <pre class="prettyprint"><code>df = pd.DataFrame(np.arange(10).reshape(-1, 2)) </code></pre> <hr> sample <pre class="prettyprint"><code>df_subset = df.sample(2) df_subset </code></pre> <img src="https://i.stack.imgur.com/9iD0E.png" alt="enter image description here"> <hr> drop <pre class="prettyprint"><code>df.drop(df_subset.index) </code></pre> <img src="https://i.stack.imgur.com/4TU49.png" alt="enter image description here">

Pandas random sample with remove

Tags:

python

pandas

I'm aware of DataFrame.sample(), but how can I do this and also remove the sample from the dataset? (Note: AFAIK this has nothing to do with sampling with replacement)

For example here is the essence of what I want to achieve, this does not actually work:

len(df) # 1000

df_subset = df.sample(300)
len(df_subset) # 300

df = df.remove(df_subset)
len(df) # 700

885

asked Oct 03 '16 15:10

JakeCowton

1 Answers

If your index is unique

df = df.drop(df_subset.index)

example

df = pd.DataFrame(np.arange(10).reshape(-1, 2))

sample

df_subset = df.sample(2)
df_subset

enter image description here

drop

df.drop(df_subset.index)

enter image description here

126

answered Sep 19 '22 14:09

piRSquared

Related questions
                            
                                Selenium / Python - Selecting via css selector
                            
                                Empty list returned from ElementTree findall
                            
                                Redirect print to string list?
                            
                                How to change my django server time
                            
                                Integration of python in C# Application
                            
                                Python built-in sum function vs. for loop performance
                            
                                PyQt5: Keyboard shortcuts w/ QAction
                            
                                How to label and change the scale of Seaborn kdeplot's axes
                            
                                speech recognition python code not working
                            
                                Python HTML Encoding \xc2\xa0
                            
                                Replace all matches using re.findall()
                            
                                Python List object attribute 'append' is read-only
                            
                                Mock open() function used in a class method
                            
                                How to use pyinstaller?
                            
                                Python's json.load(sys.stdin) gets me u'...' instead of double quotes around Strings
                            
                                Why is a `for` over a Python list faster than over a Numpy array?
                            
                                Django annotate() error AttributeError: 'CharField' object has no attribute 'resolve_expression'
                            
                                Deprecated rolling window option in OLS from Pandas to Statsmodels
                            
                                Weighted correlation coefficient with pandas
                            
                                How to get odds-ratios and other related features with scikit-learn

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With