Please advise how can I replace half of values in a column to NA: <pre class="prettyprint"><code># Generate 500 values with a skewed distribution x1 <- round(rbeta(500,0.5,3)*100,0) # Assign variable to a data frame df <- data.frame(x1) # Replace 250 random values in a column 'x1' to NA df[sample(x1,250)] <- NA The following mistake is shown: Error in `[<-.data.frame`(`*tmp*`, sample(x1, 250), value = NA) : new columns would leave holes after existing columns </code></pre> I understand why the mistake is shown, but I would like to force the replacement. Please advise on how can I do that.

It seems like you need <pre class="prettyprint"><code>df$x1[sample(nrow(df),250)] <- NA </code></pre>

Replace random values in a column in a dataframe

Tags:

replace

r

Please advise how can I replace half of values in a column to NA:

# Generate 500 values with a skewed distribution
x1 <- round(rbeta(500,0.5,3)*100,0)

# Assign variable to a data frame
df <- data.frame(x1)

# Replace 250 random values in a column 'x1' to NA
df[sample(x1,250)] <- NA

The following mistake is shown:
Error in `[<-.data.frame`(`*tmp*`, sample(x1, 250), value = NA) : 
  new columns would leave holes after existing columns

I understand why the mistake is shown, but I would like to force the replacement. Please advise on how can I do that.

500

asked Jun 09 '17 13:06

Gregory

1 Answers

It seems like you need

df$x1[sample(nrow(df),250)] <- NA

answered Sep 21 '22 18:09

G5W

Related questions
                            
                                Count unique values of a column by pairwise combinations of another column and group by third column in R
                            
                                Cross-referencing in rticles
                            
                                How do I get unique element from a vector, keeping its name? [duplicate]
                            
                                Read column names as date format
                            
                                How can I maintain a color scheme across ggplots, while dropping unused levels in each plot?
                            
                                How to increase the size of the text in a Bayesian network plot with bnlearn in R
                            
                                R dplyr method to replace all empty factors with NA
                            
                                Adding multiple reactive plots and tables to Shiny app
                            
                                Group by aggregate dynamic column name matching
                            
                                Refering to a variable of the data frame passed in the 'data' parameter of ggplot function
                            
                                Speed up INSERT of 1 million+ rows into Postgres via R using COPY?
                            
                                How to plot a function family in ggplot2
                            
                                Print label on circle markers in leaflet in Rshiny
                            
                                How to do group matching in R?
                            
                                Running a GLM with a Gamma distribution, but data includes zeros
                            
                                Concatenating strings using group_by and summarise in r [duplicate]
                            
                                Why set.seed() affects sample() in R
                            
                                How add named element to R vector with name from a variable
                            
                                devtools equivalent of RStudio Build panel buttons
                            
                                Recommended way for variable scoping [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With