I am not sure how to loop over each column to replace the NA values with the column mean. When I am trying to replace for one column using the following, it works well. <pre class="prettyprint"><code>Column1[is.na(Column1)] <- round(mean(Column1, na.rm = TRUE)) </code></pre> The code for looping over columns is not working: <pre class="prettyprint"><code>for(i in 1:ncol(data)){ data[i][is.na(data[i])] <- round(mean(data[i], na.rm = TRUE)) } </code></pre> the values are not replaced. Can someone please help me with this?

A relatively simple modification of your code should solve the issue: <pre class="prettyprint"><code>for(i in 1:ncol(data)){ data[is.na(data[,i]), i] <- mean(data[,i], na.rm = TRUE) } </code></pre>

Replace missing values with column mean

Tags:

r

missing-data

imputation

I am not sure how to loop over each column to replace the NA values with the column mean. When I am trying to replace for one column using the following, it works well.

Column1[is.na(Column1)] <- round(mean(Column1, na.rm = TRUE))

The code for looping over columns is not working:

for(i in 1:ncol(data)){     data[i][is.na(data[i])] <- round(mean(data[i], na.rm = TRUE)) }

the values are not replaced. Can someone please help me with this?

685

asked Sep 14 '14 16:09

Nikita

1 Answers

A relatively simple modification of your code should solve the issue:

for(i in 1:ncol(data)){   data[is.na(data[,i]), i] <- mean(data[,i], na.rm = TRUE) }

173

answered Sep 19 '22 13:09

Thomas

Related questions
                            
                                Why TRUE == "TRUE" is TRUE in R?
                            
                                ggplot legends - change labels, order and title
                            
                                Test for numeric elements in a character string
                            
                                R shiny: display "loading..." message while function is running
                            
                                R strsplit with multiple unordered split arguments?
                            
                                Compare if two dataframe objects in R are equal?
                            
                                Essential skills of a Data Scientist [closed]
                            
                                Rescaling the y axis in bar plot causes bars to disappear : R ggplot2
                            
                                Forcing R output to be scientific notation with at most two decimals
                            
                                Sort a data.table fast by Ascending/Descending order
                            
                                backtransform `scale()` for plotting
                            
                                What techniques exists in R to visualize a "distance matrix"?
                            
                                Initializing data.frames() [duplicate]
                            
                                How to add variable key/value pair to list object?
                            
                                Replacing values from a column using a condition in R
                            
                                What does the R function `poly` really do?
                            
                                Force ggplot2 scatter plot to be square shaped
                            
                                Convert a numeric month to a month abbreviation
                            
                                Inserting an image to ggplot2
                            
                                How to subtract years?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With