I am trying to find the percentage of NAs in columns as well as inside the whole dataframe: The first method which I have commented gives me zero and the second method which is not commented gives me a matrix. Not sure what I am missing. Any hint is truly appreciated! <pre class="prettyprint"><code>cp.2006<-read.csv(file="cp2006.csv",head=TRUE) #countNAs <- function(x) { # sum(is.na(x)) #} #total=0 #for (i in col(cp.2006)) { # total=countNAs(i)+total #} #print(total) count<-apply(cp.2006, 1, function(x) sum(is.na(x))) dims<-dim(cp.2006) num<-dims[1]*dims[2] NApercentage<-(count/num) * 100 print(NApercentage) </code></pre>

<pre class="prettyprint"><code>x = data.frame(x = c(1, 2, NA, 3), y = c(NA, NA, 4, 5)) </code></pre> For the whole dataframe: <pre class="prettyprint"><code>sum(is.na(x))/prod(dim(x)) </code></pre> Or <pre class="prettyprint"><code>mean(is.na(x)) </code></pre> For columns: <pre class="prettyprint"><code>apply(x, 2, function(col)sum(is.na(col))/length(col)) </code></pre> Or <pre class="prettyprint"><code>colMeans(is.na(x)) </code></pre>

How to find the percentage of NAs in a data.frame?

Tags:

dataframe

r

csv

na

I am trying to find the percentage of NAs in columns as well as inside the whole dataframe:

The first method which I have commented gives me zero and the second method which is not commented gives me a matrix. Not sure what I am missing. Any hint is truly appreciated!

cp.2006<-read.csv(file="cp2006.csv",head=TRUE)

#countNAs <- function(x) { 
#  sum(is.na(x)) 
#} 
#total=0
#for (i in col(cp.2006)) {
#  total=countNAs(i)+total
#}
#print(total)
count<-apply(cp.2006, 1, function(x) sum(is.na(x)))
dims<-dim(cp.2006)
num<-dims[1]*dims[2]
NApercentage<-(count/num) * 100
print(NApercentage)

655

asked May 11 '14 19:05

Mona Jalal

1 Answers

x = data.frame(x = c(1, 2, NA, 3), y = c(NA, NA, 4, 5))

For the whole dataframe:

sum(is.na(x))/prod(dim(x))

mean(is.na(x))

For columns:

apply(x, 2, function(col)sum(is.na(col))/length(col))

colMeans(is.na(x))

answered Oct 02 '22 14:10

Fernando

Related questions
                            
                                match two columns with two other columns
                            
                                Remove rows in dataframe with factor ""
                            
                                R can't convert NaN to NA
                            
                                Converting a factor with 2 levels to binary values 0/1 in R [closed]
                            
                                R list get first item of each element
                            
                                Calculate Percentage Change in R using dplyr
                            
                                How to name the list of the group_split output in dplyr
                            
                                How can I revise my code to improve my processing speed
                            
                                Replace all values in a data.table given a condition
                            
                                removing a list of columns from a data.frame using subset [duplicate]
                            
                                How to save a graph as an a4 size pdf file under windows system? (R; ggplot2)
                            
                                R: adding 1 month to a date
                            
                                How to delete groups containing less than 3 rows of data in R? [duplicate]
                            
                                how insert zeros in seq in R
                            
                                Reduced row echelon form
                            
                                algorithm to round to the next order of magnitude in R
                            
                                How to overlay a line for an lm object on a ggplot2 scatterplot
                            
                                How to sort a matrix by all columns
                            
                                How to convert a vector of strings to Title Case
                            
                                Error in RShiny ui.r argument missing [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With