How to select rows in a data.frame without NA values [closed]

Tags:

I have a data frame called data. I want to create a function f(data, collist). This function takes data and a list of columns from data itself, and returns only those rows from data, for which the mentioned column names in collist are not NA. I know it can be done using for loop, but I want to do it without using for loop.

Also, please let me know if it is generally more efficient in R to avoid loops.

Here is an example:

 A   B   C   D
 1   2  NA  NA
 2  NA  NA  NA
NA   3   7   5
NA   4   2  NA
 5   6  NA  NA

If collist contains B and C, then a reduced data frame with row number 1,3,4 would be returned. The reason being either B or C or both has NA in rows 2 and 5. I want a function, because I will be using this operation quite a number of times. Through this question, I will learn some new R tricks, as well as, make my whole program more elegant. Thanks.

808

asked Nov 08 '13 17:11

Sumit

1 Answers

It sounds like you are just looking for complete.cases. Here's an example:

#### SAMPLE DATA

set.seed(1)
m <- matrix(rnorm(20), 5)
m[sample(length(m), 7)] <- NA
mydf <- data.frame(m)
mydf
#           X1         X2        X3          X4
# 1         NA -0.8204684  1.511781 -0.04493361
# 2  0.1836433  0.4874291        NA          NA
# 3 -0.8356286  0.7383247        NA  0.94383621
# 4  1.5952808         NA -2.214700  0.82122120
# 5  0.3295078         NA        NA  0.59390132

#### SAMPLE EXTRACTION

collist <- c("X1", "X2")
mydf[complete.cases(mydf[collist]), collist]
#           X1        X2
# 2  0.1836433 0.4874291
# 3 -0.8356286 0.7383247

120

answered Oct 13 '22 11:10

A5C1D2H2I1M1N2O1R2T1

Related questions
                            
                                Constrained Newton-Raphson estimation
                            
                                R: Format data frame summary
                            
                                Stratified sampling with Random Forests in R
                            
                                Sample a random integer in Rcpp
                            
                                Seeking an better way to add columns in data.table from lookup table
                            
                                Using strptime %z with special timezone format
                            
                                Operator overloading for functions in R - strange behavior
                            
                                How to return values from gWidgets and handlers?
                            
                                Create a C-level file handle in RCurl for writing downloaded files
                            
                                Can we do binary search in data.table with OR select queries
                            
                                Updated world map for R "maps" package?
                            
                                Feeding data frame columns to xyplot panel functions
                            
                                Include pre-defined variable using inline code with knitr
                            
                                data.table bug, causing a segfault in R
                            
                                Axis labels for each bar and each group in bar charts with dodged groups
                            
                                Converting a column of type 'list' to multiple columns in a data frame
                            
                                Geographical borders incomplete using geom_polygon for plotting map - ggplot2
                            
                                Efficiently computing a linear combination of data.table columns
                            
                                Create a data frame using text input in Shiny
                            
                                I am trying to Make RDotNet work with C#, and I am encountering problems

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to select rows in a data.frame without NA values [closed]

Tags:

dataframe

r

Sumit

People also ask

1 Answers

A5C1D2H2I1M1N2O1R2T1

Recent Activity

Donate For Us