How to remove NA data in only one columns?

Q: How do I remove Na from data in R?

The na. omit() function returns a list without any rows that contain na values. It will drop rows with na value / nan values. This is the fastest way to remove na rows in the R programming language.

Q: What is the function to remove Na values from the data frames?

DataFrame-dropna() function The dropna() function is used to remove missing values. Determine if rows or columns which contain missing values are removed. 0, or 'index' : Drop rows which contain missing values. 1, or 'columns' : Drop columns which contain missing value.

Tags:

r

I have a file that looks like so:

date       A  B
2014-01-01 2  3
2014-01-02 5  NA
2014-01-03 NA NA
2014-01-04 7  11

If I use newdata <- na.omit(data) where data is the above table loaded via R, then I get only two data points. I get that since it will filter all instances of NA. What I want to do is to filter for each A and B so that I get three data points for A and only two for B. Clearly, my main data set is much larger than that and the numbers are different but neither should not matter.

How can I achieve that?

425

asked Jan 07 '14 17:01

Sardathrion - against SE abuse

1 Answers

Use is.na() on the relevant vector of data you wish to look for and index using the negated result. For exmaple:

R> data[!is.na(data$A), ]
        date A  B
1 2014-01-01 2  3
2 2014-01-02 5 NA
4 2014-01-04 7 11
R> data[!is.na(data$B), ]
        date A  B
1 2014-01-01 2  3
4 2014-01-04 7 11

is.na() returns TRUE for every element that is NA and FALSE otherwise. To index the rows of the data frame, we can use this logical vector, but we want its converse. Hence we use ! to imply the opposite (TRUE becomes FALSE and vice versa).

You can restrict which columns you return by adding an index for the columns after the , in [ , ], e.g.

R> data[!is.na(data$A), 1:2]
        date A
1 2014-01-01 2
2 2014-01-02 5
4 2014-01-04 7

answered Nov 09 '22 12:11

Gavin Simpson

Related questions
                            
                                How to create a time scatterplot with R?
                            
                                rworldmap package countrylist
                            
                                NA values in Rcpp conditional
                            
                                Element-wise max operation on sparse matrices in R
                            
                                Finding out which functions are called within a given function [duplicate]
                            
                                How to add abline with lattice xyplot function?
                            
                                Why doesn't lazy evaluation work in this R function? [duplicate]
                            
                                How to determine the geom type of each layer of a ggplot2 object?
                            
                                Root mean square error in R - mixed effect model
                            
                                How to change position of grid.draw
                            
                                re- installing R linux ubuntu: unmet dependencies R
                            
                                Replacing the "print" function in knitr chunk evaluation
                            
                                How to reference column names that start with a number, in data.table
                            
                                How can I change the color of the header in a xyplot?
                            
                                stacking columns into 1 column in R [duplicate]
                            
                                How to get number of rows for a specific value in a column
                            
                                Setting size of the rgl device
                            
                                How to remove a character in a variable of string type in R
                            
                                Is there maximum number of characters permissible in rownames or colnames in R?
                            
                                How to 'subset' a named vector in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With