Return FALSE for duplicated NA values when using the function duplicated()

Tags:

just wondering why duplicated behaves the way it does with NAs:

> duplicated(c(NA,NA,NA,1,2,2))
[1] FALSE  TRUE  TRUE FALSE FALSE  TRUE

where in fact

> NA == NA
[1] NA

is there a way to achieve that duplicated marks NAs as false, like this?

> duplicated(c(NA,NA,NA,1,2,2))
[1] FALSE  FALSE  FALSE FALSE FALSE  TRUE

577

asked Nov 27 '12 11:11

jamborta

1 Answers

You use the argument incomparables for the function duplicated like this :

> duplicated(c(NA,NA,NA,1,2,2))
[1] FALSE  TRUE  TRUE FALSE FALSE  TRUE
> duplicated(c(NA,NA,NA,1,2,2),incomparables=NA)
[1] FALSE FALSE FALSE FALSE FALSE  TRUE

It determines the values that cannot be compared (in this case NA) and returns FALSE for those values. See also ?duplicated

answered Nov 17 '22 22:11

Joris Meys

Related questions
                            
                                Adding 15 business days in lubridate
                            
                                Assign color to 2 different geoms and get 2 different legends
                            
                                Fill NA values with the trailing row value times a growth rate?
                            
                                R - Set execution time limit in loop
                            
                                Convert negative values to zero in dataframe in R
                            
                                Caret package findCorrelation() function
                            
                                Replace values in list
                            
                                Find string in data.frame
                            
                                ggplot2 remove axis label
                            
                                updateSelectinput throws a session not found error
                            
                                Visualizing two or more data points where they overlap (ggplot R)
                            
                                Dividing each cell in a data set by the column sum in R
                            
                                Use broom and tidyverse to run regressions on different dependent variables
                            
                                Accessing YAML parameters as macros within external LaTeX files
                            
                                Subsetting in R using OR condition with strings
                            
                                Margin adjustments when using ggplot's geom_tile()
                            
                                Numpy for R user?
                            
                                How to import CSV into sqlite using RSqlite?
                            
                                In R linear model, get p-values for only the interaction coefficients
                            
                                ggplot2 - is there a way to override global aesthetic mappings while reusing geom layers

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Return FALSE for duplicated NA values when using the function duplicated()

Tags:

comparison

r

duplicates

missing-data

jamborta

People also ask

1 Answers

Joris Meys

Recent Activity

Donate For Us