I have a 114 row by 16 column data frame where the rows are individuals, and the columns are either their names or NA. For example, the first 3 rows looks like this: <pre class="prettyprint"><code> name name.1 name.2 name.3 name.4 name.5 name.6 name.7 name.8 name.9 name.10 name.11 name.12 name.13 name.14 name.15 1 <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA> Aanestad <NA> Aanestad <NA> Aanestad <NA> 2 <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA> Ackerman <NA> Ackerman <NA> Ackerman <NA> Ackerman <NA> 3 <NA> <NA> <NA> <NA> <NA> <NA> Alarcon <NA> Alarcon <NA> Alarcon <NA> Alarcon <NA> <NA> <NA> </code></pre> I want to generate a list (if multiple unique names per row) or vector (if only one unique name per row) of all the unique names, with length 114. When I try <code>apply(x,1,unique)</code> I get a 2xNcol array where sometimes the first row cell is NA and sometimes the second row cell is NA. <pre class="prettyprint"><code> [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [1,] NA NA NA NA "Alquist" NA "Ayala" NA NA [2,] "Aanestad" "Ackerman" "Alarcon" "Alpert" NA "Ashburn" NA "Baca" "Battin" </code></pre> When what I'd like is just: <pre class="prettyprint"><code>Aanestad Ackerman Alarcon ... </code></pre> I can't seem to figure out how to apply unique() while ignoring NA. na.rm, na.omit etc don't seem to work. I feel like I'm missing something real simple ... Thanks!

<code>unique</code> does not appear to have an <code>na.rm</code> argument, but you can remove the missing values yourself before calling it: <pre class="prettyprint"><code>A <- matrix(c(NA,"A","A", "B", NA, NA, NA, NA, "C"), nr=3, byrow=TRUE) apply(A, 1, function(x)unique(x[!is.na(x)])) </code></pre> gives <pre class="prettyprint"><code>[1] "A" "B" "C" </code></pre>

Handling NA values in apply and unique

Tags:

r

unique

apply

I have a 114 row by 16 column data frame where the rows are individuals, and the columns are either their names or NA. For example, the first 3 rows looks like this:

            name name.1      name.2 name.3       name.4 name.5       name.6 name.7       name.8 name.9       name.10 name.11       name.12 name.13        name.14 name.15
1           <NA>   <NA>        <NA>   <NA>         <NA>   <NA>         <NA>   <NA>         <NA>   <NA>      Aanestad    <NA>      Aanestad    <NA>       Aanestad    <NA>
2           <NA>   <NA>        <NA>   <NA>         <NA>   <NA>         <NA>   <NA>     Ackerman   <NA>      Ackerman    <NA>      Ackerman    <NA>       Ackerman    <NA>
3           <NA>   <NA>        <NA>   <NA>         <NA>   <NA>      Alarcon   <NA>      Alarcon   <NA>       Alarcon    <NA>       Alarcon    <NA>           <NA>    <NA>

I want to generate a list (if multiple unique names per row) or vector (if only one unique name per row) of all the unique names, with length 114.

When I try apply(x,1,unique) I get a 2xNcol array where sometimes the first row cell is NA and sometimes the second row cell is NA.

    [,1]       [,2]       [,3]      [,4]     [,5]      [,6]      [,7]    [,8]   [,9]    
[1,] NA         NA         NA        NA       "Alquist" NA        "Ayala" NA     NA      
[2,] "Aanestad" "Ackerman" "Alarcon" "Alpert" NA        "Ashburn" NA      "Baca" "Battin"

When what I'd like is just:

Aanestad
Ackerman
Alarcon
...

I can't seem to figure out how to apply unique() while ignoring NA. na.rm, na.omit etc don't seem to work. I feel like I'm missing something real simple ...

Thanks!

938

asked Feb 15 '10 21:02

bshor

1 Answers

unique does not appear to have an na.rm argument, but you can remove the missing values yourself before calling it:

A <- matrix(c(NA,"A","A",
             "B", NA, NA,
              NA, NA, "C"), nr=3, byrow=TRUE)
apply(A, 1, function(x)unique(x[!is.na(x)]))

gives

[1] "A" "B" "C"

198

answered Nov 07 '22 14:11

Aniko

Related questions
                            
                                How to use map from purrr with dplyr::mutate to create multiple new columns based on column pairs
                            
                                Converting between matrix subscripts and linear indices (like ind2sub/sub2ind in matlab)
                            
                                Challenge: recoding a data.frame() — make it faster
                            
                                Replace NaN values in a list with zero (0)
                            
                                Calculating length of 95%-CI using dplyr
                            
                                Color schemes in R?
                            
                                How to plot a subset of a data frame in R?
                            
                                Closing active connections using RMySQL
                            
                                Function to split a matrix into sub-matrices in R
                            
                                Faster ways to calculate frequencies and cast from long to wide
                            
                                Assignment in R language
                            
                                How to change the background color of the Shiny Dashboard Body
                            
                                add column values based on other columns in data frame using for and if
                            
                                Cannot build R package "png" Fedora 20
                            
                                Remove all text between two brackets
                            
                                sf: Write Lat/Long from geometry into separate column and keep ID column
                            
                                RStudio Shiny renderDataTable font size
                            
                                Is it possible to skip NA values in "+" operator?
                            
                                Efficient filtering through multiple columns by group
                            
                                R Error - cannot change value of locked binding for 'df'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With