I have a dataframe and list of columns in that dataframe that I'd like to drop. Let's use the <code>iris</code> dataset as an example. I'd like to drop <code>Sepal.Length</code> and <code>Sepal.Width</code> and use only the remaining columns. How do I do this using <code>select</code> or <code>select_</code> from the <code>dplyr</code> package? Here's what I've tried so far: <pre class="prettyprint"><code>drop.cols <- c('Sepal.Length', 'Sepal.Width') iris %>% select(-drop.cols) </code></pre> <blockquote> Error in -drop.cols : invalid argument to unary operator </blockquote> <pre class="prettyprint"><code>iris %>% select_(.dots = -drop.cols) </code></pre> <blockquote> Error in -drop.cols : invalid argument to unary operator </blockquote> <pre class="prettyprint"><code>iris %>% select(!drop.cols) </code></pre> <blockquote> Error in !drop.cols : invalid argument type </blockquote> <pre class="prettyprint"><code>iris %>% select_(.dots = !drop.cols) </code></pre> <blockquote> Error in !drop.cols : invalid argument type </blockquote> I feel like I'm missing something obvious because these seems like a pretty useful operation that should already exist. On Github, someone posted a similar issue, and Hadley said to use 'negative indexing'. That's what (I think) I've tried, but to no avail. Any suggestions?

Check the help on select_vars. That gives you some extra ideas on how to work with this. In your case: <pre class="prettyprint"><code>iris %>% select(-one_of(drop.cols)) </code></pre>

also try <pre class="prettyprint"><code>## Notice the lack of quotes iris %>% select (-c(Sepal.Length, Sepal.Width)) </code></pre>

Beyond <code>select(-one_of(drop.cols))</code> there are a couple other options for dropping columns using <code>select()</code> that do not involve defining all the specific column names (using the dplyr starwars sample data for some more variety in column names): <pre class="prettyprint"><code>starwars %>% select(-(name:mass)) %>% # the range of columns from 'name' to 'mass' select(-contains('color')) %>% # any column name that contains 'color' select(-starts_with('bi')) %>% # any column name that starts with 'bi' select(-ends_with('er')) %>% # any column name that ends with 'er' select(-matches('^f.+s$')) %>% # any column name matching the regex pattern select_if(~!is.list(.)) %>% # not by column name but by data type head(2) # A tibble: 2 x 2 homeworld species <chr> <chr> 1 Tatooine Human 2 Tatooine Droid </code></pre>

R dplyr: Drop multiple columns

Tags:

r

dplyr

I have a dataframe and list of columns in that dataframe that I'd like to drop. Let's use the iris dataset as an example. I'd like to drop Sepal.Length and Sepal.Width and use only the remaining columns. How do I do this using select or select_ from the dplyr package?

Here's what I've tried so far:

drop.cols <- c('Sepal.Length', 'Sepal.Width')
iris %>% select(-drop.cols)

Error in -drop.cols : invalid argument to unary operator

iris %>% select_(.dots = -drop.cols)

Error in -drop.cols : invalid argument to unary operator

iris %>% select(!drop.cols)

Error in !drop.cols : invalid argument type

iris %>% select_(.dots = !drop.cols)

Error in !drop.cols : invalid argument type

I feel like I'm missing something obvious because these seems like a pretty useful operation that should already exist. On Github, someone posted a similar issue, and Hadley said to use 'negative indexing'. That's what (I think) I've tried, but to no avail. Any suggestions?

516

asked Mar 07 '16 08:03

Navaneethan Santhanam

3 Answers

Check the help on select_vars. That gives you some extra ideas on how to work with this.

In your case:

iris %>% select(-one_of(drop.cols))

200

answered Oct 24 '22 00:10

phiver

also try

## Notice the lack of quotes
iris %>% select (-c(Sepal.Length, Sepal.Width))

answered Oct 24 '22 00:10

Miguel Rayon Gonzalez

Beyond select(-one_of(drop.cols)) there are a couple other options for dropping columns using select() that do not involve defining all the specific column names (using the dplyr starwars sample data for some more variety in column names):

starwars %>% 
  select(-(name:mass)) %>%        # the range of columns from 'name' to 'mass'
  select(-contains('color')) %>%  # any column name that contains 'color'
  select(-starts_with('bi')) %>%  # any column name that starts with 'bi'
  select(-ends_with('er')) %>%    # any column name that ends with 'er'
  select(-matches('^f.+s$')) %>%  # any column name matching the regex pattern
  select_if(~!is.list(.)) %>%     # not by column name but by data type
  head(2)

# A tibble: 2 x 2
homeworld species
  <chr>     <chr>  
1 Tatooine  Human  
2 Tatooine  Droid

answered Oct 24 '22 01:10

sbha

Related questions
                            
                                Define all functions in one .R file, call them from another .R file. How, if possible?
                            
                                Comma separator for numbers in R?
                            
                                List distinct values in a vector in R
                            
                                The cause of "bad magic number" error when loading a workspace and how to avoid it?
                            
                                R programming: How do I get Euler's number?
                            
                                Left align two graph edges (ggplot)
                            
                                Paste multiple columns together
                            
                                How to randomize (or permute) a dataframe rowwise and columnwise?
                            
                                Subscripts in plots in R
                            
                                How to remove outliers from a dataset
                            
                                dplyr summarise: Equivalent of ".drop=FALSE" to keep groups with zero length in output
                            
                                Relationship between R Markdown, Knitr, Pandoc, and Bookdown
                            
                                How to put labels over geom_bar for each bar in R with ggplot2
                            
                                How to change the default font size in ggplot2
                            
                                How can I manipulate the strip text of facet_grid plots?
                            
                                R for loop skip to next iteration ifelse
                            
                                R: Comment out block of code [duplicate]
                            
                                How to parse XML to R data frame
                            
                                How to change 'Maximum upload size exceeded' restriction in Shiny and save user file inputs?
                            
                                How to not run an example using roxygen2?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With