How to drop columns by name pattern in R?

Tags:

r

I have this dataframe:

state county city region mmatrix X1 X2 X3 A1 A2 A3  B1    B2 B3 C1 C2 C3
1     1      1    1      111010  1  0  0  2  20 200 Push  8  12 NA NA NA
1     2      1    1      111010  1  0  0  4  NA 400 Shove 9  NA

Now I want to exclude columns whose names end with a certain string, say "1" (e.g. A1 and B1). I wrote this code:

df_redacted <- df[, -grep("\\1$", colnames(df))]

However, this seems to delete every column. How can I modify the code so that it only deletes the columns that match the pattern (i.e. end with "3" or any other string)?

The solution has to be able to handle a dataframe that has both numerical and categorical values.

asked Mar 27 '13 18:03

histelheim

1 Answer

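I found a simple answer using dplyr (part of the tidyverse). select(-contains("This")) drops every column whose name contains "This". To match only names that end with a given string, as in the question, ends_with() can be used in place of contains().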

library(dplyr)
df_new <- df %>% select(-contains("This"))
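For comparison, here is a minimal base R sketch of the same idea, using a logical mask from grepl() instead of negative indices from grep(). The toy data frame and the names ends_with_1, df_redacted, df_same and df_empty are made up for illustration and are not from the question. One possible reason the original attempt appeared to delete every column: when grep() finds no match it returns integer(0), and negative indexing with an empty vector selects no columns at all.

```r
# Toy data frame loosely modelled on the question (values are illustrative)
df <- data.frame(
  state = c(1, 1),
  A1 = c(2, 4),
  A2 = c(20, NA),
  B1 = c("Push", "Shove"),
  B2 = c(8, 9),
  stringsAsFactors = FALSE
)

# Logical mask: TRUE for column names that end with "1"
ends_with_1 <- grepl("1$", names(df))

# Keep the columns that do NOT match; drop = FALSE guarantees a data frame
df_redacted <- df[, !ends_with_1, drop = FALSE]
names(df_redacted)

# A pattern that matches nothing leaves the data frame untouched ...
df_same <- df[, !grepl("Z$", names(df)), drop = FALSE]

# ... whereas -grep() with no matches indexes with integer(0) and
# returns a data frame with zero columns
df_empty <- df[, -grep("Z$", names(df)), drop = FALSE]
```

Because the mask only looks at column names, it works the same way for numeric and character columns.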

answered Sep 29 '22 04:09

Samuel Saari
