Delete rows in R if a cell contains a value larger than x

Q: How do I remove a row based on a cell value in R?

If you prefer the tidyverse, dplyr's filter() function keeps only the rows that satisfy a condition on one or more columns, much like subset() in base R. To remove rows, state the condition for the rows you want to keep. dplyr's slice() function selects or drops rows by position (index) instead of by value.
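As a minimal sketch of the point above, the example below uses the data frame from the question and keeps rows where both b and c are at most 7. The names kept_base, kept_dplyr and kept_slice are illustrative only. The dplyr part is guarded and runs only if the package happens to be installed.

```r
a <- c(3, 6, 99, 7, 8, 9)
b <- c(99, 6, 3, 4, 5, 6)
c <- c(2, 5, 6, 7, 8, 3)
df <- data.frame(a, b, c)

# Base R: keep rows where both b and c are at most 7.
kept_base <- subset(df, b <= 7 & c <= 7)

# dplyr equivalents, run only if the package is available.
if (requireNamespace("dplyr", quietly = TRUE)) {
  # filter() keeps rows for which every condition is TRUE.
  kept_dplyr <- dplyr::filter(df, b <= 7, c <= 7)
  # slice() works on positions: drop rows 1 and 5 by index.
  kept_slice <- dplyr::slice(df, -c(1, 5))
}
```

Both approaches express the condition for rows to keep, so "delete rows above 7" becomes "keep rows where the value is <= 7".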

Tags:

r

I want to delete all rows containing a value larger than 7 in a cell in an arbitrary column, either across all columns or across specific columns.

a <- c(3,6,99,7,8,9)
b <- c(99,6,3,4,5,6)
c <- c(2,5,6,7,8,3)
df <- data.frame(a, b, c)

   a  b c
1  3 99 2
2  6  6 5
3 99  3 6
4  7  4 7
5  8  5 8
6  9  6 3

V1: I want to delete all rows containing values larger than 7, regardless of the column.

# result V1
   a  b c
2  6  6 5
4  7  4 7

V2: I want to delete all rows containing values larger than 7 in column b or c.

# result V2
   a  b c
2  6  6 5
3 99  3 6
4  7  4 7
6  9  6 3

There are plenty of similar questions on Stack Overflow, but I couldn't find a solution to this problem. So far I can only find rows that include 7, using res <- df[rowSums(df != 7) < ncol(df), ].

asked Apr 03 '15 08:04

rmuc8

1 Answer

rowSums of the logical matrix df > 7 gives the number of TRUE values in each row, which is 0 when no cell in that row exceeds 7. Negating the result coerces it to logical, so 0 becomes TRUE and every nonzero count becomes FALSE. The resulting logical vector can be used for subsetting.

df[!rowSums(df > 7), ]
#  a b c
#2 6 6 5
#4 7 4 7
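To make the steps above easier to follow, here is a sketch that splits the one-liner into its intermediate values. The variable names over, n_over and keep are illustrative and not part of the original answer. Comparing the count with 0 is equivalent to negating it, but states the intent more explicitly.

```r
a <- c(3, 6, 99, 7, 8, 9)
b <- c(99, 6, 3, 4, 5, 6)
c <- c(2, 5, 6, 7, 8, 3)
df <- data.frame(a, b, c)

over <- df > 7            # logical matrix, TRUE where a cell exceeds 7
n_over <- rowSums(over)   # number of offending cells per row: 1 0 1 0 2 1
keep <- n_over == 0       # same rows as !n_over

result_v1 <- df[keep, ]   # rows 2 and 4 remain
```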

For V2, we use the same principle, except that the logical matrix is built from a subset of df: df[-1] drops the first column, so only the second and third columns (b and c) are checked.

df[!rowSums(df[-1] > 7), ]
#   a b c
#2  6 6 5
#3 99 3 6
#4  7 4 7
#6  9 6 3
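Selecting columns by position with df[-1] depends on the column order. One possible variation, sketched below, selects the columns by name and wraps both cases in a small function. The helper drop_rows_above and its arguments are hypothetical, introduced here only for illustration.

```r
a <- c(3, 6, 99, 7, 8, 9)
b <- c(99, 6, 3, 4, 5, 6)
c <- c(2, 5, 6, 7, 8, 3)
df <- data.frame(a, b, c)

# Hypothetical helper: drop rows where any of `cols` exceeds `limit`.
# By default every column is checked.
drop_rows_above <- function(data, limit, cols = names(data)) {
  # drop = FALSE keeps a data frame even when a single column is selected
  over <- data[, cols, drop = FALSE] > limit
  data[rowSums(over) == 0, , drop = FALSE]
}

result_v1 <- drop_rows_above(df, 7)                       # all columns
result_v2 <- drop_rows_above(df, 7, cols = c("b", "c"))   # only b and c
```

With the sample data, result_v1 keeps rows 2 and 4, and result_v2 keeps rows 2, 3, 4 and 6, matching the outputs shown above.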

answered Oct 23 '22 14:10

akrun
