R remove objects from a list with if else statement

Tags:

list

for-loop

r

if-statement

I have a list of data frames and would like to remove those with fewer than 2 rows from mylist:

a<-data.frame(x=c(1:4),y=c("m", "n", "o", "p"))
b<-data.frame(x=c(2:6),y=c("q", "w", "e", "r", "t"))
c<-data.frame(x=c(6,7),y=c("j","k"),z=c("$","#"))
d<-data.frame(x="9",y="q",z="+")
mylist<-list(a,b,c,d)

for (i in length(mylist)){
if (nrow(mylist[[i]])<=2){
mylist<-mylist[-i]
}
else{
mylist<-myslit
}}

However, it only seemed to remove data frame d. Data frame c is still in mylist after running the for loop.

asked Apr 23 '13 19:04

lamushidi

4 Answers

You can do this more easily using an apply loop:

row_lt2 <- which(sapply(mylist, nrow) < 2)
mylist[-row_lt2]
[[1]]
  x y
1 1 m
2 2 n
3 3 o
4 4 p

[[2]]
  x y
1 2 q
2 3 w
3 4 e
4 5 r
5 6 t

[[3]]
  x y z
1 6 j $
2 7 k #

Notice I use negative indexing to remove items instead of selecting them.
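
One caveat with negative indexing, sketched here under the assumption that the list might contain no short data frames at all: `which()` then returns `integer(0)`, and indexing with `-integer(0)` selects nothing rather than everything. Indexing with the logical vector directly avoids this. The names `all_long` and `keep` are illustrative only.

```r
# Hypothetical list in which every data frame has at least 2 rows
all_long <- list(data.frame(x = 1:3), data.frame(x = 1:2))

# which() finds no matches, so the index is integer(0)
row_lt2 <- which(sapply(all_long, nrow) < 2)
length(all_long[-row_lt2])   # 0: negative indexing with integer(0) drops everything

# A logical index keeps the elements that pass the test
keep <- vapply(all_long, nrow, integer(1)) >= 2
length(all_long[keep])       # 2
```

The logical form also reads as "keep what passes" rather than "drop what fails", which may be easier to follow.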

answered Sep 28 '22 07:09

Paul Hiemstra

To add to the other answers: this is exactly the type of thing the higher-order Filter function is made for:

> Filter(function(x) {nrow(x) >= 2}, mylist)
[[1]]
  x y
1 1 m
2 2 n
3 3 o
4 4 p

[[2]]
  x y
1 2 q
2 3 w
3 4 e
4 5 r
5 6 t

[[3]]
  x y z
1 6 j $
2 7 k #
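
As a small sketch of how `Filter` behaves with a reusable predicate, the following uses a hypothetical helper `has_min_rows` and a made-up list `frames`; neither is part of the original question.

```r
# Hypothetical helper: TRUE when a data frame has at least `n` rows
has_min_rows <- function(df, n = 2) nrow(df) >= n

frames <- list(
  data.frame(x = 1:4),
  data.frame(x = 6:7),
  data.frame(x = 9)
)

kept <- Filter(has_min_rows, frames)
length(kept)                                                   # 2

# Extra arguments can be fixed with an anonymous function
length(Filter(function(df) has_min_rows(df, n = 3), frames))   # 1

# When nothing matches, the result is an empty list, not an error
length(Filter(function(df) has_min_rows(df, n = 10), frames))  # 0
```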

answered Sep 28 '22 06:09

Jason Morgan

You can't do this with for because the indices change as you remove elements. With for, after removing element 2 you go on to examine element 3, but you need to examine element 2 again (because what is now element 2 is not the element that was there before). Change it to repeat or while.

a<-data.frame(x=c(1:4),y=c("m", "n", "o", "p"))
b<-data.frame(x=c(2:6),y=c("q", "w", "e", "r", "t"))
c<-data.frame(x=c(6,7),y=c("j","k"),z=c("$","#"))
d<-data.frame(x="9",y="q",z="+")
mylist<-list(a,b,c,d)

i <- 1
while (i <= length(mylist)) {
 if (nrow(mylist[[i]])<=2){
  mylist<-mylist[-i]
 }
 else{
  i <- i+1
 }
}

Or just use @Paul's solution... :P
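
Another way around the shifting indices, offered only as a sketch and not as part of this answer, is to walk the list from the end to the start. Removing an element then only moves elements that have already been visited. The list `frames` below is a stand-in for `mylist`.

```r
frames <- list(
  a = data.frame(x = 1:4),
  b = data.frame(x = 2:6),
  c = data.frame(x = 6:7),
  d = data.frame(x = 9)
)

# Walk from the last index to the first so a removal never shifts
# the positions of elements that are still to be visited
for (i in rev(seq_along(frames))) {
  if (nrow(frames[[i]]) <= 2) {
    frames[i] <- NULL   # drop element i in place
  }
}

names(frames)  # "a" "b"
```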

answered Sep 28 '22 06:09

Rcoster

Paul has provided an answer already, but your mistake has not been pointed out.

Your code has two problems. First, you need to supply a range to your loop:

for (i in 1:length(mylist))

or

for (i in seq_along(mylist))

Without this, your loop header evaluates to for (i in 4), so only one iteration runs: it removes element 4 and never looks at the earlier elements.
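
The difference is easy to see at the console. This is a minimal sketch using a stand-in list `frames` of four elements, not the data frames from the question.

```r
frames <- list(1, 2, 3, 4)   # stand-in list with 4 elements

length(frames)       # 4, a single number, so `for` runs once with i = 4
seq_along(frames)    # 1 2 3 4, one index per element

# With an empty list, 1:length(x) counts down instead of being empty
empty <- list()
1:length(empty)      # 1 0
seq_along(empty)     # integer(0), so a loop body would never run
```

The empty-list case is one reason `seq_along()` is often preferred over `1:length()`.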

However, if you fix that problem, another one emerges. The range of the loop is computed once, before the first iteration, but your list no longer has 4 elements after element 3 is removed. It only has 3 elements, while your i index still goes up to 4, resulting in a subscript out of bounds error.
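
A sketch that reproduces this failure, with the error caught by `tryCatch` so the script keeps running. The list `frames` is a stand-in for `mylist`, and the else branch from the question is left out.

```r
frames <- list(data.frame(x = 1:4), data.frame(x = 2:6),
               data.frame(x = 6:7), data.frame(x = 9))

result <- tryCatch({
  for (i in 1:length(frames)) {       # range 1:4 is computed once, up front
    if (nrow(frames[[i]]) <= 2) {
      frames <- frames[-i]            # the list shrinks, the range does not
    }
  }
  "finished"
}, error = function(e) conditionMessage(e))

result          # an error message about the subscript being out of bounds
length(frames)  # 3: the one-row data frame was never removed
```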

Therefore the simplest suggestion is the approach using apply, as described by @Paul.

Contrary to the assertion in another answer, it is also possible to achieve the same result with a for loop; only your approach needs to be slightly different:

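mylist2 <- list()  # start with an empty list to collect the data frames to keep
for (i in seq_along(mylist)) {
    if (nrow(mylist[[i]])>2)
    {
        # append, so that skipped elements leave no NULL gaps
        mylist2[[length(mylist2) + 1]] <- mylist[[i]]
    }
}
print(mylist2)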

Here you select the data frames with more than 2 rows and append them to a new list. sapply will probably be faster, though.

answered Sep 28 '22 08:09

Maxim.K
