Getting subset of of data based on multiple column values

Tags:

I am trying to remove rows based on whether or not columns 2 and 3 contain 0's. I keep getting very strange results. I tried to write it without subset initially because I read somewhere that subset should only be used for small amounts of data because of the memory cost. Neither attempt worked for me however. Can someone explain what I did wrong?

df <- data.frame(val1=c(1,2,3), val2=c(4,0,5), val3=c(3,0,6))
subset(df,df>0,c(2,3))
data.frame(df[df[,c(2,3)]!=0])

starting dataframe:

   val1   val2   val3
1  1       4       3
1  2       0       0
3  3       5       6

end goal:

   val1   val2   val3
1  1       4       3
3  3       5       6

649

asked Sep 29 '15 18:09

Rilcon42

1 Answers

Using the subset, we create a logical index based on the 2nd and third columns.

subset(df, subset=!(val2==0|val3==0))

as subset argument works on columns and not on matrices. We can also use [ instead of subset.

df[!(df[,2]==0|df[,3]==0),]

Regarding the second answer in the OP's post

df[,c(2,3)]!=0 #returns a matrix
#      val2  val3
#[1,]  TRUE  TRUE
#[2,] FALSE FALSE
#[3,]  TRUE  TRUE

For subsetting rows, we need only a single logical index per each row.

Another option is rowSums (if you want to remove rows that are 0 for both column 2 and 3)

 df[rowSums(df[2:3])!=0,]

i.e.

df$val3[2] <- 2

will return all the rows with rowSums while the other methods return rows 1 and 3.

The equivalent option with subset is &

subset(df, !(val2==0 & val3==0))

156

answered Nov 15 '22 03:11

akrun

Related questions
                            
                                Set color for NA Value with spplot in R
                            
                                R - Scraping an HTML table with rvest when there are missing <tr> tags
                            
                                R: Insert multiple rows (variable number) in data frame
                            
                                Get the (t-1) data within groups
                            
                                Understanding tree structure in R gbm package
                            
                                R:Count the daily number of a variable distinguish per ID
                            
                                geom_bar: color gradient and cross hatches (using gridSVG), transparency issue
                            
                                Error: please supply starting values
                            
                                R data.table multi column recode/sub-assign [duplicate]
                            
                                Get aspect ratio for lat-long plots
                            
                                alignment and offsets in rollapply
                            
                                Kruskal - Wallis p-value matrix for data subsets with R
                            
                                Shiny: Switching between reactive data sets with rhandsontable
                            
                                Quantmod Oscillators
                            
                                Histogram with a jittery rug
                            
                                Is there a way to make the density() function in R use counts vs. probability?
                            
                                Using zoo's rollsum within data.table on timestamped transactions
                            
                                R - creating dataframe from colMeans function
                            
                                Make a 2D legend for a plot - bi-variate choropleth maps
                            
                                RStudio : Rook does not work?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Getting subset of of data based on multiple column values

Tags:

r

subset

Rilcon42

People also ask

1 Answers

akrun

Recent Activity

Donate For Us