Removing rows from R data frame

I have the following data frame:

> str(df)
'data.frame':   3149 obs. of  9 variables:
 $ mkod : int  5029 5035 5036 5042 5048 5050 5065 5071 5072 5075 ...
 $ mad  : Factor w/ 65 levels "Akgün Kasetçilik         ",..: 58 29 59 40 56 11 33 34 19 20 ...
 $ yad  : Factor w/ 44 levels "BAKUGAN","BARBIE",..: 1 1 1 1 1 1 1 1 1 1 ...
 $ donem: int  201101 201101 201101 201101 201101 201101 201101 201101 201101 201101 ...
 $ sayi : int  201101 201101 201101 201101 201101 201101 201101 201101 201101 201101 ...
 $ plan : int  2 2 3 2 2 2 7 3 2 7 ...
 $ sevk : int  2 2 3 2 2 2 6 3 2 7 ...
 $ iade : int  0 0 3 1 2 2 6 2 2 3 ...
 $ satis: int  2 2 0 1 0 0 0 1 0 4 ...

I want to remove 21 specific rows from this data frame.

> a <- df[df$plan==0 & df$sevk==0,]
> nrow(a)
[1] 21

So when I remove those 21 rows, I will have a new data frame with 3149 - 21 = 3128 rows. I found the following solution:

> b <- df[df$plan!=0 | df$sevk!=0,]
> nrow(b)
[1] 3128

My solution above uses a modified logical expression (!= instead of == and | instead of &). Other than modifying the original logical expression, how can I obtain the new data frame without those 21 rows? I need something like this:

> df[-a,] #does not work

EDIT (especially for the downvoters; I hope they understand why I need an alternative solution): I asked for a different solution because I'm writing a long script with various variable assignments (like a in my example) in various parts of the code. So when I need to remove rows later in the code, I don't want to go back and write the inverse of the logical expressions inside a-like assignments. That's why df[-a,] is more usable for me.

asked Oct 27 '11 11:10

Mehper C. Palavuzlar

2 Answers

Just negate your logical subscript:

a <- df[!(df$plan==0 & df$sevk==0),]
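To avoid rewriting the condition later in a long script, one option is to store the logical condition itself once and negate it where needed. A minimal sketch on a toy data frame; the toy values and the name to_remove are assumptions for illustration, not taken from the question:

```r
# Toy data standing in for the question's df (values are made up)
df <- data.frame(plan = c(2, 0, 3, 0, 1),
                 sevk = c(2, 0, 3, 1, 0))

# Store the logical condition once, rather than only the subsetted rows
to_remove <- df$plan == 0 & df$sevk == 0

a <- df[to_remove, ]    # rows matching the condition
b <- df[!to_remove, ]   # all other rows, no rewritten condition needed

# Every row lands in exactly one of the two subsets
nrow(a) + nrow(b) == nrow(df)
```

Note that NA values in plan or sevk can make entries of the mask NA, and an NA in a logical row index produces an all-NA row in both subsets, so this sketch assumes the two columns have no missing values.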

answered Sep 19 '22 14:09

Joshua Ulrich

You can use the row names to specify a "complementary" data frame. It's easier if the row names are numeric (the default, where they match the row positions):

df[-as.numeric(rownames(a)),]

But more generally you can use:

df[setdiff(rownames(df),rownames(a)),]
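A small sketch comparing the two forms on a toy data frame (made-up values, not the question's data). It also suggests why the setdiff form is the more general one: negative indices are positional, so the as.numeric form assumes the row names still equal the row positions. The names df2, a2, b1 to b4 are hypothetical:

```r
# Toy data standing in for the question's df (values are made up)
df <- data.frame(plan = c(2, 0, 3, 0, 1),
                 sevk = c(2, 0, 3, 0, 0))
a <- df[df$plan == 0 & df$sevk == 0, ]   # row names "2" and "4"

# Default row names equal row positions, so both forms keep rows 1, 3, 5
b1 <- df[-as.numeric(rownames(a)), ]
b2 <- df[setdiff(rownames(df), rownames(a)), ]
identical(rownames(b1), rownames(b2))

# After an earlier subset, row names no longer match positions
df2 <- df[-1, ]                          # row names "2".."5" at positions 1..4
a2  <- df2[df2$plan == 0 & df2$sevk == 0, ]

# Removing by name still drops the intended rows
b3 <- df2[setdiff(rownames(df2), rownames(a2)), ]

# Removing by position drops positions 2 and 4, which are the wrong rows here
b4 <- df2[-as.numeric(rownames(a2)), ]
```

In the second half, b3 keeps the rows named "3" and "5", while b4 keeps exactly the rows that were meant to be removed.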

answered Sep 17 '22 14:09

James
