Use merge() to update a data frame with values from a second data frame

I'm trying to figure out how to use merge() to update a data frame.

Take for example the data frame foo

foo <- data.frame(index=c('a', 'b', 'c', 'd'), value=c(100, 101, NA, NA))

Which has the following values

  index value
1     a   100
2     b   101
3     c    NA
4     d    NA

And the data frame bar

bar <- data.frame(index=c('c', 'd'), value=c(200, 201))

Which has the following values:

  index value
1     c   200
2     d   201

When I run the following merge() function to update the values for c and d

merge(foo, bar, by='index', all=TRUE)

It results in this output:

  index value.x value.y
1     a     100      NA
2     b     101      NA
3     c      NA     200
4     d      NA     201

I'd like merge() to avoid creating, in this specific example, value.x and value.y, and to keep only the original value column. Is there a simple way of doing this?

asked Jul 06 '10 20:07

andrewj

1 Answer

Doesn't merge() always bind columns together? Does replace() work?

foo$value <- replace(foo$value, foo$index %in% bar$index, bar$value)

or use match(), which pairs each row of bar with its row in foo by index, so the row order of bar is taken into account

foo$value[match(bar$index, foo$index)] <- bar$value
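Two caveats may be worth spelling out. The replace() version appears to assume that the rows of bar come in the same order as the matching rows of foo, and the match() version will raise an error if bar contains an index that foo lacks, because match() returns NA for it. Below is a minimal sketch, not a definitive solution, that guards against both. The names pos, found and merged are made up for illustration, and bar is deliberately built in reverse order. The second half shows one possible way to stay with merge(), assuming a non-NA value.y should win over value.x.

```r
# Example data from the question; bar is deliberately in a different row order
foo <- data.frame(index = c('a', 'b', 'c', 'd'), value = c(100, 101, NA, NA))
bar <- data.frame(index = c('d', 'c'), value = c(201, 200))

# --- Option 1: update in place with match() ---
# Row position in foo of each bar$index (NA where foo has no such index)
pos <- match(bar$index, foo$index)
found <- !is.na(pos)

# Overwrite only the rows that were found; unmatched rows of bar are skipped
updated <- foo
updated$value[pos[found]] <- bar$value[found]

# --- Option 2: merge(), then collapse value.x / value.y into one column ---
# all.x = TRUE keeps every row of foo; bar's values arrive as value.y
merged <- merge(foo, bar, by = 'index', all.x = TRUE)

# Assumption: prefer bar's value when present, otherwise keep foo's value
merged$value <- ifelse(is.na(merged$value.y), merged$value.x, merged$value.y)
merged <- merged[, c('index', 'value')]

print(updated)
print(merged)
```

Both results should contain a single value column with 100, 101, 200, 201 for a to d. Note that merge() sorts the result by index by default, so Option 1 is probably the safer choice if the original row order of foo matters.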

answered Oct 29 '22 22:10

apeescape
