Add ID column by group [duplicate]

I want to create a unique ID in R based on two columns of latitude and longitude, so that duplicated locations share the same cluster ID.

For example:

    LAT      LONG      Cluster_ID
    13.5330  -15.4180  1
    13.5330  -15.4180  1
    13.5330  -15.4180  1
    13.5330  -15.4180  1
    13.5330  -15.4170  2
    13.5330  -15.4170  2
    13.5330  -15.4170  2
    13.5340  -14.9350  3
    13.5340  -14.9350  3
    13.5340  -15.9170  4
    13.3670  -14.6190  5

asked Nov 26 '12 14:11

jonestats

2 Answers

Here's one way using interaction.

    d <- read.table(text='LAT LONG
    13.5330 -15.4180
    13.5330 -15.4180
    13.5330 -15.4180
    13.5330 -15.4180
    13.5330 -15.4170
    13.5330 -15.4170
    13.5330 -15.4170
    13.5340 -14.9350
    13.5340 -14.9350
    13.5340 -15.9170
    13.3670 -14.6190', header=TRUE)

    d <- transform(d, Cluster_ID = as.numeric(interaction(LAT, LONG, drop=TRUE)))

    #       LAT    LONG Cluster_ID
    # 1  13.533 -15.418          2
    # 2  13.533 -15.418          2
    # 3  13.533 -15.418          2
    # 4  13.533 -15.418          2
    # 5  13.533 -15.417          3
    # 6  13.533 -15.417          3
    # 7  13.533 -15.417          3
    # 8  13.534 -14.935          4
    # 9  13.534 -14.935          4
    # 10 13.534 -15.917          1
    # 11 13.367 -14.619          5

EDIT: Incorporated @Spacedman's suggestion to supply drop=TRUE to interaction.
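
The IDs above follow the order of the factor levels that interaction builds rather than the order in which locations first appear, which is why row 10 gets ID 1. If first-appearance numbering matters, one option is to renumber the factor with match. This is a minimal sketch and not part of the original answer; the column name Cluster_ID_ordered is made up for illustration.

```r
# Same sample data as in the answer above
d <- read.table(text = "LAT LONG
13.5330 -15.4180
13.5330 -15.4180
13.5330 -15.4180
13.5330 -15.4180
13.5330 -15.4170
13.5330 -15.4170
13.5330 -15.4170
13.5340 -14.9350
13.5340 -14.9350
13.5340 -15.9170
13.3670 -14.6190", header = TRUE)

# One factor level per distinct (LAT, LONG) pair that occurs in the data
f <- interaction(d$LAT, d$LONG, drop = TRUE)

# unique(f) keeps the pairs in order of first appearance,
# so match() numbers them 1, 2, 3, ... in that order
d$Cluster_ID_ordered <- match(f, unique(f))
```

With the sample data this should number the locations from top to bottom, so the first four rows share one ID and the last row gets the highest one.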

answered Sep 28 '22 18:09

Matthew Plourde

The data:

    dat <- read.table(text="
    LAT        LONG
    13.5330 -15.4180
    13.5330 -15.4180
    13.5330 -15.4180
    13.5330 -15.4180
    13.5330 -15.4170
    13.5330 -15.4170
    13.5330 -15.4170
    13.5340 -14.9350
    13.5340 -14.9350
    13.5340 -15.9170
    13.3670 -14.6190", header = TRUE)

These commands create an ID variable that starts at 1 and numbers the locations in order of first appearance:

    comb <- with(dat, paste(LAT, LONG))
    within(dat, Cluster_ID <- match(comb, unique(comb)))

The output:

          LAT    LONG Cluster_ID
    1  13.533 -15.418          1
    2  13.533 -15.418          1
    3  13.533 -15.418          1
    4  13.533 -15.418          1
    5  13.533 -15.417          2
    6  13.533 -15.417          2
    7  13.533 -15.417          2
    8  13.534 -14.935          3
    9  13.534 -14.935          3
    10 13.534 -15.917          4
    11 13.367 -14.619          5
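
Note that within returns a modified copy and does not change dat, so the result has to be assigned to keep the new column. The same paste and match idea can also be wrapped in a small function for any set of grouping columns. The sketch below is one possible way to do that; the function name add_group_id and its arguments are hypothetical and not from the original answer.

```r
# Same sample data as in the answer above
dat <- read.table(text = "LAT LONG
13.5330 -15.4180
13.5330 -15.4180
13.5330 -15.4180
13.5330 -15.4180
13.5330 -15.4170
13.5330 -15.4170
13.5330 -15.4170
13.5340 -14.9350
13.5340 -14.9350
13.5340 -15.9170
13.3670 -14.6190", header = TRUE)

# Hypothetical helper: give each distinct combination of the columns in
# `cols` an integer ID, numbered in order of first appearance.
add_group_id <- function(df, cols, id_name = "Cluster_ID") {
  # Build one text key per row by pasting the grouping columns together
  key <- do.call(paste, c(df[cols], sep = " "))
  # Position of each key among the unique keys is its group ID
  df[[id_name]] <- match(key, unique(key))
  df
}

# Assign the result so the new column is kept
dat <- add_group_id(dat, c("LAT", "LONG"))
```

Because the key is built from the text form of the values, two coordinates are treated as the same location only if they print identically.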

answered Sep 28 '22 16:09

Sven Hohenstein
