K Means Clustering in R - ignoring row id

Tags:

r

I have data frame as follows:

X1      X2         X3
3   102.20000   26.07667 
4   115.00000   25.12500
5   36.70000    35.05545

Where column X1 denotes unique identifier for a row while X2, X3 are features

I want to perform scaling before performing k means clustering on a data,

 mydata <- scale(mydata)


  X1               X2            X3
-11715.6     -12.2200734    -9.7826627
-11714.6       0.5799266    -10.7343294
-11713.6      -77.7200734   -0.8038748

I don't want column X1 to scale but want it to remain on data frame. Any way to do it?

235

asked Jul 24 '15 09:07

1 Answers

You can tag the unique identifier on to the data frame rows via their rownames.

rownames(mydata) = mydata$X1
mydata$X1 = NULL
mydata = scale(mydata)

If you then want to perform k-means on the scaled data, I would just leave the row names as the identifiers to do any analysis. You can put them back whenever you want with mydata$X1 = rownames(mydata).

answered Sep 29 '22 14:09

Akhil Nair

Related questions
                            
                                How can I use stat_smooth to show one line, on a two factor figure?
                            
                                geom_rect() main variable not found
                            
                                R - Combine cor.mtest and p.adjust
                            
                                How to plot a lagged time series?
                            
                                R read.csv how to ignore carriage return?
                            
                                Removing rows/columns with only one element from a binary matrix
                            
                                Print/save Excel (.xlsx) sheet to PDF using R
                            
                                How can I suppress (not print) line numbers?
                            
                                Sequence index plots in ggplot2 using geom_tile( )
                            
                                Quickly split a large vector into chunks in R
                            
                                Error message using read_excel "Error: std::bad_alloc"
                            
                                Set ggmap boundary based on Latitude and Longitude
                            
                                Replacing strings with lookup table dplyr
                            
                                Add new line to text in UI of shiny app
                            
                                RcppArmadillo: Issue with memory usage
                            
                                Why does subsetting change with tbl_df in dlpyr?
                            
                                Dygraph's %>% replacing Dplyr's
                            
                                R RODBC Show all tables
                            
                                Why does geocode keep returning the wrong address but Google Maps works correctly
                            
                                Error creating R data.table with date-time POSIXlt

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

K Means Clustering in R - ignoring row id

Tags:

r

Sarit Adhikari

People also ask

1 Answers

Akhil Nair

Recent Activity

Donate For Us