I have a read large csv file into a data frame. Data in the csv file are from multiple web sites representing user information. For example here is the structure of the data frame. <pre class="prettyprint"><code>user_id, number_of_logins, number_of_images, web 001, 34, 3, aa.com 002, 4, 4, aa.com 034, 3, 3, aa.com 001, 12, 4, bb.com 002, 1, 3, bb.com 034, 2, 2, cc.com </code></pre> as you can see once I bring the data into the data frame user_id is no longer a unique id and this causes all the analysis. I am trying to add another columns prior to <code>user_id</code> which is something like <code>"generated_uid"</code> and pretty much use the index of the <code>data.frame</code> to be filled by that column. What's the best way to accomplish this.

You can add a sequence of numbers very easily with <pre class="prettyprint"><code>data$ID <- seq.int(nrow(data)) </code></pre> If you are already using <code>library(tidyverse)</code>, you can use <pre class="prettyprint"><code>data <- tibble::rowid_to_column(data, "ID") </code></pre>

Add an index (numeric ID) column to large data frame [duplicate]

Tags:

dataframe

r

I have a read large csv file into a data frame. Data in the csv file are from multiple web sites representing user information. For example here is the structure of the data frame.

user_id, number_of_logins, number_of_images, web 001, 34, 3, aa.com 002, 4, 4, aa.com 034, 3, 3, aa.com 001, 12, 4, bb.com 002, 1, 3, bb.com 034, 2, 2, cc.com

as you can see once I bring the data into the data frame user_id is no longer a unique id and this causes all the analysis. I am trying to add another columns prior to user_id which is something like "generated_uid" and pretty much use the index of the data.frame to be filled by that column. What's the best way to accomplish this.

617

asked May 07 '14 13:05

add-semi-colons

1 Answers

You can add a sequence of numbers very easily with

data$ID <- seq.int(nrow(data))

If you are already using library(tidyverse), you can use

data <- tibble::rowid_to_column(data, "ID")

178

answered Sep 28 '22 13:09

MrFlick

Related questions
                            
                                Exception handling in R [closed]
                            
                                Convert a vector into a list, each element in the vector as an element in the list
                            
                                Remove facet_wrap labels completely
                            
                                R spreading multiple columns with tidyr [duplicate]
                            
                                How do you specifically order ggplot2 x axis instead of alphabetical order? [duplicate]
                            
                                Suppress output of a function
                            
                                Converting year and month ("yyyy-mm" format) to a date?
                            
                                Fitting a density curve to a histogram in R
                            
                                Working with dictionaries/lists in R
                            
                                In R, how to find the standard error of the mean?
                            
                                ggplot2 plot area margins?
                            
                                Read all files in a folder and apply a function to each data frame
                            
                                How to deal with "data of class uneval" error from ggplot2?
                            
                                Gantt charts with R [closed]
                            
                                Intelligent point label placement in R
                            
                                Add (insert) a column between two columns in a data.frame
                            
                                How do I extract just the number from a named number (without the name)?
                            
                                Does R have an assert statement as in python?
                            
                                How to left align text in annotate from ggplot2
                            
                                Count number of rows by group using dplyr

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With