I need to read the ''wdbc.data' in the following data folder: http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/ Doing this in R is easy using command read.csv but as the header is missing how can I add it? I have the information but don't know how to do this and I'd prefer do not edit the data file.

You can do the following: Load the data: <pre class="prettyprint"><code>test <- read.csv( "http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.data", header=FALSE) </code></pre> Note that the default value of the <code>header</code> argument for <code>read.csv</code> is <code>TRUE</code> so in order to get all lines you need to set it to <code>FALSE</code>. Add names to the different columns in the data.frame <pre class="prettyprint"><code>names(test) <- c("A","B","C","D","E","F","G","H","I","J","K") </code></pre> or alternative and faster as I understand (not reloading the entire dataset): <pre class="prettyprint"><code>colnames(test) <- c("A","B","C","D","E","F","G","H","I","J","K") </code></pre>

You can also use <code>colnames</code> instead of names if you have <code>data.frame</code> or <code>matrix</code>

You can also solve this problem by creating an array of values and assigning that array: <pre class="prettyprint"><code>newheaders <- c("a", "b", "c", ... "x") colnames(data) <- newheaders </code></pre>

How to add header to a dataset in R?

Q: How do I give a column name to a Dataframe in R?

Method 1: using colnames() method colnames() method in R is used to rename and replace the column names of the data frame in R. The columns of the data frame can be renamed by specifying the new column names as a vector. The new name replaces the corresponding old name of the column in the data frame.

3 Answers

You can do the following:

Load the data:

test <- read.csv(
          "http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.data",
          header=FALSE)

Note that the default value of the header argument for read.csv is TRUE so in order to get all lines you need to set it to FALSE.

Add names to the different columns in the data.frame

names(test) <- c("A","B","C","D","E","F","G","H","I","J","K")

or alternative and faster as I understand (not reloading the entire dataset):

colnames(test) <- c("A","B","C","D","E","F","G","H","I","J","K")

179

answered Oct 21 '22 08:10

Jochem

You can also use colnames instead of names if you have data.frame or matrix

answered Oct 21 '22 07:10

user1436187

You can also solve this problem by creating an array of values and assigning that array:

newheaders <- c("a", "b", "c", ... "x")
colnames(data) <- newheaders

answered Oct 21 '22 09:10

William Lage

Related questions
                            
                                Is there an R equivalent of python's string `format` function?
                            
                                How to filter a table's row based on an external vector?
                            
                                3 Dimensional Array Names in R
                            
                                How Do I connect two coordinates with a line using Leaflet in R
                            
                                pass character strings to ggplot2 within a function
                            
                                How to efficiently partially apply a function in R?
                            
                                using stat_function and facet_wrap together in ggplot2 in R
                            
                                How to manage multiple package locations (folders) in R?
                            
                                How can I paste 100000 without it being shortened to 1e+05? [duplicate]
                            
                                How to skip an error in a loop
                            
                                non-numeric argument to binary operator [closed]
                            
                                Including Bibliography in RMarkdown document with use of the knitcitations
                            
                                R function prcomp fails with NA's values even though NA's are allowed
                            
                                How do I position two legends independently in ggplot
                            
                                How to add a factor column to dataframe based on a conditional statement from another column?
                            
                                Centering a plot within a fluidRow in Shiny
                            
                                Downloading Yahoo stock prices in R
                            
                                Subset based on variable column name
                            
                                Selecting non-consecutive columns in R tables
                            
                                turning off case sensitivity in r

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to add header to a dataset in R?

Tags:

r

statistics

dataset

blueSurfer

People also ask

3 Answers

Jochem

user1436187

William Lage

Recent Activity

Donate For Us