Data frames with mixed data types

Tags:

type-conversion

dataframe

r

I have been using R for a little while, but I am still struggling with factors and data frames. Here's my question.

I am trying to pre-allocate a data frame composed of several columns of different types, as follows:

cb <- data.frame(S=character(1000), I=numeric(1000),
                 A=as.Date(rep(0,1000), origin = "1900-01-01"),
                 SD=as.POSIXct(rep(0,1000), origin = "1900-01-01 00:00:00"),
                 CC=numeric(1000), stringsAsFactors=FALSE)

which gets me the data frame types that I want (output of str(cb)):

'data.frame':   1000 obs. of  5 variables:
 $ S : chr  "" "" "" "" ...
 $ I : num  0 0 0 0 0 0 0 0 0 0 ...
 $ A : Date, format: "1900-01-01" "1900-01-01" "1900-01-01" "1900-01-01" ...
 $ SD: POSIXct, format: "1900-01-01" "1900-01-01" "1900-01-01" "1900-01-01" ...
 $ CC: num  0 0 0 0 0 0 0 0 0 0 ...

When I assign the first item in the data frame, CC and I become characters:

cb[1, ] <- c("ABCD", 4, "2005-12-12", "2008-04-03 20:30", 3)

output of str(cb):

'data.frame':   1000 obs. of  5 variables:
 $ S : chr  "ABCD" "" "" "" ...
 $ I : chr  "4" "0" "0" "0" ...
 $ A : Date, format: "2005-12-12" "1900-01-01" "1900-01-01" "1900-01-01" ...
 $ SD: POSIXct, format: "2008-04-03 20:30:00" "1900-01-01 00:00:00" "1900-01-01 00:00:00" "1900-01-01 00:00:00" ...
 $ CC: chr  "3" "0" "0" "0" ...

which makes it rather unusable for my purposes.

When I omit stringsAsFactors=FALSE in the data.frame definition, I (obviously) get a different error message (having set warn to 2):

Error in `[<-.factor`(`*tmp*`, iseq, value = "ABCD") : 
  (converted from warning) invalid factor level, NAs generated

which I understand but I am not sure how to overcome either.

What am I doing wrong? How can I make sure to keep the numeric type for columns I and CC? Thanks so much for your help.

Cheers

372

asked Apr 15 '13 21:04

bdu

1 Answer

You can't mix types in a vector, so your vector is being coerced to character.

R> c("ABCD", 4, "2005-12-12", "2008-04-03 20:30", 3)
[1] "ABCD"             "4"               
[3] "2005-12-12"       "2008-04-03 20:30"
[5] "3"
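
As a quick check of that coercion, here is a minimal sketch in base R (the name row_vec is only for illustration):

```r
# c() builds an atomic vector, so all elements are promoted to one common type
row_vec <- c("ABCD", 4, "2005-12-12", "2008-04-03 20:30", 3)

class(row_vec)    # "character"
row_vec[2]        # "4": the number was turned into a string
```

So by the time the value reaches the data frame, the 4 and the 3 are already strings.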

[<-.data.frame then coerces the numeric columns of your data.frame to character so that each column still holds a single type, though I find it a bit inconsistent that it doesn't also convert the Date/POSIXt fields to character as well...
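
One plausible reason for that difference, as far as I can tell from base R's behaviour, is that Date and POSIXct vectors have their own assignment methods that convert the incoming value, while plain numeric vectors do not. The sketch below illustrates this on bare vectors; the names d and n are only for illustration:

```r
# A Date vector converts a character value on assignment and stays a Date
d <- as.Date(rep(0, 3), origin = "1900-01-01")
d[1] <- "2005-12-12"
class(d)    # "Date"

# A plain numeric vector has no such conversion, so it becomes character
n <- numeric(3)
n[1] <- "4"
class(n)    # "character"
```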

You can mix types in a list. This replacement works because data.frames are lists underneath.

cb[1, ] <- list("ABCD", 4, "2005-12-12", "2008-04-03 20:30", 3)
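
To confirm that the list version keeps the column types, a small self-contained check could look like this. It is only a sketch: it uses 3 rows instead of 1000 to keep the output short, and col_classes is a name made up for the example.

```r
# Same layout as in the question, but with 3 rows
cb <- data.frame(S = character(3), I = numeric(3),
                 A = as.Date(rep(0, 3), origin = "1900-01-01"),
                 SD = as.POSIXct(rep(0, 3), origin = "1900-01-01 00:00:00"),
                 CC = numeric(3), stringsAsFactors = FALSE)

# A list keeps each element's own type, so nothing is coerced to character
cb[1, ] <- list("ABCD", 4, "2005-12-12", "2008-04-03 20:30", 3)

# First class of every column (POSIXct columns carry two classes)
col_classes <- vapply(cb, function(x) class(x)[1], character(1))
col_classes
```

Here I and CC should still be reported as numeric.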

When you look back at your code later, it might make more sense to replace one row of your data.frame with a 1-row data.frame:

cb[1, ] <- data.frame("ABCD", 4, "2005-12-12", "2008-04-03 20:30", 3,
                      stringsAsFactors=FALSE)
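
If you would rather not rely on the strings being converted for you, one variation is to build the 1-row data.frame with explicit types. This is a sketch under my own assumptions: the explicit as.Date() and as.POSIXct() calls, the tz = "UTC" argument, and the name new_row are not part of the original code.

```r
# Pre-allocated frame, 3 rows for brevity
cb <- data.frame(S = character(3), I = numeric(3),
                 A = as.Date(rep(0, 3), origin = "1900-01-01"),
                 SD = as.POSIXct(rep(0, 3), origin = "1900-01-01 00:00:00"),
                 CC = numeric(3), stringsAsFactors = FALSE)

# One-row data frame whose columns already have the intended types
new_row <- data.frame(S = "ABCD", I = 4,
                      A = as.Date("2005-12-12"),
                      SD = as.POSIXct("2008-04-03 20:30", tz = "UTC"),  # tz is an assumption
                      CC = 3, stringsAsFactors = FALSE)

cb[1, ] <- new_row
str(cb)
```

Naming the columns in new_row mostly helps readability; the values are still placed by position.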

193

answered Oct 01 '22 06:10

Joshua Ulrich
