Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Converting a factor to numeric without losing information R (as.numeric() doesn't seem to work) [duplicate]

Tags:

r

r-factor

Possible Duplicate:
R - How to convert a factor to an integer\numeric in R without a loss of information

The following fact about the as.numeric() function has been brought to my attention

> blah<-c("4","8","10","15") > blah [1] "4"  "8"  "10" "15" > blah.new<-as.factor(blah) > blah.new [1] 4  8  10 15 Levels: 10 15 4 8 > blah.new1<-as.numeric(blah.new) > blah.new1 [1] 3 4 1 2 

When I convert a factor with levels 4, 8, 10, and 15 to a quantitative variable using as.numeric(), every number is converted to a ranking, and the original values are lost.

How do I take the vector 'blah.new' that has levels 10,15, 4, and 8, and convert it to the numeric values 10, 15, 4, and 8?

(This issue has arisen because of a dataset where a quantitative variable is read by read.table() to be a factor)

Thank you!!!!

*****Update: FIGURED IT OUT******

blah.new1<-as.numeric(as.character(blah.new)) 

However, I am wondering where in documentation for as.numeric() does it say that this function converts arguments into a list of rankings?

like image 919
Michael Avatar asked Sep 30 '11 14:09

Michael


People also ask

How do I convert a factor variable into a numeric in R?

There are two steps for converting factor to numeric: Step 1: Convert the data vector into a factor. The factor() command is used to create and modify factors in R. Step 2: The factor is converted into a numeric vector using as. numeric().

How does as numeric work in R?

as. numeric attempts to coerce its argument to numeric type (either integer or real). is. numeric returns TRUE if its argument is of type real or type integer and FALSE otherwise.


1 Answers

First, factor consists of indices and levels. This fact is very very important when you are struggling with factor.

For example,

> z <- factor(letters[c(3, 2, 3, 4)])  # human-friendly display, but internal structure is invisible > z [1] c b c d Levels: b c d  # internal structure of factor > unclass(z) [1] 2 1 2 3 attr(,"levels") [1] "b" "c" "d" 

here, z has 4 elements.
The index is 2, 1, 2, 3 in that order.
The level is associated with each index: 1 -> b, 2 -> c, 3 -> d.

Then, as.numeric converts simply the index part of factor into numeric.
as.character handles the index and levels, and generates character vector expressed by its level.

?as.numeric says that Factors are handled by the default method.

like image 197
kohske Avatar answered Oct 08 '22 05:10

kohske