Converting a grouped continous variable into rows in R

Tags:

r

linear-regression

I have a data frame with these values dummy vales and I want to do lm regression on them. One of the variables is a grouped continuous variable as shown below

df <- data.frame("y" = c(10, 11, 12, 13, 14),
                 "x" = as.factor(c("100-102", "103-105", "106-108", "109-111", "112-114")))

I want to regress y~x, One way is to replace the x factors with their mean numeric values. This is easily done using regular expression.

Another way is to create the additional rows and expand your dataset so it looks like this

data.frame("y" = c(10, 10, 10, 11, 11, 11......),
           "x" = c(100, 101, 102, 103, 104, 105......))

Is there a function that will do this?

I'm thinking of first creating additional variables like x1, x2, x3 and then use reshape2 package to convert the x columns to rows.

834

asked Feb 09 '13 22:02

MySchizoBuddy

2 Answers

A data.table solution. This should be really fast on large data.frame's as well.

require(data.table)
dt <- data.table(df, key="y")
dt[, list(x=seq(sub("-.*$", "", x), sub(".*-", "", x))),by=y]

If you have more columns and you don't want each combinations while splitting by column x, then this is the code to use:

require(data.table)
dt <- data.table(df)
# get all column names except "x"
key.cols <- setdiff(names(df), "x") 
# set the data.table columns to key.cols
setkeyv(dt, key.cols)
dt.out <- dt[, list(x=seq(sub("-.*$", "", x), sub(".*-", "", x))), by = key.cols]

This should give you what you expect.

129

answered Sep 30 '22 10:09

Arun

require(stringr)
require(foreach)

foreach(i=1:nrow(df), .combine=rbind) %do% {
  s <- as.numeric(str_extract_all(df$x[i], "[0-9]+")[[1]])
  data.frame(y=rep(df$y[i], s[2]-s[1]+1), x=seq(s[1], s[2]))  
}

If your data.frame is really big you can go along with %dopar%.

answered Sep 30 '22 09:09

redmode

Related questions
                            
                                How to compute confusion matrix on Iris dataset?
                            
                                R + httr and EC2 api authentication issues
                            
                                How can I omit interactions using stargazer or xtable?
                            
                                Send a text string containing double quotes to function
                            
                                Facet labelling with R
                            
                                ggplot year by year comparison
                            
                                Using a @ sign with roxygen2 [duplicate]
                            
                                Save matrix to .csv file in R without losing format
                            
                                How do POSIXct timezones work in R
                            
                                How to get OCI lib to work on red hat machine with R Oracle?
                            
                                Comparing values of two tables with respect to a tolerance in R
                            
                                How to shift bins in a rose diagram using package 'circular' in R
                            
                                How to use C api of xts package in Rcpp
                            
                                Loops with captions with knitr
                            
                                ggplot2 + Date structure using scale X
                            
                                Implementations of local regression and local likelihood methods
                            
                                How to read output from linux process status (ps) command in R?
                            
                                R data.table subsetting a subset
                            
                                What is the difference between cor and cor.test in R
                            
                                Vectorize for loop over data frame in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With