I am looking to utilise the caret package with a metric that is not one of the default options. For the example below I use the Metrics package. I have read all of the relevant questions on Stack Overflow as well as the guide on the caret website, but am still receiving errors.
In the example below I wish to use Mean Absolute Error.
First, I create a summary function:
maefunction <- function(data, lev = NULL, model = NULL) {
  require(Metrics)
  MAE <- mae(data[, "obs"], data[, "pred"])
  out <- c(MAE)
  out
}
Now I pass the function to trainControl and call train:
library(caret)
GBM <- train(train$result ~ ., data = train, method = "gbm",
             trControl = trainControl(summaryFunction = maefunction),
             metric = MAE)
I receive the following error message:
Error in list_to_dataframe(res, attr(.data, "split_labels"), .id, id_as_factor) :
Results must be all atomic, or all data frames
In addition: Warning messages:
1: In if (metric %in% c("Accuracy", "Kappa")) stop(paste("Metric", :
the condition has length > 1 and only the first element will be used
2: In if (metric == "ROC" & !ctrl$classProbs) stop("train()'s use of ROC codes requires class probabilities. See the classProbs option of trainControl()") :
the condition has length > 1 and only the first element will be used
3: In if (!(metric %in% perfNames)) { :
the condition has length > 1 and only the first element will be used
4: In train.default(x, y, weights = w, ...) :
The metric "4" was not in the result set. will be used instead.The metric "0.5" was not in the result set. will be used instead.
The caret package has several functions that attempt to streamline the model building and evaluation process. The train function can be used to evaluate, using resampling, the effect of model tuning parameters on performance.
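For example (a minimal sketch that is not part of the original question; the data set and grid values are purely illustrative, and the gbm and mlbench packages are assumed to be installed):

library(caret)
library(mlbench)                                    # for the BostonHousing data
data(BostonHousing)

ctrl <- trainControl(method = "cv", number = 10)    # 10-fold cross-validation
gbmGrid <- expand.grid(n.trees = c(50, 100),
                       interaction.depth = c(1, 3),
                       shrinkage = 0.1,
                       n.minobsinnode = 10)
set.seed(1)
gbmFit <- train(medv ~ ., data = BostonHousing, method = "gbm",
                tuneGrid = gbmGrid, trControl = ctrl, verbose = FALSE)
gbmFit$results    # resampled performance for each row of gbmGrid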
A couple of excerpts from the caret documentation might help. For classification, ROC curve analysis is conducted on each predictor. For two-class problems, a series of cutoffs is applied to the predictor data to predict the class. The sensitivity and specificity are computed for each cutoff and the ROC curve is computed.
Alternatively, you can define a custom summary function that combines both twoClassSummary and prSummary (a current favorite), which provides the following possible evaluation metrics: AUROC, Sens, Spec, AUPRC, Precision, Recall and F, any of which can be used as the metric argument. A sketch of such a function is shown below.
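This is a minimal sketch, not code from the original answer: it simply concatenates caret's two built-in summaries, and assumes the MLmetrics package (used internally by prSummary) is installed.

library(caret)

comboSummary <- function(data, lev = NULL, model = NULL) {
  # concatenate the named vectors returned by the two built-in summaries:
  # ROC, Sens, Spec from twoClassSummary and AUC, Precision, Recall, F from prSummary
  c(twoClassSummary(data, lev, model), prSummary(data, lev, model))
}

ctrl <- trainControl(summaryFunction = comboSummary,
                     classProbs = TRUE,    # both summaries need class probabilities
                     method = "cv", number = 5)

Any of the returned names (for example "ROC", "Sens", "AUC", "Precision", "F") can then be passed as the metric argument of train().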
I think that you have to use a named vector (see the example below). I didn't explicitly say that in the documentation, so I will update that section.
Max
library(caret)
library(Metrics)     # provides mae()
library(mlbench)
data(BostonHousing)

maeSummary <- function(data, lev = NULL, model = NULL) {
  out <- mae(data$obs, data$pred)
  names(out) <- "MAE"    # the summary function must return a *named* vector
  out
}

mControl <- trainControl(summaryFunction = maeSummary)

marsGrid <- expand.grid(degree = 1, nprune = (1:10) * 2)

set.seed(1)
earthFit <- train(medv ~ .,
                  data = BostonHousing,
                  method = "earth",
                  tuneGrid = marsGrid,
                  metric = "MAE",
                  maximize = FALSE,
                  trControl = mControl)
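Once the model is fit, the resampled MAE for each candidate in marsGrid can be inspected directly from the returned train object, and the winning tuning values (lowest MAE, since maximize = FALSE) are stored in bestTune:

earthFit$results     # one row per degree/nprune combination, with MAE and MAESD columns
earthFit$bestTune    # the degree/nprune pair with the lowest resampled MAE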