R caret model evaluation with alternate performance metric

I'm using R's caret package to do some grid search and model evaluation. I have a custom evaluation metric that is a weighted average of absolute error. Weights are assigned at the observation level.

X <- c(1,1,2,0,1) #feature 1
w <- c(1,2,2,1,1) #weights
Y <- 1:5 #target, continuous

#assume I run a model using X as features and Y as target and get a vector of predictions

mymetric <- function(predictions, target, weights){
  # weighted mean absolute error: weights are given per observation
  v <- sum(abs(target - predictions) * weights) / sum(weights)
  return(v)
}
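
For instance, with a hypothetical vector of predictions (preds is made up for illustration):

preds <- c(1.2, 2.4, 2.9, 3.8, 5.1)   # stand-in for the model's predictions
mymetric(preds, Y, w)                 # weighted mean absolute error, roughly 0.214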

Here an example is given of how to use summaryFunction to define a custom evaluation metric for caret's train(). To quote:

The trainControl function has an argument called summaryFunction that specifies a function for computing performance. The function should have these arguments:

data is a reference for a data frame or matrix with columns called obs and pred for the observed and predicted outcome values (either numeric data for regression or character values for classification). Currently, class probabilities are not passed to the function. The values in data are the held-out predictions (and their associated reference values) for a single combination of tuning parameters. If the classProbs argument of the trainControl object is set to TRUE, additional columns in data will be present that contain the class probabilities. The names of these columns are the same as the class levels.

lev is a character string that has the outcome factor levels taken from the training data. For regression, a value of NULL is passed into the function.

model is a character string for the model being used (i.e. the value passed to the method argument of train).
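
For concreteness, a summaryFunction built along these lines (ignoring the weights, since they are not part of data) might look like the sketch below; maeSummary is just a name I made up, and it would be passed via trainControl(summaryFunction = maeSummary) together with metric = "MAE" and maximize = FALSE in train():

maeSummary <- function(data, lev = NULL, model = NULL) {
  out <- mean(abs(data$obs - data$pred))  # unweighted: no weights are available here
  names(out) <- "MAE"
  out
}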

I cannot quite figure out how to pass the observation weights to summaryFunction.

asked Apr 14 '14 by ADJ


People also ask

What is the use of the trainControl() function?

The trainControl function generates parameters that further control how models are created. Its method argument selects the resampling method: "boot", "cv", "LOOCV", "LGOCV", "repeatedcv", "timeslice", "none" and "oob".
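
As a minimal sketch (maeSummary is the hypothetical summary function sketched above, not part of caret):

library(caret)

ctrl <- trainControl(method = "cv",              # 5-fold cross-validation
                     number = 5,
                     summaryFunction = maeSummary)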

What is tuneGrid in R?

The tuneGrid parameter lets us specify exactly which values the tuning parameters will take, while tuneLength only limits how many default candidate values are generated per parameter.
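
For example, with a method such as glmnet (tuning parameters alpha and lambda), an explicit grid could be supplied as in this sketch (the feature matrix and outcome are assumed to exist):

myGrid <- expand.grid(alpha  = c(0, 0.5, 1),
                      lambda = 10^seq(-3, 0, length.out = 5))
# fit <- train(x = X_mat, y = Y, method = "glmnet", tuneGrid = myGrid)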

How does varImp work in R?

The varImp function tracks the changes in model statistics, such as the GCV, for each predictor and accumulates the reduction in the statistic when each predictor's feature is added to the model. This total reduction is used as the variable importance measure.
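
In practice (a sketch, assuming fit is an object returned by train()):

imp <- varImp(fit)   # model-specific importance where available
print(imp)
plot(imp)            # plot the importance scores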

What does train() do in R?

This function sets up a grid of tuning parameters for a number of classification and regression routines, fits each model and calculates a resampling-based performance measure.
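
A minimal sketch (dat is a hypothetical data frame with a numeric target Y):

fit <- train(Y ~ ., data = dat,
             method = "lm",
             trControl = trainControl(method = "cv", number = 5))
fit$results   # resampled performance for each candidate parameter setting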


1 Answer

You can't pass in weights directly to the summary function, which is an oversight since you can pass them to the modeling function. If the underlying model accommodates weights, they are used to produce the predicted values.
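
For example, case weights can be supplied to the fitting routine through train()'s weights argument (a sketch using lm, which supports weighted least squares; dat is a hypothetical data frame holding the features, target and weights):

fit <- train(Y ~ X, data = dat,
             method = "lm",
             weights = dat$w,      # used by the model fit, not by the summary function
             trControl = trainControl(method = "cv", number = 5))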

I'll add that to the next release.

Max

answered Sep 23 '22 by topepo