Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Error in running h2o.ensemble

I am getting error while running h2o.ensemble in R. This is the error output

[1] "Cross-validating and training base learner 1: h2o.glm.wrapper"
  |======================================================================| 100%
[1] "Cross-validating and training base learner 2: h2o.randomForest.1"
  |==============                                                        |  19%

Got exception 'class java.lang.AssertionError', with msg 'null'
java.lang.AssertionError
    at hex.tree.DHistogram.scoreMSE(DHistogram.java:323)
    at hex.tree.DTree$DecidedNode$FindSplits.compute2(DTree.java:441)
    at hex.tree.DTree$DecidedNode.bestCol(DTree.java:421)
    at hex.tree.DTree$DecidedNode.<init>(DTree.java:449)
    at hex.tree.SharedTree.makeDecided(SharedTree.java:489)
    at hex.tree.SharedTree$ScoreBuildOneTree.onCompletion(SharedTree.java:436)
    at jsr166y.CountedCompleter.__tryComplete(CountedCompleter.java:425)
    at jsr166y.CountedCompleter.tryComplete(CountedCompleter.java:383)
    at water.MRTask.compute2(MRTask.java:683)
    at water.H2O$H2OCountedCompleter.compute(H2O.java:1069)
    at jsr166y.CountedCompleter.exec(CountedCompleter.java:468)
    at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
    at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
    at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
    at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)


Error: 'null'

This is my code that i am using. I am using this script for regression problem. "sales" column is for output prediction. Rest of the columns are for training.

response <- "Sales"
predictors <- setdiff(names(train), response)

h2o.glm.1 <- function(..., alpha = 0.0) h2o.glm.wrapper(..., alpha = alpha)
h2o.glm.2 <- function(..., alpha = 0.5) h2o.glm.wrapper(..., alpha = alpha)
h2o.glm.3 <- function(..., alpha = 1.0) h2o.glm.wrapper(..., alpha = alpha)
h2o.randomForest.1 <- function(..., ntrees = 200, nbins = 50, seed = 1) h2o.randomForest.wrapper(..., ntrees = ntrees, nbins = nbins, seed = seed)
h2o.randomForest.2 <- function(..., ntrees = 200, sample_rate = 0.75, seed = 1) h2o.randomForest.wrapper(..., ntrees = ntrees, sample_rate = sample_rate, seed = seed)
h2o.gbm.1 <- function(..., ntrees = 100, seed = 1) h2o.gbm.wrapper(..., ntrees = ntrees, seed = seed)
h2o.gbm.6 <- function(..., ntrees = 100, col_sample_rate = 0.6, seed = 1) h2o.gbm.wrapper(..., ntrees = ntrees, col_sample_rate = col_sample_rate, seed = seed)
h2o.gbm.8 <- function(..., ntrees = 100, max_depth = 3, seed = 1) h2o.gbm.wrapper(..., ntrees = ntrees, max_depth = max_depth, seed = seed)
h2o.deeplearning.1 <- function(..., hidden = c(500,500), activation = "Rectifier", epochs = 50, seed = 1)  h2o.deeplearning.wrapper(..., hidden = hidden, activation = activation, seed = seed)
h2o.deeplearning.6 <- function(..., hidden = c(50,50), activation = "Rectifier", epochs = 50, seed = 1)  h2o.deeplearning.wrapper(..., hidden = hidden, activation = activation, seed = seed)
h2o.deeplearning.7 <- function(..., hidden = c(100,100), activation = "Rectifier", epochs = 50, seed = 1)  h2o.deeplearning.wrapper(..., hidden = hidden, activation = activation, seed = seed)

print("learning starts ")
#### Customized base learner library
learner <- c("h2o.glm.wrapper",
             "h2o.randomForest.1", "h2o.randomForest.2",
             "h2o.gbm.1", "h2o.gbm.6", "h2o.gbm.8",
             "h2o.deeplearning.1", "h2o.deeplearning.6", "h2o.deeplearning.7")

metalearner <- "h2o.glm.wrapper"
#
#Train with new library:
fit <- h2o.ensemble(
  x =  predictors, 
  y= response,
  training_frame=train,
  family = "gaussian", 
  learner = learner, 
  metalearner = metalearner,
  cvControl = list(V = 5))

All columns of train data are numeral. I am using R version 3.2.2.

like image 264
saurabh agarwal Avatar asked Dec 09 '15 05:12

saurabh agarwal


2 Answers

The updated way to do this is

h2o.init(nthreads=-1,enable_assertions = FALSE)
like image 110
Andrew Troiano Avatar answered Oct 29 '22 19:10

Andrew Troiano


As suggested by Spencer Aiello

Setting the assertion to FALSE in the h2o initialisation might do the trick

h2o.init(nthreads=-1, assertion = FALSE)

Make sure that you properly shutdown/restart h2o before applying the changes

h2o.shutdown()
h2o.init(nthreads=-1, assertion = FALSE)
like image 23
Lod Avatar answered Oct 29 '22 20:10

Lod