GridSearch with Keras Neural Networks

Tags:

I'm trying to perform parameters tuning for a neural network built with keras. This is my code with a comment on the line that causes the error:

Click to copy

from sklearn.cross_validation import StratifiedKFold, cross_val_score
from sklearn import grid_search
from sklearn.metrics import classification_report
import multiprocessing

from keras.models import Sequential
from keras.layers import Dense
from sklearn.preprocessing import LabelEncoder
from keras.utils import np_utils
from keras.wrappers.scikit_learn import KerasClassifier
import numpy as np


def tuning(X_train,Y_train,X_test,Y_test):

  in_size=X_train.shape[1]
  num_cores=multiprocessing.cpu_count()
  model = Sequential()
  model.add(Dense(in_size, input_dim=in_size, init='uniform', activation='relu'))
  model.add(Dense(8, init='uniform', activation='relu'))
  model.add(Dense(1, init='uniform', activation='sigmoid'))
  model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

  batch_size = [10, 20, 40, 60, 80, 100]
  epochs = [10,20]
  param_grid = dict(batch_size=batch_size, nb_epoch=epochs)

  k_model = KerasClassifier(build_fn=model, verbose=0)
  clf = grid_search.GridSearchCV(estimator=k_model, param_grid=param_grid, cv=StratifiedKFold(Y_train, n_folds=10, shuffle=True, random_state=1234),
                   scoring="accuracy", verbose=100, n_jobs=num_cores)

  clf.fit(X_train, Y_train) #ERROR HERE

  print("Best parameters set found on development set:")
  print()
  print(clf.best_params_)
  print()
  print("Grid scores on development set:")
  print()
  for params, mean_score, scores in clf.grid_scores_:
    print("%0.3f (+/-%0.03f) for %r"
        % (mean_score, scores.std() * 2, params))
  print()
  print("Detailed classification report:")
  print()
  print("The model is trained on the full development set.")
  print("The scores are computed on the full evaluation set.")
  print()
  y_true, y_pred = Y_test, clf.predict(X_test)
  print(classification_report(y_true, y_pred))
  print()

And this is the errors report:

Click to copy

 clf.fit(X_train, Y_train)
  File "/usr/local/lib/python2.7/dist-packages/sklearn/grid_search.py", line 804, in fit
    return self._fit(X, y, ParameterGrid(self.param_grid))
  File "/usr/local/lib/python2.7/dist-packages/sklearn/grid_search.py", line 553, in _fit
    for parameters in parameter_iterable
  File "/usr/local/lib/python2.7/dist-packages/sklearn/externals/joblib/parallel.py", line 800, in __call__
    while self.dispatch_one_batch(iterator):
  File "/usr/local/lib/python2.7/dist-packages/sklearn/externals/joblib/parallel.py", line 658, in dispatch_one_batch
    self._dispatch(tasks)
  File "/usr/local/lib/python2.7/dist-packages/sklearn/externals/joblib/parallel.py", line 566, in _dispatch
    job = ImmediateComputeBatch(batch)
  File "/usr/local/lib/python2.7/dist-packages/sklearn/externals/joblib/parallel.py", line 180, in __init__
    self.results = batch()
  File "/usr/local/lib/python2.7/dist-packages/sklearn/externals/joblib/parallel.py", line 72, in __call__
    return [func(*args, **kwargs) for func, args, kwargs in self.items]
  File "/usr/local/lib/python2.7/dist-packages/sklearn/cross_validation.py", line 1531, in _fit_and_score
    estimator.fit(X_train, y_train, **fit_params)
  File "/usr/local/lib/python2.7/dist-packages/keras/wrappers/scikit_learn.py", line 135, in fit
    **self.filter_sk_params(self.build_fn.__call__))
TypeError: __call__() takes at least 2 arguments (1 given)

Am I missing something? The grid search goes well with random forests, svm and logistic regression. I only have problems with Neural Networks.

417

asked Jan 05 '17 12:01

Stefano Sandonà

1 Answers

Here the error indicates that the build_fn needs to have 2 arguments as indicated from the # of parameters from param_grid.

So you need to explicitly define an new function and use that as build_fn=make_model

Click to copy

def make_model(batch_size, nb_epoch):
    model = Sequential()
    model.add(Dense(in_size, input_dim=in_size, init='uniform', activation='relu'))
    model.add(Dense(8, init='uniform', activation='relu'))
    model.add(Dense(1, init='uniform', activation='sigmoid'))
    model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
    return model

Also check keras/examples/mnist_sklearn_wrapper.py where GridSearchCV is used for hyper-parameter search.

184

answered Sep 19 '22 18:09

indraforyou

Related questions
                            
                                Color Range Python
                            
                                Python numpy.random.normal
                            
                                panda dataframe to ordered dictionary
                            
                                How to select related in django model so it wont generate a lot of subqueries
                            
                                How to create a second None in Python? Making a singleton object where the id is always the same
                            
                                Python lxml etree.tostring() returns empty string running on mod_wsgi
                            
                                Creating PyPi package - Could not find a version that satisfies the requirement iso8601 [duplicate]
                            
                                How to add edge in mesh using Maya Python API 2.0
                            
                                ConcatOp : Dimensions of inputs should match
                            
                                Spark Dataframes: Skewed Partition after Join
                            
                                Pandas idiomatic way to custom fillna
                            
                                Reshaping Pandas Dataframe with Grouped Data (Long to Wide)
                            
                                Django: Update multiple objects attributes
                            
                                isinstance not working for Decimal in AppEngine
                            
                                Pandas read_csv, reading a boolean with missing values specified as an int
                            
                                Removing text while processing the image
                            
                                uWSGI NOT working with .ini file
                            
                                Python: understanding (None for g in g if (yield from g) and False)
                            
                                Efficiently check if an element occurs at least n times in a list
                            
                                why can't I import geopandas?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

GridSearch with Keras Neural Networks

Tags:

python

machine-learning

keras

Stefano Sandonà

People also ask

1 Answers

indraforyou

Recent Activity

Donate For Us