Hyperparameter Tuning of Tensorflow Model

I've used Scikit-learn's GridSearchCV before to optimize the hyperparameters of my models, and I'm wondering whether a similar tool exists to optimize hyperparameters for TensorFlow (for instance, number of epochs, learning rate, sliding window size, etc.).

And if not, how can I implement a snippet that effectively runs all different combinations?

asked Jun 28 '17 12:06

mamafoku

2 Answers

Even though it does not seem to be explicitly documented (in version 1.2), the package tf.contrib.learn (included in TensorFlow) defines classifiers that are supposed to be compatible with scikit-learn... However, looking at the source, it seems you need to explicitly set the environment variable TENSORFLOW_SKLEARN (e.g. to "1") to actually get this compatibility. If this works, you can already use GridSearchCV (see this test case).
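As a minimal sketch of the environment-variable step: the snippet below sets TENSORFLOW_SKLEARN from Python rather than from the shell. It assumes the flag is read when tf.contrib.learn is imported, so it is set before the TensorFlow import; the import itself is left commented out because tf.contrib only exists in TensorFlow 1.x.

```python
import os

# Assumption: tf.contrib.learn checks this flag at import time,
# so it has to be set before TensorFlow is imported.
os.environ["TENSORFLOW_SKLEARN"] = "1"

# import tensorflow as tf  # TensorFlow 1.x only; tf.contrib was removed in 2.x
```

Setting the variable in the shell before launching Python should have the same effect.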

That said, there are a few alternatives. I don't know of any specific to TensorFlow, but hyperopt, Scikit-Optimize or SMAC3 should all be valid options. MOE and Spearmint look like they used to be good choices, but they no longer seem to be actively maintained.

Alternatively, you can look into a service like SigOpt (a company by the original author of MOE).

Edit

About running all possible combinations of parameters, the core logic, if you want to implement it yourself, is not really complicated. You can just define lists with the possible values for each parameter and then run through all the combinations with itertools.product. Something like:

from itertools import product

param1_values = [...]
param2_values = [...]
param3_values = [...]
for param1, param2, param3 in product(param1_values, param2_values, param3_values):
    run_experiment(param1, param2, param3)
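The loop above can be generalized a little so that parameters are named and the best combination is kept. This is only a sketch: `grid_search`, `toy_experiment` and the values in `param_grid` are hypothetical, and `toy_experiment` stands in for a real training run that returns a score where higher is better.

```python
from itertools import product


def grid_search(param_grid, run_experiment):
    """Run every combination in param_grid; return (best_params, best_score)."""
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = run_experiment(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_experiment(learning_rate, epochs, window_size):
    # Hypothetical stand-in for training a model and returning a validation score.
    return (-abs(learning_rate - 0.01)
            - abs(epochs - 20) / 100
            - abs(window_size - 5) / 10)


param_grid = {
    "learning_rate": [0.001, 0.01, 0.1],
    "epochs": [10, 20],
    "window_size": [3, 5, 7],
}
best_params, best_score = grid_search(param_grid, toy_experiment)
print(best_params, best_score)
```

The number of runs is the product of the list lengths (3 x 2 x 3 = 18 here), which is why the grid grows quickly as parameters are added.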

Note, however, that grid search can be prohibitively expensive to run in many cases, and even a plain random search over the parameter space will probably be more efficient (more about that in this publication).
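A random search can be sketched in much the same way using only the standard library. Everything named here is hypothetical (`random_search`, `param_space`, `toy_experiment`, the sampling ranges); each parameter is described by a function that draws one value, and a fixed number of trials replaces the full grid.

```python
import random


def random_search(param_space, run_experiment, n_trials, seed=0):
    """Sample n_trials random combinations; return (best_params, best_score)."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    best_params, best_score = None, float("-inf")
    for _ in range(n_trials):
        params = {name: sample(rng) for name, sample in param_space.items()}
        score = run_experiment(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_experiment(learning_rate, epochs, window_size):
    # Hypothetical stand-in for training a model and returning a validation score.
    return (-abs(learning_rate - 0.01)
            - abs(epochs - 20) / 100
            - abs(window_size - 5) / 10)


param_space = {
    "learning_rate": lambda rng: 10 ** rng.uniform(-4, -1),  # log-uniform draw
    "epochs": lambda rng: rng.randint(5, 50),
    "window_size": lambda rng: rng.choice([3, 5, 7]),
}
best_params, best_score = random_search(param_space, toy_experiment, n_trials=20)
print(best_params, best_score)
```

Unlike the grid, the cost here is fixed by `n_trials` regardless of how many parameters are searched.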

answered Sep 30 '22 16:09

jdehesa

Another viable (and documented) option for grid search with TensorFlow is Ray Tune. It's a scalable framework for hyperparameter tuning, aimed specifically at deep learning and reinforcement learning.

You can try out a fast tutorial here.

It also takes care of TensorBoard logging and efficient search algorithms (e.g., HyperOpt integration and HyperBand) in about 10 lines of Python.

from ray import tune

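def train_tf_model(config):
    # config holds one combination of values, e.g. config["alpha"] and config["beta"].
    # num_epochs, model and train_one_epoch are placeholders for your own training code.
    for i in range(num_epochs):
        accuracy = train_one_epoch(model)
        tune.report(acc=accuracy)  # report the metric to Tune after each epoch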

tune.run(train_tf_model,
         config={
            "alpha": tune.grid_search([0.2, 0.4, 0.6]),
            "beta": tune.grid_search([1, 2]),
         })

(Disclaimer: I contribute actively to this project!)

answered Sep 30 '22 18:09

richliaw
