
Hyperparameter tuning for TensorFlow

I am searching for a hyperparameter tuning package for code written directly in TensorFlow (not Keras or TFLearn). Could you make some suggestions?

Mark asked May 25 '17 13:05

People also ask

Which is the best for hyperparameter tuning?

Hyperopt uses Bayesian optimization algorithms to choose the best hyperparameters for a given model, and it can optimize large-scale models with hundreds of hyperparameters.
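For a feel of its API, here is a minimal, self-contained sketch that tunes a single toy parameter (the quadratic objective is made up for illustration; fmin, hp.uniform, and tpe.suggest are Hyperopt's actual entry points):

from hyperopt import fmin, tpe, hp

# Hyperopt minimizes the value returned by the objective; for a real
# model you would return a validation loss here instead.
def objective(x):
    return (x - 3) ** 2

best = fmin(fn=objective,
            space=hp.uniform("x", -5, 5),  # search range for the parameter
            algo=tpe.suggest,              # Tree-structured Parzen Estimator
            max_evals=50)                  # number of evaluations to run
print(best)  # e.g. {'x': 2.98}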

How do I tune CNN hyperparameters?

Typical hyperparameters to tune are the number of neurons, the activation function, the optimizer, the learning rate, the batch size, and the number of epochs. A second step is to tune the number of layers, a degree of freedom that conventional (non-deep) algorithms do not have, since network depth can significantly affect accuracy. A sketch of such a search space follows below.
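To make that concrete, here is a sketch of such a search space written with Ray Tune (the library recommended in the answer below); every range is an arbitrary illustration, not a recommendation:

import ray.tune as tune

# Example CNN search space covering the hyperparameters listed above;
# all values are illustrative placeholders.
config = {
    "num_layers": tune.grid_search([2, 3, 4]),            # network depth
    "num_neurons": tune.grid_search([64, 128, 256]),      # units per layer
    "activation": tune.grid_search(["relu", "tanh"]),
    "learning_rate": tune.grid_search([1e-2, 1e-3, 1e-4]),
    "batch_size": tune.grid_search([32, 64]),
}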

How do you tune a hyperparameter with a TensorBoard?

Start TensorBoard and click on "HParams" at the top. The left pane of the dashboard provides filtering capabilities that are active across all the views in the HParams dashboard: filter which hyperparameters/metrics are shown, and filter which hyperparameter/metric values are shown.
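The dashboard is populated from runs logged with the tensorboard.plugins.hparams API; a minimal sketch (the hyperparameter names and the accuracy value are made-up examples):

import tensorflow as tf
from tensorboard.plugins.hparams import api as hp

# Log one run's hyperparameter values plus a metric so the run
# shows up in the HParams dashboard (values are illustrative only).
hparams = {"learning_rate": 1e-3, "num_units": 128}
with tf.summary.create_file_writer("logs/run-1").as_default():
    hp.hparams(hparams)  # record which hyperparameter values this run used
    tf.summary.scalar("accuracy", 0.93, step=1)

Then launch with tensorboard --logdir logs and open the HParams tab.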


1 Answer

You can try out Ray Tune, a simple library for scaling hyperparameter search. I mainly use it for TensorFlow model training, but it's agnostic to the framework and works seamlessly with PyTorch, Keras, etc. Here's the docs page: ray.readthedocs.io/en/latest/tune.html

You can use it to run distributed versions of state-of-the-art algorithms such as HyperBand or Bayesian Optimization in about 10 lines of code.

As an example, to run 4 parallel evaluations at a time:

import ray
import ray.tune as tune
import tensorflow as tf
from ray.tune.schedulers import HyperBandScheduler


def train_model(config, reporter):  # Tune injects the reporter for logging metrics
    # build_tf_model, some_loss_function, and get_statistics are
    # placeholders for your own model, loss, and metric code
    model = build_tf_model(config["alpha"], config["beta"])
    loss = some_loss_function(model)
    train_op = tf.train.AdamOptimizer().minimize(loss)

    with tf.Session() as sess:
        sess.run(tf.global_variables_initializer())
        for i in range(20):
            sess.run(train_op)
            stats = get_statistics()
            reporter(timesteps_total=i,
                     mean_accuracy=stats["accuracy"])

ray.init(num_cpus=4)
tune.run(train_model,
    name="my_experiment",
    stop={"mean_accuracy": 100},
    config={
        "alpha": tune.grid_search([0.2, 0.4, 0.6]),
        "beta": tune.grid_search([1, 2])
    },
    scheduler=HyperBandScheduler(reward_attr="mean_accuracy"))
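The 3 x 2 grid above defines six trials; with ray.init(num_cpus=4) and Tune's default of one CPU per trial, four of them run in parallel at a time.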

You also don't need to change your code if you want to run this script on a cluster.
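For example (a minimal sketch; the exact argument depends on your Ray version), connecting to an existing cluster only changes the ray.init call:

import ray

# Connect to a running Ray cluster instead of starting a local one.
# Recent versions use address="auto"; releases contemporary with this
# answer used ray.init(redis_address="<head-ip>:6379") instead.
ray.init(address="auto")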

Disclaimer: I work on this project - let me know if you have any feedback!

richliaw answered Sep 30 '22 03:09