 

How to implement a custom environment in keras-rl / OpenAI Gym?

I'm a complete newbie to Reinforcement Learning and have been searching for a framework/module to easily navigate this treacherous terrain. In my search I've come across two modules: keras-rl and OpenAI Gym.

I can get both of them to work on the examples shared on their wikis, but they come with predefined environments and have little or no information on how to set up my own custom environment.

I would be really thankful if anyone could point me towards a tutorial, or just explain to me how I can set up a non-game environment.

Manipal King asked Jun 10 '17



1 Answer

I've been working on these libraries for some time and can share some of my experiments.

As an example of a custom environment, let us first consider a text environment: https://github.com/openai/gym/blob/master/gym/envs/toy_text/hotter_colder.py

For a custom environment, a few things need to be defined:

  1. Constructor (__init__ method)
  2. Action space
  3. Observation space (see https://github.com/openai/gym/tree/master/gym/spaces for all available gym spaces; a space is a kind of data structure describing the valid values)
  4. _seed method (not sure that it's mandatory)
  5. _step method accepting an action as a param and returning the observation (the state after the action), the reward (for the transition to the new state), done (a boolean flag), and optional additional info
  6. _reset method that implements the logic of starting a fresh episode (a minimal skeleton combining all of these is sketched below)
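
For concreteness, here is a minimal sketch of such an environment. The class name, the hidden-target logic, and all numbers are purely illustrative, and the underscore-prefixed method names follow the old gym API used at the time (newer Gym releases use step/reset/seed without the leading underscore):

    import gym
    import numpy as np
    from gym import spaces
    from gym.utils import seeding

    class GuessDirectionEnv(gym.Env):
        """Toy non-game environment: move a value towards a hidden target."""

        def __init__(self):
            self.action_space = spaces.Discrete(2)                 # 0: decrease, 1: increase
            self.observation_space = spaces.Box(low=-10.0, high=10.0, shape=(1,))
            self._seed()
            self._reset()

        def _seed(self, seed=None):
            self.np_random, seed = seeding.np_random(seed)
            return [seed]

        def _step(self, action):
            assert self.action_space.contains(action)
            self.state += 1.0 if action == 1 else -1.0             # apply the action
            reward = -abs(self.state - self.target)                # closer to the target = higher reward
            done = abs(self.state - self.target) < 0.5             # end the episode near the target
            return np.array([self.state]), reward, done, {}

        def _reset(self):
            self.target = self.np_random.uniform(-5.0, 5.0)        # hidden goal of the episode
            self.state = 0.0
            return np.array([self.state])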

Optionally, you can create a _render method with something like

    from io import StringIO
    import sys

    def _render(self, mode='human', **kwargs):
        # write either to a string buffer ('ansi') or to stdout ('human')
        outfile = StringIO() if mode == 'ansi' else sys.stdout
        outfile.write('State: ' + repr(self.state) + ' Action: ' + repr(self.action_taken) + '\n')
        return outfile

Also, for better code flexibility, you can define the logic of your reward in a _get_reward method and the changes to the state resulting from taking an action in a _take_action method.
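
A sketch of that split, reusing the illustrative environment above (_take_action and _get_reward are just helper names following the suggestion, not part of the gym API):

    def _take_action(self, action):
        # mutate the internal state according to the chosen action
        self.action_taken = action                                 # remembered for _render
        self.state += 1.0 if action == 1 else -1.0

    def _get_reward(self):
        # keep the reward logic in one place so it is easy to tweak
        return -abs(self.state - self.target)

    def _step(self, action):
        self._take_action(action)
        reward = self._get_reward()
        done = abs(self.state - self.target) < 0.5
        return np.array([self.state]), reward, done, {}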

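Once the class behaves like a gym.Env, keras-rl agents can train on it directly. A rough sketch, assuming the illustrative environment above, a discrete action space, and keras-rl's DQNAgent (network size and hyperparameters are arbitrary):

    from keras.models import Sequential
    from keras.layers import Dense, Flatten
    from keras.optimizers import Adam
    from rl.agents.dqn import DQNAgent
    from rl.memory import SequentialMemory
    from rl.policy import BoltzmannQPolicy

    env = GuessDirectionEnv()
    nb_actions = env.action_space.n

    # small feed-forward Q-network; input shape follows keras-rl's
    # (window_length,) + observation_shape convention
    model = Sequential([
        Flatten(input_shape=(1,) + env.observation_space.shape),
        Dense(16, activation='relu'),
        Dense(nb_actions, activation='linear'),
    ])

    dqn = DQNAgent(model=model, nb_actions=nb_actions,
                   memory=SequentialMemory(limit=10000, window_length=1),
                   policy=BoltzmannQPolicy())
    dqn.compile(Adam(lr=1e-3), metrics=['mae'])
    dqn.fit(env, nb_steps=5000, verbose=1)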
Andriy Lazorenko answered Sep 19 '22