
Why is episode done after 200 time steps (Gym environment MountainCar)?

When using the MountainCar-v0 environment from OpenAI Gym in Python, the value done becomes True after 200 time steps. Why is that? Since the goal state hasn't been reached, the episode shouldn't be done.

import gym

env = gym.make('MountainCar-v0')
env.reset()
for t in range(300):
    env.render()
    state, reward, done, info = env.step(env.action_space.sample())  # random action
    print(t, done)  # done becomes True after 200 steps

I want to run the step method until the car reaches the flag and then break out of the loop. Is this possible? Something like this:

n_episodes = 10
for i in range(n_episodes):
    env.reset()
    done = False  # reset the flag at the start of each episode
    while not done:
        env.render()
        state, reward, done, _ = env.step(env.action_space.sample())
asked Mar 14 '17 by needRhelp

People also ask

What is observation space in gym?

The observation_space defines the structure as well as the legitimate values for the observation of the state of the environment. The observation can be different things for different environments.

What is a gym wrapper?

A wrapper overrides how the environment processes observations, rewards, and actions. Three classes provide this functionality; for example, gym.ObservationWrapper is used to modify the observations returned by the environment. To do this, override the observation method of the environment.
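To illustrate the idea, here is a plain-Python sketch of the observation-wrapper pattern. It is not gym's actual ObservationWrapper class, just an imitation of the mechanism: the wrapper intercepts every observation the inner environment returns and transforms it before the agent sees it. The class and method names below are hypothetical.

```python
class ObservationScaler:
    """Sketch of the observation-wrapper pattern (not gym's real class):
    intercept and transform every observation from the wrapped env."""

    def __init__(self, env, scale):
        self.env = env
        self.scale = scale

    def observation(self, obs):
        # The transformation you would put in an overridden observation()
        return obs * self.scale

    def reset(self):
        return self.observation(self.env.reset())

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        return self.observation(obs), reward, done, info
```

RewardWrapper and ActionWrapper follow the same shape, hooking the reward and the action respectively instead of the observation.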


2 Answers

The current newest version of gym force-stops the environment after 200 steps even if you don't use env.monitor. To avoid this, unwrap the time limit with env = gym.make("MountainCar-v0").env
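To make the "run until the flag" loop concrete, here is a minimal sketch of the pattern. It uses a tiny stand-in environment (StubEnv, invented here) so it runs without gym installed; with gym you would replace StubEnv() with gym.make("MountainCar-v0").env, which strips the TimeLimit wrapper, and pass a real action instead of None.

```python
import random

class StubEnv:
    """Stand-in with gym's reset/step interface (4-tuple step),
    so the episode-loop pattern is runnable without gym."""

    def reset(self):
        self.position = 0.0
        return self.position

    def step(self, action):
        self.position += random.uniform(0.05, 0.1)
        done = self.position >= 0.5  # "flag reached"
        return self.position, -1.0, done, {}

env = StubEnv()
n_episodes = 3
for i in range(n_episodes):
    env.reset()
    done = False                 # reset the flag each episode
    steps = 0
    while not done:
        state, reward, done, info = env.step(None)
        steps += 1
    print(f"episode {i} finished after {steps} steps")
```

Because the unwrapped environment has no step limit, the while loop ends only when the environment itself reports done.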

answered Oct 11 '22 by Scitator


Copied from https://github.com/openai/gym/wiki/FAQ:

Environments are intended to have various levels of difficulty, in order to benchmark the ability of reinforcement learning agents to solve them. Many of the environments are beyond the current state of the art, so don't expect to solve all of them. (If you do, please apply).

If you want to experiment with a variant of an environment that behaves differently, you should give it a new name so that you won't erroneously compare your agent running on an easy variant to someone else's agent running on the original environment. For instance, the MountainCar environment is hard partly because there's a limit of 200 timesteps after which it resets to the beginning. Successful agents must solve it in less than 200 timesteps. For testing purposes, you could make a new environment MountainCarMyEasyVersion-v0 with different parameters by adapting one of the calls to register found in gym/gym/envs/__init__.py:

gym.envs.register(
    id='MountainCarMyEasyVersion-v0',
    entry_point='gym.envs.classic_control:MountainCarEnv',
    max_episode_steps=250,      # MountainCar-v0 uses 200
    reward_threshold=-110.0,
)
env = gym.make('MountainCarMyEasyVersion-v0')

Because these environment names are only known to your code, you won't be able to upload it to the scoreboard.
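The max_episode_steps mechanism behind this is simple: gym.make wraps the registered environment in a TimeLimit wrapper that counts steps and forces done once the limit is hit, whether or not the goal was reached. A rough sketch of the idea (not gym's actual implementation):

```python
class TimeLimit:
    """Sketch of gym's TimeLimit idea: force done after max_episode_steps,
    regardless of whether the underlying env reached its goal."""

    def __init__(self, env, max_episode_steps):
        self.env = env
        self.max_episode_steps = max_episode_steps
        self.elapsed = 0

    def reset(self):
        self.elapsed = 0
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.elapsed += 1
        if self.elapsed >= self.max_episode_steps:
            done = True  # episode cut off, goal or not
        return obs, reward, done, info
```

This is why MountainCar-v0 reports done at step 200 and why accessing .env (the unwrapped environment) removes the limit.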

answered Oct 11 '22 by catherio