I was working on CartPole-v0 provided by openai gym. I noticed that my program never scores greater 200. At some point it is stuck at 200 and no more. I was wondering if there is any configuration I might have missed in openai gym. Has anybody found this problem?
CartPole-v0
gives a reward of 1.0
for every step your agent is "alive".
The environment is registered with these lines of code:
register(
id='CartPole-v0',
entry_point='gym.envs.classic_control:CartPoleEnv',
max_episode_steps=200,
reward_threshold=195.0,
)
which, in the current version of the repository, can be found here.
That max_episode_steps=200
means that an episode automatically terminates after 200
steps. So, the maximum score you can get is 200
.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With