I am new to reinforcement learning and I am working on the RL of a custom environment in OpenAI gym with RLlib. When I create a custom environment, do I need to specify the number of episodes in the __init__()
method? ALso, when I train the agent with
for _ in range(10):
trainer.train()
how many time steps are taken in one iteration? is it equal to number of episodes defined in the custom environment? Thank you.
I think what you need to set for the max number of steps in one episode is the hyperparameter `horizon'
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With