ray.rllib.utils.exploration.random.Random.on_episode_start#

Random.on_episode_start(policy: Policy, *, environment: BaseEnv = None, episode: int = None, tf_sess: tf.Session | None = None)#

Handles necessary exploration logic at the beginning of an episode.

Parameters:
  • policy – The Policy object that holds this Exploration.

  • environment – The environment object we are acting in.

  • episode – The number of the episode that is starting.

  • tf_sess – In case of tf, the session object.