ray.rllib.utils.exploration.exploration.Exploration.on_episode_end#

Exploration.on_episode_end(policy: Policy, *, environment: BaseEnv = None, episode: int = None, tf_sess: tf.Session | None = None)[source]#

Handles necessary exploration logic at the end of an episode.

Parameters:
  • policy – The Policy object that holds this Exploration.

  • environment – The environment object we are acting in.

  • episode – The number of the episode that is starting.

  • tf_sess – In case of tf, the session object.