Exploration.on_episode_start(policy: Policy, *, environment: BaseEnv = None, episode: int = None, tf_sess: tf.Session | None = None)[source]#

Handles necessary exploration logic at the beginning of an episode.

  • policy – The Policy object that holds this Exploration.

  • environment – The environment object we are acting in.

  • episode – The number of the episode that is starting.

  • tf_sess – In case of tf, the session object.