ray.rllib.callbacks.callbacks.RLlibCallback.on_episode_start#

Callback run right after an Episode has been started.

This method gets called after a SingleAgentEpisode or MultiAgentEpisode instance has been reset with a call to env.reset() by the EnvRunner.

Parameters:

episode – The just started (after env.reset()) SingleAgentEpisode or MultiAgentEpisode object.
env_runner – Reference to the EnvRunner running the env and episode.
metrics_logger – The MetricsLogger object inside the env_runner. Can be used to log custom metrics during env/episode stepping.
env – The gym.Env or gym.vector.Env object running the started episode.
env_index – The index of the sub-environment that is about to be reset (within the vector of sub-environments of the BaseEnv).
rl_module – The RLModule used to compute actions for stepping the env. In single-agent mode, this is a simple RLModule, in multi-agent mode, this is a MultiRLModule.
kwargs – Forward compatibility placeholder.