ray.rllib.utils.exploration.stochastic_sampling.StochasticSampling.__init__#

StochasticSampling.__init__(action_space: gymnasium.spaces.Space, *, framework: str, model: ModelV2, random_timesteps: int = 0, **kwargs)[source]#

Initializes a StochasticSampling Exploration object.

Parameters:
  • action_space – The gym action space used by the environment.

  • framework – One of None, “tf”, “torch”.

  • model – The ModelV2 used by the owning Policy.

  • random_timesteps – The number of timesteps for which to act completely randomly. Only after this number of timesteps, actual samples will be drawn to get exploration actions.