ray.rllib.utils.exploration.exploration.Exploration#
- class ray.rllib.utils.exploration.exploration.Exploration(action_space: gymnasium.spaces.Space, *, framework: str, policy_config: dict, model: ModelV2, num_workers: int, worker_index: int)[source]#
Implements an exploration strategy for Policies.
An Exploration takes model outputs, a distribution, and a timestep from the agent and computes an action to apply to the environment using an implemented exploration schema.
Methods
- param action_space:
The action space in which to explore.
Hook for preparations before policy.compute_actions() is called.
Returns a (possibly) exploratory action and its log-likelihood.
May add optimizer(s) to the Policy's own
optimizers
.Returns the current exploration state.
Handles necessary exploration logic at the end of an episode.
Handles necessary exploration logic at the beginning of an episode.
Handles post-processing of done episode trajectories.
Sets the Exploration object's state to the given values.