ray.rllib.policy.policy.Policy.get_exploration_state#

Policy.get_exploration_state() → Dict[str, numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor][source]#

Returns the state of this Policy’s exploration component.

Returns:: Serializable information on the self.exploration object.