ray.rllib.utils.tf_utils.zero_logps_from_actions#
- ray.rllib.utils.tf_utils.zero_logps_from_actions(actions: numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor | dict | tuple) numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor [source]#
Helper function useful for returning dummy logp’s (0) for some actions.
- Parameters:
actions – The input actions. This can be any struct of complex action components or a simple tensor of different dimensions, e.g. [B], [B, 2], or {“a”: [B, 4, 5], “b”: [B]}.
- Returns:
A 1D tensor of 0.0 (dummy logp’s) matching the batch dim of
actions
(shape=[B]).