ray.rllib.utils.torch_utils.huber_loss#
- ray.rllib.utils.torch_utils.huber_loss(x: numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor, delta: float = 1.0) numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor [source]#
Computes the huber loss for a given term and delta parameter.
Reference: https://en.wikipedia.org/wiki/Huber_loss Note that the factor of 0.5 is implicitly included in the calculation.
- Formula:
L = 0.5 * x^2 for small abs x (delta threshold) L = delta * (abs(x) - 0.5*delta) for larger abs x (delta threshold)
- Parameters:
x – The input term, e.g. a TD error.
delta – The delta parmameter in the above formula.
- Returns:
The Huber loss resulting from
x
anddelta
.