ray.rllib.models.distributions.Distribution.kl#
- abstract Distribution.kl(other: Distribution, **kwargs) numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor [source]#
The KL-divergence between two distributions.
- Parameters:
other – The other distribution.
**kwargs – Forward compatibility placeholder.
- Returns:
The KL-divergence between the two distributions.