Try Ray with $100 credit — Start now

ray.rllib.models.distributions.Distribution.kl#

abstract Distribution.kl(other: Distribution, **kwargs) → numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor[source]#

The KL-divergence between two distributions.

Parameters:

other – The other distribution.
**kwargs – Forward compatibility placeholder.

Returns:

The KL-divergence between the two distributions.